SlideShare a Scribd company logo
1 of 26
DIGITIZATION REVEALED
101 crash course on the fundamentals of digitizing
archival collections from start to finish
MARINA GEORGIEVA
Nevada Library Association Annual Conference
October 14, 2018
Las Vegas, NV
ABOUT ME
 Graduated from University of Wisconsin – Milwaukee
 MLIS with concentration in Information technologies
 Digital libraries
 Graphic and Web design
 Metadata
 Information architecture and UX design
 Visiting Digital Collections Librarian at University of
Nevada – Las Vegas
 Professional passion
 Project management
 Digitization
 Metadata design and management
Digitizing archival manuscripts on
PhaseOne camera (2018)
More about my work at
www.marina-expertise.com
TOPICS
Collection Inventory and Assessment
Funding
Digitization technology
Team
Workflow
Platform
Standards and sustainability
COLLECTION INVENTORY AND ASSESSMENT
WHY DIGITIZE?
• For access
• For preservation
• On demand
• For specific occasion/exhibit
ANALYZE THE COLLECTION
• Priority list
• Research value
• Collection size
• Collection level of processing
(Finding Aid)
• Reuse of existing Finding Aid
• Condition of materials
• Format of materials
• Has it been digitized as part of
another project?
ASSESSMENT
• Large scale or boutique
approach?
• Digitize all or selected materials?
• Who does selection?
• In-house capacities
• Outsourcing options
FUNDING 1|3
1
Funding types
2
Grant-funding opportunities
INTERNAL FUNDING
• Types of digitization projects
• Reducing archival backlog
• Preservation
• Access (esp. high demand collections)
• Scan on demand (for class or event)
• Staffing
• Existing staff
• Students assistants
• Interns
• Volunteers
GRANT FUNDING
• Types of digitization projects
• Specific project that fits the grant scope
• High priority collection that requires more resources and
is eligible for grant
• High research value collections
• Collections in desperate need for preservation
• Staffing
• Specially hired and trained project staff
• Specially trained students assistants
FUNDING 2|3
1
Funding types
FUNDING 3|3
2
Grant-funding opportunities
National Endowment for the Humanities
https://www.neh.gov/grants/listing?keywords=digitization
Library Services and Technology Act
https://nsla.libguides.com/2018LSTA
Institute of Museum and Library Services
https://www.imls.gov/grants/apply-grant/available-grants
National Historical Publications & Records Commission
https://www.archives.gov/nhprc/apply/eligibility.html
DIGITIZATION TECHNOLOGY 1|3
1
Types of digitization technologies
2
Selecting the best fit
DIGITIZATION TECHNOLOGY 2|3
1
Types of digitization technologies
Flatbed scanner
Large format scanner Transparencies scanner
Book scanner Camera systems
Microfilm scanner
DIGITIZATION TECHNOLOGY 3|3
2
Selecting the best fit
CONSIDER THE TECHNOLOGY
• Equipment you already have
• Equipment you need to purchase
• Equipment you need to outsource
CONSIDER THE
COLLECTION
• Format of archival materials
• Microfilms
• Transparencies
• Reflective materials
• Oversized materials
• Manuscripts
• Books
CONSIDER THE PROJECT
• Scale
• Scope
• Timeline
• Team
TEAM 1|3
1
Assessment| decision-making
2
Roles | Training
DECISION-MAKING
• Team size
• Roles (positions)
• Part-time vs. full-time
HIRING
• Internal staff vs. external hires
• Experienced vs. non-trained
• Professional staff vs. student
assistants
1
Assessment| decision-making
PROJECT ASSESSMENT
• Grant-funded vs.
internally funded
• Project priority
• Project deadlines
• Project specifics
• Large-scale or small
• Scan on demand
• Target audience
TEAM 2|3
ASSIGNING ROLES
• Train narrow specialists for particular
tasks vs. train people universally
• Advantages and disadvantages
• Main roles in digitization:
• Workflow manager
• Staff manager
• Metadata manager
• Metadata creators
• Digitization specialists
TRAINING
• New hires training
• Refreshers
• On-going training
• Training documentation
• Self-training guidelines
2
Roles | Training
TEAM 3|3
WORKFLOW 1|10
1
Preparation of physical materials
2
Digitization of archival materials
3
Image processing
4
Batch import
5
Metadata (object description)
6
Quality review
WORKFLOW 2|10
STATUS CONTROL
FILE
• Comprehensive list of all
collections in a project
• Provides vital information
for tracking progress
• Digital ID
• Collection name
• Workflow segments
• Initials and completion
dates
Some definitions
MASTERFILE
• Comprehensive list of all items
in a batch
• Fields are replica of collection
MAP
• Used for data import in DAMS
• Records initial descriptive and
technical metadata
• Digital IDs
• Descriptions
• Technology
• Formats
DIGITAL OBJECTS
• Compound objects
• Single objects
• Not necessarily replica of the structure
of the archival folder
ARCHIVAL OBJECTS
• A.k.a physical objects
• Described in Finding Aids
• Box level
• Folder level
• Item level
WORKFLOW 3|10
Some definitions
Status Control File
MasterFile
WORKFLOW 4|10
1
Preparation of physical materials
OBTAINING COLLECTION
• Finding Aid
• Get collection
• Collection inventory
• Does Finding Aid
description correspond to
what’s in boxes?
• Is Finding Aid description
on folder level or on item
level?
PROCESSING COLLECTION
• Condition of materials
• Documenting objects
• Folder vs item level decision
• Grouping for efficient digitization
• Updating documentation
DOCUMENTATION
• Analyze Finding Aid
• Prepare MasterFile
• Object type
• single objects
• compound objects
• textual
• visual
• Object format
• negatives
• reflective materials
• 3D objects
WORKFLOW 5|10
DIGITIZATION
• Scan!
• Update documentation
simultaneously
• Keep track of items and
batches
• Transcribe all peculiarities of
the objects
DIGITIZATION PREP
• Technology selection
• Set up scanning sessions
• Group in batches if
materials are
heterogeneous
• Set up documentation
• MasterFile
• Status Control File
• Set up file naming
Digitization of archival materials
2
WORKFLOW 6|10
FILE NAMING
• Digital IDs are critical for object
retrieval and file preservation
• Double-check file names
• Follow the collection naming
conventions
• Organize compound objects
• Update documentation
ACTIONS
• Straightening
• Cropping
• Color correction
• OCR (textual objects)
The sequence of actions and used
software depends on the technology
used for scanning
3
Image processing
EXPORTING
• Create a collection
destination folder for
temporary home of
archival images
• Export TIFFs and JPGs
• Organize compound
objects
WORKFLOW 7|10
IMPORT
• Compound objects w/
parent level metadata
• Compound objects w/
children level metadata
• Single objects
• OCR’ed objects w/ .txt files
• Troubleshoot if necessary
DOCUMENTATION
• Assign batch number
• Update MasterFile
• Update Status Control File
• Generate Tab-delimited .txt
file for the import
• Get the path to the archival
images
ASSIGN
• Check the transcripts (for
OCR’ed objects!)
• Communicate the batch
number to the metadata
creators
4
Batch import
WORKFLOW 8|10
5
Metadata (object description)
METADATA ROLES
• Administrator
• Managers
• Creators
OVERVIEW
• Object description
• Promotes consistency
across collections
• Enhances easy object
retrieval
• Supports faceting
• Supports subject searching
| browsing
Most important digitization segment from user perspective!
METADATA GRANULARITY
• Rich vs. basic metadata
• Collections size – granularity
differs across collections
• Large-scale projects
• Boutique collections
• Digital exhibits
• Material format – some collections
need less granularity
• Newspaper collections
WORKFLOW 9|10
INDEXING GUIDELINES
• Cheat sheet for
metadata creators
• Clarifies MAP
• Disambiguates
• Rules and examples
METADATA
APPLICATION PROFILE
• Fields
• Encoding scheme
• Occurrence
• Obligation
• Collection-specific vs.
shared
CONTROLLED VOCABULARIES
• Local CVs
• Authority files
• AAT | Art and Architecture Thesaurus
• FAST | Faceted Application of Subject Terminology
• LCSH | Library of Congress Subject Headings
• LCNAF | Library of Congress Name Authorities
• MESH | Medical Subject Headings
• TGN | Getty Thesaurus of Geographic Names
• ULAN | Union List of Artist Names
• Collection specific vs shared
5
Metadata (object description)
Most important digitization segment from user perspective!
WORKFLOW 10|10
FEEDBACK AND REVISIONS
• Communicate problems to
metadata creators
• Revise and correct
• Keep list of typical mistakes for
training purposes
ACTIONS
• Image quality
• Metadata quality
• Transcripts
• OCR quality
6
Quality review
GOAL
High quality metadata
= consistent collections
= linked data readiness
= increased usability of digital collections
= happy researchers!
PLATFORM
BACK-END FEATURES
• Easy upload of data sets
• Easy export of data sets
• Easy to use by staff
• Intuitive administrator interface
• Supports multiple file formats
• Supports shared vocabularies
across collections
• Cloud-based vs. server-based
The DAMS interface is your presence in the web! Keep it neat to provide outstanding user experience!
FRONT-END FEATURES
• Easy navigation
• User-friendly
• Customizable
• Responsive design
• Supports multiple collections
• Supports searching and
browsing
• Supports advanced search
• Supports faceting
POPULAR DAMS
STANDARDS AND
SUSTAINABILITY
SUSTAINABILITY
• Iterative approach
• Reuse, adapt, improve existing workflows
• Cross-collection controlled vocabularies
• Adopt best practices
• Have internal procedures and guidelines in
place
STANDARDS
• Dublin Core Metadata Element Set (DC)
• Metadata Object Description Schema (MODS)
• Metadata Encoding and Transmission Standard
(METS)
• eXtensible Markup Language (XML)
• Encoded Archival Standard (EAD)
THANK YOU!
QUESTIONS? Visiting Digital Collections Librarian
University of Nevada - Las Vegas
University Libraries
4505 S Maryland Parkway
Box 457041
Las Vegas, NV 89154
Tel: 702-895-2310
marina.georgieva@unlv.edu
www.marina-expertise.com

More Related Content

Similar to Digitization revealed (2018 NLA Annual Conference)

What do you want to discover today? / Janet Aucock, University of St Andrews
What do you want to discover today? / Janet Aucock, University of St AndrewsWhat do you want to discover today? / Janet Aucock, University of St Andrews
What do you want to discover today? / Janet Aucock, University of St AndrewsCIGScotland
 
The Evolution of the UC San Diego Library DAMS
The Evolution of the  UC San Diego Library DAMSThe Evolution of the  UC San Diego Library DAMS
The Evolution of the UC San Diego Library DAMSMatthew Critchlow
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...DuraSpace
 
(Nov 2008) Preparing Future Digital Curators
(Nov 2008) Preparing Future Digital Curators(Nov 2008) Preparing Future Digital Curators
(Nov 2008) Preparing Future Digital CuratorsCarolyn Hank
 
S cook ands_ttt2_perth_rdm_training
S cook ands_ttt2_perth_rdm_trainingS cook ands_ttt2_perth_rdm_training
S cook ands_ttt2_perth_rdm_trainingARDC
 
Islandora Webinar: Building a Repository Roadmap
Islandora Webinar: Building a Repository RoadmapIslandora Webinar: Building a Repository Roadmap
Islandora Webinar: Building a Repository Roadmapeohallor
 
IBC 2010 Redefining Search
IBC 2010 Redefining SearchIBC 2010 Redefining Search
IBC 2010 Redefining Searchvrt-medialab
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data LocallyErin D. Foster
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with ArchivematicaJenny Mitcham
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...ASIS&T
 
Interdisciplinary Processes at the Digital Repository of Ireland
Interdisciplinary Processes at the Digital Repository of IrelandInterdisciplinary Processes at the Digital Repository of Ireland
Interdisciplinary Processes at the Digital Repository of Irelanddri_ireland
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsJenn Riley
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processLouise Corti
 
Data Storage
Data StorageData Storage
Data StorageMoghees1
 
LaunchPad Project Management
LaunchPad Project ManagementLaunchPad Project Management
LaunchPad Project Managementscottsayre
 
A Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital EraA Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital EraVicki Ferrini
 

Similar to Digitization revealed (2018 NLA Annual Conference) (20)

What do you want to discover today? / Janet Aucock, University of St Andrews
What do you want to discover today? / Janet Aucock, University of St AndrewsWhat do you want to discover today? / Janet Aucock, University of St Andrews
What do you want to discover today? / Janet Aucock, University of St Andrews
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
The Evolution of the UC San Diego Library DAMS
The Evolution of the  UC San Diego Library DAMSThe Evolution of the  UC San Diego Library DAMS
The Evolution of the UC San Diego Library DAMS
 
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
10-15-13 “Metadata and Repository Services for Research Data Curation” Presen...
 
(Nov 2008) Preparing Future Digital Curators
(Nov 2008) Preparing Future Digital Curators(Nov 2008) Preparing Future Digital Curators
(Nov 2008) Preparing Future Digital Curators
 
S cook ands_ttt2_perth_rdm_training
S cook ands_ttt2_perth_rdm_trainingS cook ands_ttt2_perth_rdm_training
S cook ands_ttt2_perth_rdm_training
 
Digital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An IntroductionDigital Preservation with Archivematica: An Introduction
Digital Preservation with Archivematica: An Introduction
 
Islandora Webinar: Building a Repository Roadmap
Islandora Webinar: Building a Repository RoadmapIslandora Webinar: Building a Repository Roadmap
Islandora Webinar: Building a Repository Roadmap
 
IBC 2010 Redefining Search
IBC 2010 Redefining SearchIBC 2010 Redefining Search
IBC 2010 Redefining Search
 
Digital libraries
Digital librariesDigital libraries
Digital libraries
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
 
"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica"Filling the Digital Preservation Gap" with Archivematica
"Filling the Digital Preservation Gap" with Archivematica
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
 
Hansen Metadata for Institutional Repositories
Hansen Metadata for Institutional RepositoriesHansen Metadata for Institutional Repositories
Hansen Metadata for Institutional Repositories
 
Interdisciplinary Processes at the Digital Repository of Ireland
Interdisciplinary Processes at the Digital Repository of IrelandInterdisciplinary Processes at the Digital Repository of Ireland
Interdisciplinary Processes at the Digital Repository of Ireland
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH Fellows
 
Incentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production processIncentivising the uptake of reusable metadata in the survey production process
Incentivising the uptake of reusable metadata in the survey production process
 
Data Storage
Data StorageData Storage
Data Storage
 
LaunchPad Project Management
LaunchPad Project ManagementLaunchPad Project Management
LaunchPad Project Management
 
A Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital EraA Data Scientist Perspective on Data Curation in the Digital Era
A Data Scientist Perspective on Data Curation in the Digital Era
 

More from Marina Georgieva

Metadata for compound objects | training
Metadata for compound objects | trainingMetadata for compound objects | training
Metadata for compound objects | trainingMarina Georgieva
 
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...Marina Georgieva
 
Metadata: An Overview for Digital Collections
Metadata: An Overview for Digital CollectionsMetadata: An Overview for Digital Collections
Metadata: An Overview for Digital CollectionsMarina Georgieva
 
Creating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagementCreating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagementMarina Georgieva
 
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...Marina Georgieva
 
Metadata Remediation: updates, procedures, workflows
Metadata Remediation: updates, procedures, workflowsMetadata Remediation: updates, procedures, workflows
Metadata Remediation: updates, procedures, workflowsMarina Georgieva
 
2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbers2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbersMarina Georgieva
 
Building websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagementBuilding websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagementMarina Georgieva
 
The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...Marina Georgieva
 
Project Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendeesProject Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendeesMarina Georgieva
 
Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018Marina Georgieva
 
ContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital CollectionsContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital CollectionsMarina Georgieva
 
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy MeetingNevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy MeetingMarina Georgieva
 
Nevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America PresentationNevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America PresentationMarina Georgieva
 
Nevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming eventsNevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming eventsMarina Georgieva
 
Nevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling AmericaNevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling AmericaMarina Georgieva
 
Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017Marina Georgieva
 
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)Marina Georgieva
 
Inforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batchInforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batchMarina Georgieva
 
Nevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm StatusNevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm StatusMarina Georgieva
 

More from Marina Georgieva (20)

Metadata for compound objects | training
Metadata for compound objects | trainingMetadata for compound objects | training
Metadata for compound objects | training
 
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
In-house vs. Outsourced Digitization: similarities, key differences and pitfa...
 
Metadata: An Overview for Digital Collections
Metadata: An Overview for Digital CollectionsMetadata: An Overview for Digital Collections
Metadata: An Overview for Digital Collections
 
Creating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagementCreating websites and leading librarians to a new level of project engagement
Creating websites and leading librarians to a new level of project engagement
 
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
From Temporary to Transformative: Leveraging Externally-Funded Special Collec...
 
Metadata Remediation: updates, procedures, workflows
Metadata Remediation: updates, procedures, workflowsMetadata Remediation: updates, procedures, workflows
Metadata Remediation: updates, procedures, workflows
 
2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbers2018 Professional accomplishments in numbers
2018 Professional accomplishments in numbers
 
Building websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagementBuilding websites and leading librarians to a new level of project engagement
Building websites and leading librarians to a new level of project engagement
 
The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...The digital librarian: the liaison between digital collections and digital pr...
The digital librarian: the liaison between digital collections and digital pr...
 
Project Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendeesProject Management Poster Handout for ALA Annual 2018 attendees
Project Management Poster Handout for ALA Annual 2018 attendees
 
Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018Project Management Poster at ALA Annual 2018
Project Management Poster at ALA Annual 2018
 
ContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital CollectionsContentDm Landing pages for Digital Collections
ContentDm Landing pages for Digital Collections
 
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy MeetingNevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
Nevada Digital Newspaper Project at the Clark County Nevada Genealogy Meeting
 
Nevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America PresentationNevada Digital Newspaper Project and Chronicling America Presentation
Nevada Digital Newspaper Project and Chronicling America Presentation
 
Nevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming eventsNevada Digital Newspaper Project | Upcoming events
Nevada Digital Newspaper Project | Upcoming events
 
Nevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling AmericaNevada Digital Newspaper Project | New addition to Chronicling America
Nevada Digital Newspaper Project | New addition to Chronicling America
 
Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017Large-scale digitization plan | UNLV Libraries, Dec 2017
Large-scale digitization plan | UNLV Libraries, Dec 2017
 
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
Nevada Digital Newspaper Project | SC Division Meeting Update (Feb 2018)
 
Inforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batchInforgraphic: facts about NDNP batch
Inforgraphic: facts about NDNP batch
 
Nevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm StatusNevada Digital Newspaper Project Midterm Status
Nevada Digital Newspaper Project Midterm Status
 

Recently uploaded

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 

Recently uploaded (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 

Digitization revealed (2018 NLA Annual Conference)

  • 1. DIGITIZATION REVEALED 101 crash course on the fundamentals of digitizing archival collections from start to finish MARINA GEORGIEVA Nevada Library Association Annual Conference October 14, 2018 Las Vegas, NV
  • 2. ABOUT ME  Graduated from University of Wisconsin – Milwaukee  MLIS with concentration in Information technologies  Digital libraries  Graphic and Web design  Metadata  Information architecture and UX design  Visiting Digital Collections Librarian at University of Nevada – Las Vegas  Professional passion  Project management  Digitization  Metadata design and management Digitizing archival manuscripts on PhaseOne camera (2018) More about my work at www.marina-expertise.com
  • 3. TOPICS Collection Inventory and Assessment Funding Digitization technology Team Workflow Platform Standards and sustainability
  • 4. COLLECTION INVENTORY AND ASSESSMENT WHY DIGITIZE? • For access • For preservation • On demand • For specific occasion/exhibit ANALYZE THE COLLECTION • Priority list • Research value • Collection size • Collection level of processing (Finding Aid) • Reuse of existing Finding Aid • Condition of materials • Format of materials • Has it been digitized as part of another project? ASSESSMENT • Large scale or boutique approach? • Digitize all or selected materials? • Who does selection? • In-house capacities • Outsourcing options
  • 6. INTERNAL FUNDING • Types of digitization projects • Reducing archival backlog • Preservation • Access (esp. high demand collections) • Scan on demand (for class or event) • Staffing • Existing staff • Students assistants • Interns • Volunteers GRANT FUNDING • Types of digitization projects • Specific project that fits the grant scope • High priority collection that requires more resources and is eligible for grant • High research value collections • Collections in desperate need for preservation • Staffing • Specially hired and trained project staff • Specially trained students assistants FUNDING 2|3 1 Funding types
  • 7. FUNDING 3|3 2 Grant-funding opportunities National Endowment for the Humanities https://www.neh.gov/grants/listing?keywords=digitization Library Services and Technology Act https://nsla.libguides.com/2018LSTA Institute of Museum and Library Services https://www.imls.gov/grants/apply-grant/available-grants National Historical Publications & Records Commission https://www.archives.gov/nhprc/apply/eligibility.html
  • 8. DIGITIZATION TECHNOLOGY 1|3 1 Types of digitization technologies 2 Selecting the best fit
  • 9. DIGITIZATION TECHNOLOGY 2|3 1 Types of digitization technologies Flatbed scanner Large format scanner Transparencies scanner Book scanner Camera systems Microfilm scanner
  • 10. DIGITIZATION TECHNOLOGY 3|3 2 Selecting the best fit CONSIDER THE TECHNOLOGY • Equipment you already have • Equipment you need to purchase • Equipment you need to outsource CONSIDER THE COLLECTION • Format of archival materials • Microfilms • Transparencies • Reflective materials • Oversized materials • Manuscripts • Books CONSIDER THE PROJECT • Scale • Scope • Timeline • Team
  • 12. DECISION-MAKING • Team size • Roles (positions) • Part-time vs. full-time HIRING • Internal staff vs. external hires • Experienced vs. non-trained • Professional staff vs. student assistants 1 Assessment| decision-making PROJECT ASSESSMENT • Grant-funded vs. internally funded • Project priority • Project deadlines • Project specifics • Large-scale or small • Scan on demand • Target audience TEAM 2|3
  • 13. ASSIGNING ROLES • Train narrow specialists for particular tasks vs. train people universally • Advantages and disadvantages • Main roles in digitization: • Workflow manager • Staff manager • Metadata manager • Metadata creators • Digitization specialists TRAINING • New hires training • Refreshers • On-going training • Training documentation • Self-training guidelines 2 Roles | Training TEAM 3|3
  • 14. WORKFLOW 1|10 1 Preparation of physical materials 2 Digitization of archival materials 3 Image processing 4 Batch import 5 Metadata (object description) 6 Quality review
  • 15. WORKFLOW 2|10 STATUS CONTROL FILE • Comprehensive list of all collections in a project • Provides vital information for tracking progress • Digital ID • Collection name • Workflow segments • Initials and completion dates Some definitions MASTERFILE • Comprehensive list of all items in a batch • Fields are replica of collection MAP • Used for data import in DAMS • Records initial descriptive and technical metadata • Digital IDs • Descriptions • Technology • Formats DIGITAL OBJECTS • Compound objects • Single objects • Not necessarily replica of the structure of the archival folder ARCHIVAL OBJECTS • A.k.a physical objects • Described in Finding Aids • Box level • Folder level • Item level
  • 16. WORKFLOW 3|10 Some definitions Status Control File MasterFile
  • 17. WORKFLOW 4|10 1 Preparation of physical materials OBTAINING COLLECTION • Finding Aid • Get collection • Collection inventory • Does Finding Aid description correspond to what’s in boxes? • Is Finding Aid description on folder level or on item level? PROCESSING COLLECTION • Condition of materials • Documenting objects • Folder vs item level decision • Grouping for efficient digitization • Updating documentation DOCUMENTATION • Analyze Finding Aid • Prepare MasterFile • Object type • single objects • compound objects • textual • visual • Object format • negatives • reflective materials • 3D objects
  • 18. WORKFLOW 5|10 DIGITIZATION • Scan! • Update documentation simultaneously • Keep track of items and batches • Transcribe all peculiarities of the objects DIGITIZATION PREP • Technology selection • Set up scanning sessions • Group in batches if materials are heterogeneous • Set up documentation • MasterFile • Status Control File • Set up file naming Digitization of archival materials 2
  • 19. WORKFLOW 6|10 FILE NAMING • Digital IDs are critical for object retrieval and file preservation • Double-check file names • Follow the collection naming conventions • Organize compound objects • Update documentation ACTIONS • Straightening • Cropping • Color correction • OCR (textual objects) The sequence of actions and used software depends on the technology used for scanning 3 Image processing EXPORTING • Create a collection destination folder for temporary home of archival images • Export TIFFs and JPGs • Organize compound objects
  • 20. WORKFLOW 7|10 IMPORT • Compound objects w/ parent level metadata • Compound objects w/ children level metadata • Single objects • OCR’ed objects w/ .txt files • Troubleshoot if necessary DOCUMENTATION • Assign batch number • Update MasterFile • Update Status Control File • Generate Tab-delimited .txt file for the import • Get the path to the archival images ASSIGN • Check the transcripts (for OCR’ed objects!) • Communicate the batch number to the metadata creators 4 Batch import
  • 21. WORKFLOW 8|10 5 Metadata (object description) METADATA ROLES • Administrator • Managers • Creators OVERVIEW • Object description • Promotes consistency across collections • Enhances easy object retrieval • Supports faceting • Supports subject searching | browsing Most important digitization segment from user perspective! METADATA GRANULARITY • Rich vs. basic metadata • Collections size – granularity differs across collections • Large-scale projects • Boutique collections • Digital exhibits • Material format – some collections need less granularity • Newspaper collections
  • 22. WORKFLOW 9|10 INDEXING GUIDELINES • Cheat sheet for metadata creators • Clarifies MAP • Disambiguates • Rules and examples METADATA APPLICATION PROFILE • Fields • Encoding scheme • Occurrence • Obligation • Collection-specific vs. shared CONTROLLED VOCABULARIES • Local CVs • Authority files • AAT | Art and Architecture Thesaurus • FAST | Faceted Application of Subject Terminology • LCSH | Library of Congress Subject Headings • LCNAF | Library of Congress Name Authorities • MESH | Medical Subject Headings • TGN | Getty Thesaurus of Geographic Names • ULAN | Union List of Artist Names • Collection specific vs shared 5 Metadata (object description) Most important digitization segment from user perspective!
  • 23. WORKFLOW 10|10 FEEDBACK AND REVISIONS • Communicate problems to metadata creators • Revise and correct • Keep list of typical mistakes for training purposes ACTIONS • Image quality • Metadata quality • Transcripts • OCR quality 6 Quality review GOAL High quality metadata = consistent collections = linked data readiness = increased usability of digital collections = happy researchers!
  • 24. PLATFORM BACK-END FEATURES • Easy upload of data sets • Easy export of data sets • Easy to use by staff • Intuitive administrator interface • Supports multiple file formats • Supports shared vocabularies across collections • Cloud-based vs. server-based The DAMS interface is your presence in the web! Keep it neat to provide outstanding user experience! FRONT-END FEATURES • Easy navigation • User-friendly • Customizable • Responsive design • Supports multiple collections • Supports searching and browsing • Supports advanced search • Supports faceting POPULAR DAMS
  • 25. STANDARDS AND SUSTAINABILITY SUSTAINABILITY • Iterative approach • Reuse, adapt, improve existing workflows • Cross-collection controlled vocabularies • Adopt best practices • Have internal procedures and guidelines in place STANDARDS • Dublin Core Metadata Element Set (DC) • Metadata Object Description Schema (MODS) • Metadata Encoding and Transmission Standard (METS) • eXtensible Markup Language (XML) • Encoded Archival Standard (EAD)
  • 26. THANK YOU! QUESTIONS? Visiting Digital Collections Librarian University of Nevada - Las Vegas University Libraries 4505 S Maryland Parkway Box 457041 Las Vegas, NV 89154 Tel: 702-895-2310 marina.georgieva@unlv.edu www.marina-expertise.com

Editor's Notes

  1. The agenda today will provide a large picture of what digitization project looks like.
  2. Format of archival materials – one type or a mix? Project scale – is it large-scale or small scale? Does speed and efficiency matter? Project timeline – how tight the final deadlines are? Are they flexible? Project team – do you have staff trained to use the piece of equipment you are planning to use? Can you provide training?
  3. AAT: Art and Architecture Thesaurus FAST: Faceted Application of Subject Terminology LCSH: Library of Congress Subject Headings MESH: Medical Subject Headings TGN: Getty Thesaurus of Geographic Names ULAN: Union List of Artist Names
  4. AAT: Art and Architecture Thesaurus FAST: Faceted Application of Subject Terminology LCSH: Library of Congress Subject Headings MESH: Medical Subject Headings TGN: Getty Thesaurus of Geographic Names ULAN: Union List of Artist Names