SlideShare a Scribd company logo
Rapid Digitization of Latin American Ephemera with Hydra 
Project History and Goals 
Princeton University Library began to collect and build an archive of Latin American ephemera and gray 
literature in the mid 1970s to document the activities of political and social organizations and movements, as 
well as the broader political, socioeconomic and cultural developments of the region. Access to the material 
was provided by slowly accumulating and organizing thematic sub-collections, creating finding aids, and 
microfilming selected curated sub-collections. Reproductions of the microfilm were commercially distributed 
and resulting royalties were used to fund new acquisitions. That model gradually become unsustainable 
during the past decade and microfilming was halted in 2008. 
Hydra breathes new life into this project by providing us with a 
framework for creating an end-to-end application that will facilitate rapid 
digitization, cataloging, and access to this important collection. Since 
the system went into production in April of 2014, nearly 1500 items 
have been cataloged, with the throughput rate ultimately accelerating to 
over 300 items per month in August. 
Princeton has several projects and workflows similar to “LAE”, and we 
expect many of the components of this applications to reusable in future 
projects. 
Item received and placed in 
Folder w/ barcode 
Preliminary metadata 
recorded, Folder placed in 
Box* w/ barcode 
Full Box sent offsite for 
imaging and OCR 
Hard drives returned to library, 
boxes to off site storage 
Images qc’d and ingested 
(barcode associates with 
metadata) 
Metadata completed from 
images 
Final quality control 
Production 
Returned drive is 
organized by 
box/folder barcodes. It 
also contains a 
JHOVE audit with 
checksums 
Generously sponsored by the Council on Library and Information Resources (CLIR; http://www.clir.org/) 
and Latin Americanist Research Resources Project (LARRP; http://www.crl.edu/grn/larrp) 
Image QC includes: 
● Checksums verified 
● Color profiles confirmed 
● Confirm (1) OCR for (1) file 
Process returns a file of arguments which: 
● Instantiating Page (image) and Folder 
objects 
● Creating the appropriate associations 
● This process also creates a JP2 that is 
copied to our image server (Loris). 
A Folder is instantiated in repo. Folder 
barcode, box barcode (via scanner), title, 
country, and genre MUST be included 
before it can be saved. 
State is set to “Has Preliminary 
Metadata” 
Folders move through their 
remaining states: 
● “Has Core Metadata" 
● “Needs QC" 
● "In Production" 
Item Cataloging and Imaging Workflow 
* Boxes have a similar workflow. States 
are: 
1. "New" 
2. "Ready to Ship" 
3. "Shipped" 
4. "Received" 
5. "All in Production" 
@prefix dc: <http://purl.org/dc/terms/> . 
@prefix isolang: <http://id.loc.gov/vocabulary/iso639-2/> . 
@prefix lcco: <http://id.loc.gov/vocabulary/countries/> . 
@prefix lcga: <http://id.loc.gov/vocabulary/geographicAreas/> . 
@prefix lcsh: <http://id.loc.gov/authorities/subjects/> . 
@prefix marcrel: <"http://id.loc.gov/vocabulary/relators/> . 
@prefix puls: <http://pul-store.princeton.edu/> . 
@prefix pulterms: <http://princeton.edu/pulstore/terms/> . 
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . 
@prefix tgm: <http://id.loc.gov/vocabulary/graphicMaterials/> . 
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> . 
puls:00b84 dc:title "Memoria 2003."@es; 
marcrel:mfp lcga:cl; 
pulterms:heightInCM "21"^^xsd:int; 
pulterms:isPartOfProject puls:0005p; 
pulterms:pageCount "15"^^xsd:int; 
pulterms:widthInCM "15"^^xsd:int; 
dc:coverage lcga:cl; 
dc:created "2003"^^xsd:gYear; 
dc:format tgm:tgm007415; 
dc:language isolang:spa; 
dc:publisher "Asociación para la Cooperación en el Sur (ACSUR)-Las Segovias"; 
dc:rights "This digital reproduction is intended to support research, teaching, 
and private study. Users are responsible for determining any copyright questions." 
@en; 
dc:subject lcsh:sh85040810, 
"Economic policy--Social aspects"@en, 
"Economics--Political aspects"@en, 
lcsh:sh85067385 . 
puls:004kr dc:title "The Workforce of Puerto Rico: Fringe Benefits."@en; 
marcrel:mfp lcco:pr; 
pulterms:heightInCM "22"^^xsd:int; 
pulterms:isPartOfProject puls:0005p; 
pulterms:pageCount "17"^^xsd:int; 
[...] 
Rich Metadata Available as RDF/Linked Data

More Related Content

What's hot

2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
Jun Zhao
 
Rdf Overview Presentation
Rdf Overview PresentationRdf Overview Presentation
Rdf Overview Presentation
Ken Varnum
 
2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jk2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jk
Johannes Keizer
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables
祺傑 林
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
Jane Frazier
 
Converting GHO to RDF
Converting GHO to RDFConverting GHO to RDF
Converting GHO to RDF
Amrapali Zaveri, PhD
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables
Emir Muñoz
 
Keynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C eventKeynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C event
Diego Valerio Camarda
 
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
Diego Valerio Camarda
 
Federated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFedFederated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFed
Muhammad Saleem
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 Tutorial
Muhammad Saleem
 
FAIR Projector Builder
FAIR Projector BuilderFAIR Projector Builder
FAIR Projector Builder
Mark Wilkinson
 
RDF data model
RDF data modelRDF data model
RDF data model
Jose Emilio Labra Gayo
 
1 bioline & t space or2013 final
1 bioline & t space or2013 final1 bioline & t space or2013 final
1 bioline & t space or2013 final
KellliBee
 
Harnessing The Semantic Web
Harnessing The Semantic WebHarnessing The Semantic Web
Harnessing The Semantic Web
william_greenly
 
Ukgovld registry-intro
Ukgovld registry-introUkgovld registry-intro
Ukgovld registry-intro
Dave Reynolds
 
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
Diego Valerio Camarda
 
Linked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOCLinked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOC
AIMS (Agricultural Information Management Standards)
 
Grails And The Semantic Web
Grails And The Semantic WebGrails And The Semantic Web
Grails And The Semantic Web
william_greenly
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introduction
Graphity
 

What's hot (20)

2009 0807 Lod Gmod
2009 0807 Lod Gmod2009 0807 Lod Gmod
2009 0807 Lod Gmod
 
Rdf Overview Presentation
Rdf Overview PresentationRdf Overview Presentation
Rdf Overview Presentation
 
2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jk2015 09 rda-pre-meeting_jk
2015 09 rda-pre-meeting_jk
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables
 
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQLVALA Tech Camp 2017: Intro to Wikidata & SPARQL
VALA Tech Camp 2017: Intro to Wikidata & SPARQL
 
Converting GHO to RDF
Converting GHO to RDFConverting GHO to RDF
Converting GHO to RDF
 
Using Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's TablesUsing Linked Data to Mine RDF from Wikipedia's Tables
Using Linked Data to Mine RDF from Wikipedia's Tables
 
Keynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C eventKeynote session - LOD2014 W3C event
Keynote session - LOD2014 W3C event
 
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
18 ° Nexa Lunch Seminar - Lo stato dell'arte dei Linked Open Data italiani
 
Federated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFedFederated Query Formulation and Processing Through BioFed
Federated Query Formulation and Processing Through BioFed
 
Federated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 TutorialFederated SPARQL Query Processing ISWC2015 Tutorial
Federated SPARQL Query Processing ISWC2015 Tutorial
 
FAIR Projector Builder
FAIR Projector BuilderFAIR Projector Builder
FAIR Projector Builder
 
RDF data model
RDF data modelRDF data model
RDF data model
 
1 bioline & t space or2013 final
1 bioline & t space or2013 final1 bioline & t space or2013 final
1 bioline & t space or2013 final
 
Harnessing The Semantic Web
Harnessing The Semantic WebHarnessing The Semantic Web
Harnessing The Semantic Web
 
Ukgovld registry-intro
Ukgovld registry-introUkgovld registry-intro
Ukgovld registry-intro
 
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
 
Linked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOCLinked open Vocabularies for Linked Open Data - the role of AGROVOC
Linked open Vocabularies for Linked Open Data - the role of AGROVOC
 
Grails And The Semantic Web
Grails And The Semantic WebGrails And The Semantic Web
Grails And The Semantic Web
 
Semantic Web introduction
Semantic Web introductionSemantic Web introduction
Semantic Web introduction
 

Similar to Rapid Digitization of Latin American Ephemera with Hydra

Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010
Claire Stewart
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collections
lljohnston
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter Libraries
OCLC
 
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
Trevor Owens
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony Hey
Cornelius Puschmann
 
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
rhonabwy
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Anita de Waard
 
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Marcus Smith
 
Cooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAPCooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAP
Cathal McCauley
 
LOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data ApproachesLOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data Approaches
Adrian Stevenson
 
Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18
guest61f1b7d
 
BatIg
BatIgBatIg
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
lljohnston
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
Carole Goble
 
A Guide for Reproducible Research
A Guide for Reproducible ResearchA Guide for Reproducible Research
A Guide for Reproducible Research
Yasmin AlNoamany, PhD
 
The Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from SwedenThe Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from Sweden
Marcus Smith
 
Digital Library Project Proposal
Digital Library Project ProposalDigital Library Project Proposal
Digital Library Project Proposal
Micah Vandegrift
 
160606 data lifecycle project outline
160606 data lifecycle project outline160606 data lifecycle project outline
160606 data lifecycle project outline
Ian Duncan
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
National Information Standards Organization (NISO)
 
Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?
Adrian Stevenson
 

Similar to Rapid Digitization of Latin American Ephemera with Hydra (20)

Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010Preservation Metadata, CARLI Metadata Matters series, December 2010
Preservation Metadata, CARLI Metadata Matters series, December 2010
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collections
 
Smarter Data for Smarter Libraries
Smarter Data for Smarter LibrariesSmarter Data for Smarter Libraries
Smarter Data for Smarter Libraries
 
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
We Have Interesting Problems: Some Applied Grand Challenges from Digital Libr...
 
Berlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony HeyBerlin 6 Open Access Conference: Tony Hey
Berlin 6 Open Access Conference: Tony Hey
 
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
Crossing Institutional Boundaries to Create Permanent Public Access for Gover...
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
Linked Open Data and The Digital Archaeological Workflow at the Swedish Natio...
 
Cooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAPCooperative advantages: Lessons in scale and sustainability from ReCAP
Cooperative advantages: Lessons in scale and sustainability from ReCAP
 
LOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data ApproachesLOCAH Project and Considerations of Linked Data Approaches
LOCAH Project and Considerations of Linked Data Approaches
 
Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18Government Documents Disposition Project Made Easy with Aleph V.18
Government Documents Disposition Project Made Easy with Aleph V.18
 
BatIg
BatIgBatIg
BatIg
 
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
Leslie Johnston: Challenges of Preserving Every Digital Format, 2012
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
A Guide for Reproducible Research
A Guide for Reproducible ResearchA Guide for Reproducible Research
A Guide for Reproducible Research
 
The Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from SwedenThe Digital Archaeological Workflow: A Case Study from Sweden
The Digital Archaeological Workflow: A Case Study from Sweden
 
Digital Library Project Proposal
Digital Library Project ProposalDigital Library Project Proposal
Digital Library Project Proposal
 
160606 data lifecycle project outline
160606 data lifecycle project outline160606 data lifecycle project outline
160606 data lifecycle project outline
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Worl...
 
Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?Linked Data - the Future for Open Repositories?
Linked Data - the Future for Open Repositories?
 

More from Jon Stroop

A more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDMA more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDM
Jon Stroop
 
Introduction to the IIIF Image API
Introduction to the IIIF Image APIIntroduction to the IIIF Image API
Introduction to the IIIF Image API
Jon Stroop
 
IIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, COIIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, CO
Jon Stroop
 
IIIF API Specifications Overview
IIIF API Specifications OverviewIIIF API Specifications Overview
IIIF API Specifications Overview
Jon Stroop
 
Meet Loris and OpenSeadragon
Meet Loris and OpenSeadragonMeet Loris and OpenSeadragon
Meet Loris and OpenSeadragon
Jon Stroop
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
Jon Stroop
 

More from Jon Stroop (6)

A more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDMA more Worthwhile Sufia: Now with PCDM
A more Worthwhile Sufia: Now with PCDM
 
Introduction to the IIIF Image API
Introduction to the IIIF Image APIIntroduction to the IIIF Image API
Introduction to the IIIF Image API
 
IIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, COIIIF Technology for VRA33, 14 March 2015, Denver, CO
IIIF Technology for VRA33, 14 March 2015, Denver, CO
 
IIIF API Specifications Overview
IIIF API Specifications OverviewIIIF API Specifications Overview
IIIF API Specifications Overview
 
Meet Loris and OpenSeadragon
Meet Loris and OpenSeadragonMeet Loris and OpenSeadragon
Meet Loris and OpenSeadragon
 
IIIF for Index of Christian Art
IIIF for Index of Christian ArtIIIF for Index of Christian Art
IIIF for Index of Christian Art
 

Recently uploaded

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
shyamraj55
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
danishmna97
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
Uni Systems S.M.S.A.
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
Rohit Gautam
 

Recently uploaded (20)

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with SlackLet's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
How to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptxHow to Get CNIC Information System with Paksim Ga.pptx
How to Get CNIC Information System with Paksim Ga.pptx
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
Microsoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdfMicrosoft - Power Platform_G.Aspiotis.pdf
Microsoft - Power Platform_G.Aspiotis.pdf
 
Large Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial ApplicationsLarge Language Model (LLM) and it’s Geospatial Applications
Large Language Model (LLM) and it’s Geospatial Applications
 

Rapid Digitization of Latin American Ephemera with Hydra

  • 1. Rapid Digitization of Latin American Ephemera with Hydra Project History and Goals Princeton University Library began to collect and build an archive of Latin American ephemera and gray literature in the mid 1970s to document the activities of political and social organizations and movements, as well as the broader political, socioeconomic and cultural developments of the region. Access to the material was provided by slowly accumulating and organizing thematic sub-collections, creating finding aids, and microfilming selected curated sub-collections. Reproductions of the microfilm were commercially distributed and resulting royalties were used to fund new acquisitions. That model gradually become unsustainable during the past decade and microfilming was halted in 2008. Hydra breathes new life into this project by providing us with a framework for creating an end-to-end application that will facilitate rapid digitization, cataloging, and access to this important collection. Since the system went into production in April of 2014, nearly 1500 items have been cataloged, with the throughput rate ultimately accelerating to over 300 items per month in August. Princeton has several projects and workflows similar to “LAE”, and we expect many of the components of this applications to reusable in future projects. Item received and placed in Folder w/ barcode Preliminary metadata recorded, Folder placed in Box* w/ barcode Full Box sent offsite for imaging and OCR Hard drives returned to library, boxes to off site storage Images qc’d and ingested (barcode associates with metadata) Metadata completed from images Final quality control Production Returned drive is organized by box/folder barcodes. It also contains a JHOVE audit with checksums Generously sponsored by the Council on Library and Information Resources (CLIR; http://www.clir.org/) and Latin Americanist Research Resources Project (LARRP; http://www.crl.edu/grn/larrp) Image QC includes: ● Checksums verified ● Color profiles confirmed ● Confirm (1) OCR for (1) file Process returns a file of arguments which: ● Instantiating Page (image) and Folder objects ● Creating the appropriate associations ● This process also creates a JP2 that is copied to our image server (Loris). A Folder is instantiated in repo. Folder barcode, box barcode (via scanner), title, country, and genre MUST be included before it can be saved. State is set to “Has Preliminary Metadata” Folders move through their remaining states: ● “Has Core Metadata" ● “Needs QC" ● "In Production" Item Cataloging and Imaging Workflow * Boxes have a similar workflow. States are: 1. "New" 2. "Ready to Ship" 3. "Shipped" 4. "Received" 5. "All in Production" @prefix dc: <http://purl.org/dc/terms/> . @prefix isolang: <http://id.loc.gov/vocabulary/iso639-2/> . @prefix lcco: <http://id.loc.gov/vocabulary/countries/> . @prefix lcga: <http://id.loc.gov/vocabulary/geographicAreas/> . @prefix lcsh: <http://id.loc.gov/authorities/subjects/> . @prefix marcrel: <"http://id.loc.gov/vocabulary/relators/> . @prefix puls: <http://pul-store.princeton.edu/> . @prefix pulterms: <http://princeton.edu/pulstore/terms/> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix tgm: <http://id.loc.gov/vocabulary/graphicMaterials/> . @prefix xsd: <http://www.w3.org/2001/XMLSchema#> . puls:00b84 dc:title "Memoria 2003."@es; marcrel:mfp lcga:cl; pulterms:heightInCM "21"^^xsd:int; pulterms:isPartOfProject puls:0005p; pulterms:pageCount "15"^^xsd:int; pulterms:widthInCM "15"^^xsd:int; dc:coverage lcga:cl; dc:created "2003"^^xsd:gYear; dc:format tgm:tgm007415; dc:language isolang:spa; dc:publisher "Asociación para la Cooperación en el Sur (ACSUR)-Las Segovias"; dc:rights "This digital reproduction is intended to support research, teaching, and private study. Users are responsible for determining any copyright questions." @en; dc:subject lcsh:sh85040810, "Economic policy--Social aspects"@en, "Economics--Political aspects"@en, lcsh:sh85067385 . puls:004kr dc:title "The Workforce of Puerto Rico: Fringe Benefits."@en; marcrel:mfp lcco:pr; pulterms:heightInCM "22"^^xsd:int; pulterms:isPartOfProject puls:0005p; pulterms:pageCount "17"^^xsd:int; [...] Rich Metadata Available as RDF/Linked Data