JPEG 2000 at the Wellcome
Library
Christy Henshaw
Digitisation Programme Manager – Wellcome Library
JP2 Summit
12-13 May 2011
Library of Congress
The Wellcome Trust
•A global charitable foundation
•Achieving extraordinary improvements in human and animal health
•Supporting the brightest minds in biomedical research and the
medical humanities
•Exploring medicine in historical and cultural contexts
The Wellcome Library
Collections of books, manuscripts, archives, films and pictures on the
history of medicine from the earliest times to the present day .
The Wellcome Digital Library pilot,
2010-2013
Genetics and its Modern Foundations
A new online resource for everyone interested in the history of
human and animal health.

Aims
• build sustainable/expandable mechanism – foundation stone
for WDL
• digitise key library holdings - relating to a major Trust
challenge area
• digitise important third party content – linked to theme
• use innovative content and tools – to encourage discovery and
use
• explore commercial partnerships – enhance access to non-theme material
JPEG 2000 conversion – scope
•Wellcome Images – image library, legacy images, 300,000
images in the archive
•Current projects – pilot digitisation projects, 7m images 2010-2014
• Long-term plans – digitisation of large proportion of our
collections (mainly special collections), 15m – 25m images 2014
and beyond
Type of content
•Printed books – early printed books, modern books
(monographs), pamphlets, reports
•Archives – personal papers, institutional papers, unpublished
works, mostly 20th century
• Manuscripts – unpublished, handwritten “manuscript books” and
related materials, mostly 17th, 18th and 19th century, can be fragile

• Artworks – prints, paintings, posters, drawings, glass slides, etc.
The Francis Crick Archive
Books related to genetic research
Early printed books
Artworks, manuscripts
Decision to adopt JP2
JPEG 2000 was found to answer the following needs:
•Storage costs – 20-30m TIFFs held on online, backed-up
storage = multiple petabytes. Needed something cost-effective.
•Quality – needed a high-quality compressed format that would
cover a wide range of content types.
• Robustness – needed a well-established image format with a
high chance of long-term support.
• Practical – feasible to use in a Library digitisation workflow.
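The storage argument can be checked with a rough projection. This is a minimal sketch, not the Library's own calculation: the average TIFF size of 45 MB, the two stored copies and the flat 6:1 ratio are all assumptions chosen for illustration.

```python
# Rough storage projection; every input below is an illustrative assumption.

def projected_storage_tb(images: int, avg_mb_per_image: float, copies: int = 1) -> float:
    """Return total storage in terabytes (decimal units: 1 TB = 1,000,000 MB)."""
    return images * avg_mb_per_image * copies / 1_000_000

AVG_TIFF_MB = 45.0   # assumed average uncompressed TIFF size
COPIES = 2           # assumed: one online copy plus one backup
LOSSY_RATIO = 6      # assumed flat 6:1 lossy compression

for images in (20_000_000, 30_000_000):
    tiff_tb = projected_storage_tb(images, AVG_TIFF_MB, COPIES)
    jp2_tb = projected_storage_tb(images, AVG_TIFF_MB / LOSSY_RATIO, COPIES)
    print(f"{images:,} images: TIFF ~{tiff_tb:,.0f} TB, JP2 at 6:1 ~{jp2_tb:,.0f} TB")
```

Under these assumptions the TIFF estimate lands in the low petabytes, while the compressed estimate stays in the hundreds of terabytes.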
Finding our way
Working with JP2 opened up a whole new world – reading
specifications, finding conversion software, so many choices.

Commissioned the report:
JPEG 2000 as a Preservation and Access Format for the Wellcome Trust Libr
Goal: to find a single version of JPEG 2000 that would meet the
needs of both long-term preservation and flexible delivery.
The result
Parameter             Settings
File format           Part 1 (.jp2)
Compression           Lossy (6:1, 10:1)
Tiling                1024 x 1024
Progression order     RLCP
Decomp levels         5
Quality layers        8
Code block size       6, 64x64
Regions of interest   No
TLM markers           Yes
Bypass                N/A
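To make the settings concrete, the sketch below records them in a small Python class and derives a few quantities from them. The class name `Jp2Profile`, its method names and the 4000 x 6000 pixel RGB example image are hypothetical; none of this is LuraWave's interface.

```python
import math
from dataclasses import dataclass

@dataclass(frozen=True)
class Jp2Profile:
    """Hypothetical container for the encoding settings listed above."""
    compression_ratio: float          # e.g. 6 for 6:1 lossy
    tile_size: int = 1024             # square tiles, in pixels
    decomposition_levels: int = 5
    quality_layers: int = 8
    progression_order: str = "RLCP"
    code_block_size: int = 64

    def resolution_levels(self) -> int:
        # N wavelet decomposition levels yield N + 1 resolution levels.
        return self.decomposition_levels + 1

    def smallest_resolution(self, width: int, height: int) -> tuple[int, int]:
        # Each decomposition level halves both dimensions, rounding up.
        scale = 2 ** self.decomposition_levels
        return math.ceil(width / scale), math.ceil(height / scale)

    def tile_count(self, width: int, height: int) -> int:
        return math.ceil(width / self.tile_size) * math.ceil(height / self.tile_size)

    def target_bytes(self, width: int, height: int,
                     channels: int = 3, bits_per_sample: int = 8) -> int:
        # Target compressed size = uncompressed size / compression ratio.
        raw = width * height * channels * bits_per_sample // 8
        return round(raw / self.compression_ratio)

profile = Jp2Profile(compression_ratio=6)
width, height = 4000, 6000            # assumed example image, 24-bit RGB
print(profile.resolution_levels())                 # 6
print(profile.smallest_resolution(width, height))  # (125, 188)
print(profile.tile_count(width, height))           # 24
print(profile.target_bytes(width, height))         # 12000000
```

Five decomposition levels mean a viewer can request the image at six sizes, down to roughly 1/32 of each dimension, without decoding the full-resolution data.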
Embedding JP2
Chose LuraWave command line tool
• Some issues (bugs, or inconvenient implementations) arose, and
all have been successfully addressed by LuraTech
• Created a firm consensus to use JP2 as the format for all still-image digital imaging (with one or two exceptions)
• No plans to use JP2 for digital video – but never say never
• Internal information sharing – digital archivists, systems
administrators, IT department, programme board members
• External communication and networking
Current status, future plans
• Conversion of all new digital images is now carried out as
standard
• Nearing the final stages of a project to convert 450k image
backlog to JP2 (reducing current footprint from 20 TB to 5.5 TB)
• Large projects use lossy JP2, legacy picture library uses lossless
• Developed a strategy to determine compression levels
• Currently using the GUI, but will use the command line interface
with our new workflow system, streamlining conversion and QA
• Medium term, will look at automating compression level selection
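The backlog figures imply an overall compression ratio and an average file size per image. The sketch below works these out, assuming the footprint figures are terabytes in decimal units and that the 450k image count is exact.

```python
# Figures quoted on the slide; units assumed to be decimal terabytes.
BACKLOG_IMAGES = 450_000
BEFORE_TB = 20.0
AFTER_TB = 5.5

def mb_per_image(total_tb: float, images: int) -> float:
    """Average megabytes per image for a given total footprint."""
    return total_tb * 1_000_000 / images

overall_ratio = BEFORE_TB / AFTER_TB
saving_pct = (1 - AFTER_TB / BEFORE_TB) * 100

print(f"Overall ratio: {overall_ratio:.1f}:1")                                   # 3.6:1
print(f"Storage saved: {saving_pct:.1f}%")                                       # 72.5%
print(f"Average before: {mb_per_image(BEFORE_TB, BACKLOG_IMAGES):.1f} MB")       # 44.4 MB
print(f"Average after:  {mb_per_image(AFTER_TB, BACKLOG_IMAGES):.1f} MB")        # 12.2 MB
```

The blended ratio of about 3.6:1 is lower than the 6:1 and 10:1 lossy settings, which may reflect the part of the backlog that is kept lossless.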
Quality control for compression
• Visual inspection
• Color shifts, loss of detail, halo effects, pixelation, blurring, etc.
• Collection-based, representative sample
• Test range of compressions with intervals such as 2:1, 4:1, 6:1
• Once artefacts are discovered, step back to previous
compression ratio
• Worst-performing image rules, for any particular collection
• Efficient for homogeneous collections – less so for heterogeneous
collections with wide variety of content
• Archives particularly difficult – black and white compresses very
well – colour drawings and photographs, not so well
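The selection procedure above can be sketched in a few lines. This is an illustration under stated assumptions: the visual inspection is stood in for by a callback, and the file names and the ratios at which artefacts first appear are invented for the example.

```python
from typing import Callable, Iterable, Optional, Sequence

def highest_clean_ratio(ratios: Sequence[float],
                        has_artefacts: Callable[[float], bool]) -> Optional[float]:
    """Test ratios from mildest to strongest and step back to the last
    ratio before artefacts appear. None means even the mildest ratio failed."""
    previous = None
    for ratio in sorted(ratios):
        if has_artefacts(ratio):
            return previous
        previous = ratio
    return previous

def collection_ratio(sample_results: Iterable[Optional[float]]) -> Optional[float]:
    """Worst-performing image rules: use the lowest safe ratio in the sample."""
    results = list(sample_results)
    if not results or any(r is None for r in results):
        return None
    return min(results)

# Hypothetical inspection results: the ratio at which a reviewer first
# saw artefacts in each sample image.
first_artefact_at = {
    "typescript_page.tif": 12,
    "colour_drawing.tif": 6,
    "photograph.tif": 8,
}
test_ratios = [2, 4, 6, 8, 10, 12]

per_image = {
    name: highest_clean_ratio(test_ratios, lambda r, t=threshold: r >= t)
    for name, threshold in first_artefact_at.items()
}
print(per_image)                             # {'typescript_page.tif': 10, 'colour_drawing.tif': 4, 'photograph.tif': 6}
print(collection_ratio(per_image.values()))  # 4
```

Because the worst image sets the ratio for the whole collection, one hard-to-compress colour drawing pulls a mixed collection down to 4:1 even though the typescript page would tolerate 10:1. This is why the approach is less efficient for collections with varied content.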
Establishing the JP2K-UK group
• Unknown who in the UK were using JPEG 2000, or considering it
• Unknown who was even interested in JPEG 2000
• No one wants to work in a vacuum…
• Discovered a high level of interest: British Library, The National
Archives, Oxford, King’s College London, Cambridge and
Southampton Universities, Digital Preservation Coalition,
commercial companies/consultants
• Loose affiliation of the like-minded – a user group
Remit of the JP2K-UK group
• Initial meeting in December 2009
• Everyone had a little knowledge – no one knew enough
• Agreed the need to approach JP2 implementation from
practitioner’s point of view
• “Practitioner” meaning those who manage digital imaging
strategies and implementation
• Agreed need to share information and collaborate
• Discussed ideas for a conference, and creating some guidelines
for the user community
• Wellcome encouraged to write a blog about specific experiences
working with JP2
Outputs
• JPEG 2000 Seminar, held in London in November 2010
> 80 attendees
> UK and European speakers and delegates
> mostly non-technical audience
• Advocacy for practitioner’s needs
> discussing and airing the needs and concerns of
practitioners has influenced software developers, and even the
JPEG Committee
> JPEG 2000 at the Wellcome Library blog
www.jpeg2000wellcomelibrary.blogspot.com
Future plans for JP2K-UK
• Guidance for practitioners
> Human readable
> Focus on practicalities
> Enable practitioners to make informed choices
> Advice on implementation
• Community building
> Case studies
> Lessons learned
> Networking (nationally and internationally)

Jpeg2000 at Wellcome Library

  • 1. JPEG 2000 at the Wellcome Library Christy Henshaw Digitisation Programme Manager – Wellcome Library JP2 Summit 12-13 May 2011 Library of Congress
  • 2. The Wellcome Trust •A global charitable foundation •Achieving extraordinary improvements in human and animal health •Supporting the brightest minds in biomedical research and the medical humanities •Exploring medicine in historical and cultural contexts
  • 4. The Wellcome Library Collections of books, manuscripts, archives, films and pictures on the history of medicine from the earliest times to the present day .
  • 5. The Wellcome Digital Library pilot, 2010-2013 Genetics and its Modern Foundations A new online resource for everyone interested in the history of human and animal health. Aims • build sustainable/expandable mechanism – foundation stone for WDL • digitise key library holdings - relating to a major Trust challenge area • digitise important third party content – linked to theme • use innovative content and tools – to encourage discovery and use • explore commercial partnerships – enhance access to nontheme material
  • 6. JPEG 2000 conversion – scope •Wellcome Images – image library, legacy images, 300,000 images in the archive •Current projects – pilot digitisation projects, 7m images 2010 2014 • Long-term plans – digitisation of large proportion of our collections (mainly special collections), 15m – 25m images 2014 and beyond
  • 7. Type of content •Printed books – early printed books, modern books (monographs), pamphlets, reports •Archives – personal papers, institutional papers, unpublished works, mostly 20th century • Manuscripts – unpublished, handwritten “manuscript books” and related materials, mostly 17th, 18th and 19th century, can be fragile • Artworks – prints, paintings, posters, drawings, glass slides, etc.
  • 9. Books related to genetic research
  • 12. Decision to adopt JP2 JPEG 2000 was found to answer the following needs: • Storage costs – 20/30m TIFFs stored on online, backed-up storage = multiple petabytes. Needed something cost-effective. • Quality – needed a high-quality compressed format that would cover a wide range of content types. • Robustness – needed a well-established image format with a high chance of long-term support. • Practical – feasible to use in a Library digitisation workflow.
  • 13. Finding our way Working with JP2 opened up a whole new world – reading specifications, finding conversion software, so many choices. Commissioned the report: JPEG 2000 as a Preservation and Access Format for the Wellcome Trust Libr Goal: to find a single version of JPEG 2000 that would meet both long-term preservation and flexible delivery needs.
  • 14. The result (Parameter: Setting) • File format: Part 1 (.jp2) • Compression: Lossy (6:1, 10:1) • Tiling: 1024 x 1024 • Progression order: RLCP • Decomp levels: 5 • Quality layers: 8 • Code block size: 6, 64x64 • Regions of interest: No • TLM markers: Yes • Bypass: N/A
  • 15. Embedding JP2 • Chose LuraWave command line tool • Some issues (bugs, or inconvenient implementations) arose, and all have been successfully addressed by LuraTech • Created a firm consensus to use JP2 as the format for all still-image digital imaging (with one or two exceptions) • No plans to use JP2 for digital video – but never say never • Internal information sharing – digital archivists, systems administrators, IT department, programme board members • External communication and networking
  • 16. Current status, future plans • Conversion of all new digital images is now carried out as standard • Nearing the final stages of a project to convert 450k image backlog to JP2 (reducing current footprint from 20 TB to 5.5 TB) • Large projects use lossy JP2, legacy picture library uses lossless • Developed a strategy to determine compression levels • Currently using the GUI, but will use the command line interface with our new workflow system, streamlining conversion and QA • Medium term, will look at automating compression level selection
  • 17. Quality control for compression • Visual inspection • Colour shifts, loss of detail, halo effects, pixelation, blurring, etc. • Collection-based, representative sample • Test range of compressions with intervals such as 2:1, 4:1, 6:1 • Once artefacts are discovered, step back to previous compression ratio • Worst-performing image rules, for any particular collection • Efficient for homogeneous collections – less so for heterogeneous collections with wide variety of content • Archives particularly difficult – black and white compresses very well – colour drawings and photographs, not so well
  • 18. Establishing the JP2K-UK group • Unknown who in the UK were using JPEG 2000, or considering it • Unknown who was even interested in JPEG 2000 • No one wants to work in a vacuum… • Discovered a high level of interest: British Library, The National Archives, Oxford, King’s College London, Cambridge and Southampton Universities, Digital Preservation Coalition, commercial companies/consultants • Loose affiliation of the like-minded – a user group
  • 19. Remit of the JP2K-UK group • Initial meeting in December 2009 • Everyone had a little knowledge – no one knew enough • Agreed the need to approach JP2 implementation from practitioner’s point of view • “Practitioner” meaning those who manage digital imaging strategies and implementation • Agreed need to share information and collaborate • Discussed ideas for a conference, and creating some guidelines for the user community • Wellcome encouraged to write a blog about specific experiences working with JP2
  • 20. Outputs • JPEG 2000 Seminar, held in London in November 2010 > 80 attendees > UK and European speakers and delegates > mostly non-technical audience • Advocacy for practitioner’s needs > discussing and airing the needs and concerns of practitioners has influenced software developers, and even the JPEG Committee > JPEG 2000 at the Wellcome Library blog www.jpeg2000wellcomelibrary.blogspot.com
  • 21. Future plans for JP2K-UK • Guidance for practitioners > Human readable > Focus on practicalities > Enable practitioners to make informed choices > Advice on implementation • Community building > Case studies > Lessons learned > Networking (nationally and internationally)
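
The quality-control procedure on slide 17 (test a range of ratios, step back once artefacts appear, let the worst-performing image decide for the collection) can be sketched roughly as below. This is only an illustration, not the Wellcome Library's actual tooling: the function names are hypothetical, `has_artefacts` stands in for the visual inspection step, and returning 1 when even the lowest ratio shows artefacts is an assumption made for the sketch.

```python
from typing import Callable, Iterable, Sequence


def highest_clean_ratio(
    image: str,
    ratios: Sequence[int],
    has_artefacts: Callable[[str, int], bool],
) -> int:
    """Return the last ratio tested before artefacts were seen.

    `has_artefacts` is a hypothetical stand-in for visual inspection
    (colour shifts, loss of detail, halo effects, and so on).
    Returns 1 if even the lowest ratio shows artefacts (assumption:
    read this as "no acceptable lossy ratio found").
    """
    previous = 1
    for ratio in sorted(ratios):
        if has_artefacts(image, ratio):
            return previous  # step back to the previous ratio
        previous = ratio
    return previous


def collection_ratio(
    sample: Iterable[str],
    ratios: Sequence[int],
    has_artefacts: Callable[[str, int], bool],
) -> int:
    """Worst-performing image rules: take the minimum over the sample."""
    return min(highest_clean_ratio(img, ratios, has_artefacts) for img in sample)
```

With a representative sample in which a black-and-white page survives every tested ratio but a colour drawing shows artefacts at 6:1, the collection as a whole would be assigned 4:1. That is why the approach is efficient for homogeneous collections and less so for mixed archives.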