SlideShare a Scribd company logo
1 of 35
Download to read offline
Getty Common Image Service
Research & Design Report
Stefano Cossu, Software Architect
J. P. Getty Trust <scossu@getty.edu>
About Getty Digital (GDI)
~2 year-old department
Very active evolution
Created to consolidate IT services for all Getty programs:
Hardware & networking infrastructure
Software development
Information management
Information access & security
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 2
GDI's Grand Plan for IIIF
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 3
1. Align with GDI mission
One infrastructure to serve images from all Getty programs:
Image delivery services
Metadata (IIIF Presentation) services
Access policies
Discovery services
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 4
2. Start Simple
~150K images
Edward Ruscha's Streets of Los Angeles (GRI)
Museum Collection Open Content images
All media cleared for open access (i.e. no auth)
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 5
3. Prepare to Grow Inde nitely
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 6
3. Prepare to Grow Inde nitely
50M images
A/V media
Access control
Discovery
Annotations
ID management
ETL pipelines
…
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 7
The Report
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 8
Why?
Explore all options (easier with no production data!)
Scienti c approach to a challenging project
Stimulate discourse over choices within Getty and the IIIF
community
Possibly improve areas that we nd in need of improvement
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 9
Areas Covered
Source image format
Image encoding (compression)
Image Server
Ancillary tools (caching, ETL, etc.)
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 10
1+2. Source Image Format & Encoding
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 11
Source Image Benchmark
Criteria
Compatibility with selected image servers
Decoding speed
Image size
Encoding speed
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 12
Source Image Benchmark
Methodology
Run conversion on a batch of sample images with different color
topologies and geometries
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 13
Source Image Benchmark
Format: Pyramidal TIFF (PTIFF) our pick
Very fast decoding
Established standard
Flexible compression options & tools
Limited IIIF server support
Lots of manual tweaking
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 14
Source Image Benchmark
Format: JPEG 2000 (JP2)
Very fast encoding (with Kakadu)
Slower decoding
Depends on proprietary software for decent performance
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 15
Source Image Benchmark
Encoding: JPEG our pick
very space- & CPU-ef cient
A classic…
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 16
Source Image Benchmark
Encoding: WebP
Higher image quality than JPEG for the same storage size
Limited support
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 17
Source Image Benchmark
Encoding: Lossless (LZW, LZMA, ZIP, etc.)
Highest image quality
CPU-intensive
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 18
Source Image Benchmark
Encoding: Uncompressed Data
Highest image quality
Lowest CPU usage
Not viable for the data volumes handled
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 19
3. Image Server
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 20
Image Server Benchmark
Criteria
Stability
Decoding speed
Support
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 21
Image Server Benchmark
Methodology
HTTP load test using Locust on several "axes":
Server software (in Docker containers)
Source image size (<10 Mp, 10÷75 Mp, >75 Mp)
Derivative size and type (region, full)
Number of concurrent requests (10, 100, 1000)
All caches turned off
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 22
Image Server Benchmark
IIPImage our pick
Most well-established software
Fastest processing by far
Most reliable delivery (0% failure rate)
Smallest resource footprint
Version 1.1 is out!
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 23
Image Server Benchmark
Cantaloupe
Fast (not quite as IIPImage)
Reliable (not quite as IIPImage)
High resource usage
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 24
Image Server Benchmark
Loris
Customizable (GDI is a Python shop)
Very high failure rate under stress
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 25
Outcome of Locust tests
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 26
Outcome of Locust tests
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 27
Load averages for iipsrv with 10, 100, 1000 connections
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 28
4. Other Components (Custom Built)
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 29
Gateway Service
Provides a point of entry & exit for several request handling services:
API versioning
Caching
ID management and redirect service
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 30
ETL Pipeline
Provides batch conversion of original images to PTIFFs and source
system metadata to IIIF manifests.
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 31
The Stuff
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 32
INFO The report has a Getty-centric scope, however some outcomes
could be useful to the general public.
WARNING The Getty does not intend to maintain the software used for
the report in the long term.
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 33
Report Contents
Google Drive Folder containing:
Report (PDF)
Docker containers with server setups
Reference data set
Benchmark tools
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 34
Thank You.
Questions? scossu@getty.edu
Discussion? iiif-discuss@googlegroups.com
Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 35

More Related Content

Similar to Scossu gdi iiif_r+d_report_2019

Overview of Selected Current MPEG Activities
Overview of Selected Current MPEG ActivitiesOverview of Selected Current MPEG Activities
Overview of Selected Current MPEG ActivitiesAlpen-Adria-Universität
 
Overview of Selected Current MPEG Activities
Overview of Selected Current MPEG ActivitiesOverview of Selected Current MPEG Activities
Overview of Selected Current MPEG ActivitiesAlpen-Adria-Universität
 
Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...
Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...
Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...hacktivity
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareKNIMESlides
 
Resolving Platform Specific Models at runtime using an MDE-based Trading appr...
Resolving Platform Specific Models at runtime using an MDE-based Trading appr...Resolving Platform Specific Models at runtime using an MDE-based Trading appr...
Resolving Platform Specific Models at runtime using an MDE-based Trading appr...Applied Computing Group
 
Lessons from Analyzing All of Europe's Data
Lessons from Analyzing All of Europe's DataLessons from Analyzing All of Europe's Data
Lessons from Analyzing All of Europe's DataInspirient
 
Cloud-based dynamic distributed optimisation of integrated process planning a...
Cloud-based dynamic distributed optimisation of integrated process planning a...Cloud-based dynamic distributed optimisation of integrated process planning a...
Cloud-based dynamic distributed optimisation of integrated process planning a...Piotr Dziurzanski
 
Getting more out of Matplotlib with GR
Getting more out of Matplotlib with GRGetting more out of Matplotlib with GR
Getting more out of Matplotlib with GRJosef Heinen
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH ModernizationTrivadis
 
Imagically Image Forensic Tool
Imagically Image Forensic ToolImagically Image Forensic Tool
Imagically Image Forensic Toolijtsrd
 
Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4
Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4
Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4Inspirient
 
CWIN17 India / g analyzer-presentation - daya kiran melam
CWIN17 India / g analyzer-presentation - daya kiran melamCWIN17 India / g analyzer-presentation - daya kiran melam
CWIN17 India / g analyzer-presentation - daya kiran melamCapgemini
 
Data-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process ManagementData-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process Management Andreas Metzger
 
Organizational memory in the Process Industry
Organizational memory in the Process  IndustryOrganizational memory in the Process  Industry
Organizational memory in the Process IndustryGlen Alleman
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Hong-Linh Truong
 
IBM Z for the Digital Enterprise - IBM Z Software Keynote
IBM Z for the Digital Enterprise - IBM Z Software KeynoteIBM Z for the Digital Enterprise - IBM Z Software Keynote
IBM Z for the Digital Enterprise - IBM Z Software KeynoteDevOps for Enterprise Systems
 
Insights beyond Human Intuition: Comprehensively Mining Survey Data
Insights beyond Human Intuition: Comprehensively Mining Survey DataInsights beyond Human Intuition: Comprehensively Mining Survey Data
Insights beyond Human Intuition: Comprehensively Mining Survey DataInspirient
 
Miplm Case Study Smart Production
Miplm Case Study Smart ProductionMiplm Case Study Smart Production
Miplm Case Study Smart ProductionMIPLM
 

Similar to Scossu gdi iiif_r+d_report_2019 (20)

Overview of Selected Current MPEG Activities
Overview of Selected Current MPEG ActivitiesOverview of Selected Current MPEG Activities
Overview of Selected Current MPEG Activities
 
Overview of Selected Current MPEG Activities
Overview of Selected Current MPEG ActivitiesOverview of Selected Current MPEG Activities
Overview of Selected Current MPEG Activities
 
Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...
Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...
Matthias Deeg - Bypassing an Enterprise-Grade Biometric Face Authentication S...
 
The Reachout Methodology
The Reachout Methodology The Reachout Methodology
The Reachout Methodology
 
Open Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME SoftwareOpen Source Story and what’s new in KNIME Software
Open Source Story and what’s new in KNIME Software
 
Resolving Platform Specific Models at runtime using an MDE-based Trading appr...
Resolving Platform Specific Models at runtime using an MDE-based Trading appr...Resolving Platform Specific Models at runtime using an MDE-based Trading appr...
Resolving Platform Specific Models at runtime using an MDE-based Trading appr...
 
Lessons from Analyzing All of Europe's Data
Lessons from Analyzing All of Europe's DataLessons from Analyzing All of Europe's Data
Lessons from Analyzing All of Europe's Data
 
Cloud-based dynamic distributed optimisation of integrated process planning a...
Cloud-based dynamic distributed optimisation of integrated process planning a...Cloud-based dynamic distributed optimisation of integrated process planning a...
Cloud-based dynamic distributed optimisation of integrated process planning a...
 
Getting more out of Matplotlib with GR
Getting more out of Matplotlib with GRGetting more out of Matplotlib with GR
Getting more out of Matplotlib with GR
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH Modernization
 
Imagically Image Forensic Tool
Imagically Image Forensic ToolImagically Image Forensic Tool
Imagically Image Forensic Tool
 
Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4
Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4
Trustworthy Analytics with Generative AI: Four Use Cases for ChatGPT / GPT-4
 
CWIN17 India / g analyzer-presentation - daya kiran melam
CWIN17 India / g analyzer-presentation - daya kiran melamCWIN17 India / g analyzer-presentation - daya kiran melam
CWIN17 India / g analyzer-presentation - daya kiran melam
 
Data-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process ManagementData-driven Deep Learning for Proactive Terminal Process Management
Data-driven Deep Learning for Proactive Terminal Process Management
 
Goobi
GoobiGoobi
Goobi
 
Organizational memory in the Process Industry
Organizational memory in the Process  IndustryOrganizational memory in the Process  Industry
Organizational memory in the Process Industry
 
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
 
IBM Z for the Digital Enterprise - IBM Z Software Keynote
IBM Z for the Digital Enterprise - IBM Z Software KeynoteIBM Z for the Digital Enterprise - IBM Z Software Keynote
IBM Z for the Digital Enterprise - IBM Z Software Keynote
 
Insights beyond Human Intuition: Comprehensively Mining Survey Data
Insights beyond Human Intuition: Comprehensively Mining Survey DataInsights beyond Human Intuition: Comprehensively Mining Survey Data
Insights beyond Human Intuition: Comprehensively Mining Survey Data
 
Miplm Case Study Smart Production
Miplm Case Study Smart ProductionMiplm Case Study Smart Production
Miplm Case Study Smart Production
 

More from Stefano Cossu

Stefano_Cossu_OR23_deck.pdf
Stefano_Cossu_OR23_deck.pdfStefano_Cossu_OR23_deck.pdf
Stefano_Cossu_OR23_deck.pdfStefano Cossu
 
The Oxford Common File Layout
The Oxford Common File LayoutThe Oxford Common File Layout
The Oxford Common File LayoutStefano Cossu
 
Brace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, PittsburghBrace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, PittsburghStefano Cossu
 
IIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & TacticsIIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & TacticsStefano Cossu
 
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018 Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018 Stefano Cossu
 
Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018Stefano Cossu
 
Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017Stefano Cossu
 
A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016Stefano Cossu
 
Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015Stefano Cossu
 
AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015Stefano Cossu
 
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...Stefano Cossu
 

More from Stefano Cossu (12)

Stefano_Cossu_OR23_deck.pdf
Stefano_Cossu_OR23_deck.pdfStefano_Cossu_OR23_deck.pdf
Stefano_Cossu_OR23_deck.pdf
 
The Oxford Common File Layout
The Oxford Common File LayoutThe Oxford Common File Layout
The Oxford Common File Layout
 
Brace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, PittsburghBrace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
Brace yourselves, the Archives are Coming – Code4Lib 2020, Pittsburgh
 
Behind 12 sunsets
Behind 12 sunsetsBehind 12 sunsets
Behind 12 sunsets
 
IIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & TacticsIIIF at the Getty: Vision & Tactics
IIIF at the Getty: Vision & Tactics
 
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018 Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
Reconciliation is a Necessity – IIIF Meeting, Edinburgh 2018
 
Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018Labours of Love & Convenience - Open Repositories 2018
Labours of Love & Convenience - Open Repositories 2018
 
Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017Cossu ford the_lake_experience_mw2017
Cossu ford the_lake_experience_mw2017
 
A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016A Little Sweat Goes A Long Way - Museums and The Web 2016
A Little Sweat Goes A Long Way - Museums and The Web 2016
 
Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015Libraries, Archives, Museums discussion - MCN 2015
Libraries, Archives, Museums discussion - MCN 2015
 
AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015AIC Linked Open Data panel Museums and the Web 2015
AIC Linked Open Data panel Museums and the Web 2015
 
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
Stefano Cossu, The Art Institute of Chicago - Open Repositories 2014 presenta...
 

Recently uploaded

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)Samir Dash
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 

Recently uploaded (20)

Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 

Scossu gdi iiif_r+d_report_2019

  • 1. Getty Common Image Service Research & Design Report Stefano Cossu, Software Architect J. P. Getty Trust <scossu@getty.edu>
  • 2. About Getty Digital (GDI) ~2 year-old department Very active evolution Created to consolidate IT services for all Getty programs: Hardware & networking infrastructure Software development Information management Information access & security Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 2
  • 3. GDI's Grand Plan for IIIF Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 3
  • 4. 1. Align with GDI mission One infrastructure to serve images from all Getty programs: Image delivery services Metadata (IIIF Presentation) services Access policies Discovery services Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 4
  • 5. 2. Start Simple ~150K images Edward Ruscha's Streets of Los Angeles (GRI) Museum Collection Open Content images All media cleared for open access (i.e. no auth) Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 5
  • 6. 3. Prepare to Grow Inde nitely Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 6
  • 7. 3. Prepare to Grow Inde nitely 50M images A/V media Access control Discovery Annotations ID management ETL pipelines … Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 7
  • 8. The Report Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 8
  • 9. Why? Explore all options (easier with no production data!) Scienti c approach to a challenging project Stimulate discourse over choices within Getty and the IIIF community Possibly improve areas that we nd in need of improvement Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 9
  • 10. Areas Covered Source image format Image encoding (compression) Image Server Ancillary tools (caching, ETL, etc.) Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 10
  • 11. 1+2. Source Image Format & Encoding Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 11
  • 12. Source Image Benchmark Criteria Compatibility with selected image servers Decoding speed Image size Encoding speed Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 12
  • 13. Source Image Benchmark Methodology Run conversion on a batch of sample images with different color topologies and geometries Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 13
  • 14. Source Image Benchmark Format: Pyramidal TIFF (PTIFF) our pick Very fast decoding Established standard Flexible compression options & tools Limited IIIF server support Lots of manual tweaking Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 14
  • 15. Source Image Benchmark Format: JPEG 2000 (JP2) Very fast encoding (with Kakadu) Slower decoding Depends on proprietary software for decent performance Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 15
  • 16. Source Image Benchmark Encoding: JPEG our pick very space- & CPU-ef cient A classic… Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 16
  • 17. Source Image Benchmark Encoding: WebP Higher image quality than JPEG for the same storage size Limited support Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 17
  • 18. Source Image Benchmark Encoding: Lossless (LZW, LZMA, ZIP, etc.) Highest image quality CPU-intensive Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 18
  • 19. Source Image Benchmark Encoding: Uncompressed Data Highest image quality Lowest CPU usage Not viable for the data volumes handled Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 19
  • 20. 3. Image Server Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 20
  • 21. Image Server Benchmark Criteria Stability Decoding speed Support Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 21
  • 22. Image Server Benchmark Methodology HTTP load test using Locust on several "axes": Server software (in Docker containers) Source image size (<10 Mp, 10÷75 Mp, >75 Mp) Derivative size and type (region, full) Number of concurrent requests (10, 100, 1000) All caches turned off Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 22
  • 23. Image Server Benchmark IIPImage our pick Most well-established software Fastest processing by far Most reliable delivery (0% failure rate) Smallest resource footprint Version 1.1 is out! Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 23
  • 24. Image Server Benchmark Cantaloupe Fast (not quite as IIPImage) Reliable (not quite as IIPImage) High resource usage Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 24
  • 25. Image Server Benchmark Loris Customizable (GDI is a Python shop) Very high failure rate under stress Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 25
  • 26. Outcome of Locust tests Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 26
  • 27. Outcome of Locust tests Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 27
  • 28. Load averages for iipsrv with 10, 100, 1000 connections Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 28
  • 29. 4. Other Components (Custom Built) Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 29
  • 30. Gateway Service Provides a point of entry & exit for several request handling services: API versioning Caching ID management and redirect service Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 30
  • 31. ETL Pipeline Provides batch conversion of original images to PTIFFs and source system metadata to IIIF manifests. Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 31
  • 32. The Stuff Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 32
  • 33. INFO The report has a Getty-centric scope, however some outcomes could be useful to the general public. WARNING The Getty does not intend to maintain the software used for the report in the long term. Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 33
  • 34. Report Contents Google Drive Folder containing: Report (PDF) Docker containers with server setups Reference data set Benchmark tools Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 34
  • 35. Thank You. Questions? scossu@getty.edu Discussion? iiif-discuss@googlegroups.com Getty Common Image Service R&D Report—IIIF Meeting, Göttingen, Germany, June 2019 35