Lazy distribution of
container images
Current implementation status of containerd remote snapshotter
Akihiro Suda
FOSDEM (February 1, 2020)
Credit to Kohei Tokunaga (NTT) for containerd impl. & benchmark scripts
• Run containers before the image download completes
• Many alternative image formats have been proposed
to support this
• stargz is getting wide adoption
(containerd & Podman)
Summary
Demo:
Lazy distribution of
docker.io/library/python:3.7
The problems of the current
Docker / OCI format
• Open Container Initiative (OCI) defines the standard
specifications for containers
– Docker/Moby, Podman, Kubernetes (containerd, CRI-O, …), Singularity…
• OCI Image Spec: defines the tarball structure and the JSON
metadata format
– Based on Docker Image Manifest V2 Schema 2
• OCI Distribution Spec: defines the API for distributing images
via HTTP
– Based on Docker Registry HTTP API
• Focuses on legacy rather than on innovation ☹
Current Docker / OCI format
• Appeared in the 1970s
• Originally designed for
magnetic tapes
• No random access
TAR: Tape ARchiver
https://en.wikipedia.org/wiki/PDP-11
• Without scanning the whole "tape",
file metadata cannot be listed
→ Can't be mounted as a filesystem
Problem 1: Requires scanning the whole "tape"
[Figure: tar archive layout: Metadata 0 (file name, permission, ...), File 0 (content), Metadata 1, File 1, ..., Metadata {n-1}, File {n-1}, terminal zero bytes]
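The sequential layout can be illustrated with a minimal sketch, assuming an uncompressed USTAR archive built in memory. The helper names build_tar and list_entries are hypothetical; the header field positions (name at bytes 0-99, octal size at bytes 124-135) are the standard USTAR ones. The point is that the offset of each header is only known after the size field of the previous header has been read.

```python
import io
import tarfile

BLOCK = 512  # tar headers and file data are aligned to 512-byte blocks


def build_tar(files):
    """Build an uncompressed tar archive in memory from {name: bytes}."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w", format=tarfile.USTAR_FORMAT) as tar:
        for name, data in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def list_entries(raw):
    """Return (name, header_offset, size) for every entry, walking from byte 0."""
    entries = []
    offset = 0
    while offset + BLOCK <= len(raw):
        header = raw[offset:offset + BLOCK]
        if header == b"\0" * BLOCK:  # terminal zero bytes
            break
        name = header[0:100].rstrip(b"\0").decode()
        size = int(header[124:136].rstrip(b"\0 "), 8)  # octal size field
        entries.append((name, offset, size))
        # The next header starts only after this file's padded content.
        offset += BLOCK + ((size + BLOCK - 1) // BLOCK) * BLOCK
    return entries


archive = build_tar({"a.txt": b"hello", "b.txt": b"x" * 600})
for entry in list_entries(archive):
    print(entry)
```

Listing the last entry requires visiting every earlier header, which is why a plain tar archive cannot be mounted without a full scan.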
• Can an external index file solve the problem?
→ No, because a gzip stream cannot be seeked
(discussed later)
Problem 1: Requires scanning the whole "tape"
[Figure: the same tar archive layout, alongside an external index file holding Metadata 0, Metadata 1, …, Metadata {n-1}]
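Why an index alone does not help for tar.gz can be shown with a small sketch using Python's zlib module. The helper name decompress_from is hypothetical. Decoding a single gzip stream works from byte 0, but starting at an arbitrary offset fails because the header is missing and the compressed data depends on what came before it.

```python
import gzip
import zlib

data = b"".join(b"line %d\n" % i for i in range(1000))
blob = gzip.compress(data)  # one gzip stream, like a legacy tar.gz layer


def decompress_from(blob, offset):
    """Try to decode a gzip stream starting at the given byte offset."""
    try:
        # wbits=31 tells zlib to expect gzip framing.
        return zlib.decompressobj(31).decompress(blob[offset:])
    except zlib.error:
        return None


print(decompress_from(blob, 0) == data)       # decoding from the start
print(decompress_from(blob, len(blob) // 2))  # decoding from the middle
```

In this sketch the second call returns None, so even a perfect index of uncompressed offsets would not let a client jump into the compressed stream.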
• A registry might contain very similar images
– Different versions
– Different architectures
– Different configuration files
• Tarballs of these images are likely to waste storage on
identical or similar files
• But this is not a serious issue if you have enough budget for
cloud storage
Problem 2: No deduplication
1. Requires scanning the whole "tape"
2. No deduplication
Problems of Docker / OCI image format
https://en.wikipedia.org/wiki/Magnetic_tape
Problem 1 is the main focus for lazy distribution
• “pulling packages accounts for 76% of container start time, but
only 6.4% of that data is read.”
– Harter, Tyler, et al. "Slacker: Fast Distribution with Lazy Docker
Containers." FAST 2016
Why do we want lazy distribution?
• “dev stage” images of multi-stage Dockerfiles
– No need to consider tolerance against remote registry failures
(because `RUN apt-get install` instructions are already flaky anyway)
Expected use-cases
FROM example.com/heavy-dev-env:lazy AS dev
RUN apt-get update && \
    apt-get install -y some-additional-libs
COPY src .
RUN ./configure && \
    make static && \
    cp bin/foo /foo
# the stage switches here
FROM scratch
COPY --from=dev /foo /foo
# exec form, because scratch has no shell
ENTRYPOINT ["/foo"]
• Other use-cases are also valid, but mind fault tolerance
(until the image gets 100% cached locally)
– Kubernetes readinessProbe
• FaaS
• Web apps with a huge number of HTML files and graphic files
• Jupyter Notebooks with big data samples included
• Full GNOME/KDE desktop
– Will 2020 be the year of the containerized Linux desktop?
Expected use-cases
Our first attempt (2017)
… and post-mortem
• No tarballs
• Composed of a protobuf index file (continuity manifest) +
content-addressable blob files
Our first attempt: FILEgrain (2017)
message Metadata {
repeated string path;
int64 uid;
int64 gid;
uint32 mode;
uint64 size;
repeated string sha256Digest;
...
}
[Figure: the index file (Metadata 0, Metadata 1, …, Metadata {n-1}) references blob files such as blobs/sha256/deadbeef… and blobs/sha256/cafebabe…]
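The index-plus-blobs idea can be sketched in a few lines, assuming an in-memory dict as the blob store. This is an illustration of content addressing in general, not FILEgrain's actual code; add_image, the file paths, and the file contents are hypothetical. Files with identical content map to the same blob key, so they are stored only once across images.

```python
import hashlib


def add_image(store, files):
    """Store each file under its SHA-256 digest; return a path -> blob key manifest."""
    manifest = {}
    for path, content in files.items():
        key = "blobs/sha256/" + hashlib.sha256(content).hexdigest()
        store[key] = content  # identical content overwrites itself: no duplicate
        manifest[path] = key
    return manifest


store = {}
v1 = add_image(store, {"/bin/app": b"binary-v1", "/etc/app.conf": b"debug=false"})
v2 = add_image(store, {"/bin/app": b"binary-v2", "/etc/app.conf": b"debug=false"})
print(len(store))  # number of distinct blobs kept for the two images
```

The cost of this granularity is one fetch per blob, which is the HTTP request overhead listed in the post-mortem.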
• Incompatibility with legacy tarballs
• Chicken-and-egg: hard to finalize the spec when no
implementation exists; hard to promote an implementation when
the spec is not finalized
• Use-cases were unclear; didn’t need to focus on deduplication
• Performance overhead due to huge numbers of HTTP requests
for reading small files
FILEgrain post-mortem
The solution in 2020: stargz
• Proposed by Brad Fitzpatrick (Google, at that time)
for accelerating the CI of the Go language project
• No focus on data deduplication
stargz: seekable tar.gz
[Figure: legacy tar.gz vs. stargz. Legacy tar.gz: Metadata 0, File 0, Metadata 1, File 1, ..., Metadata {n-1}, File {n-1} and the terminal zero bytes in a single gzip stream. stargz: each Metadata i + File i pair in its own gzip stream, followed by a gzipped stargz.index.json entry (Metadata 0…{n-1}) with its own tar metadata, the terminal zero bytes, and an empty gzip stream.]
• Fully compatible with legacy tar.gz
• But contains an extra “stargz.index.json” entry
stargz: seekable tar.gz
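The compatibility claim rests on a property of gzip: concatenated gzip streams still form valid gzip data. A minimal sketch, assuming raw file bytes instead of real tar entries and a plain dict instead of stargz.index.json (pack and read_one are hypothetical names, and this is not the stargz on-disk format):

```python
import gzip


def pack(files):
    """Compress each file as its own gzip member; return (blob, index)."""
    blob = bytearray()
    index = {}
    for name, data in files.items():
        member = gzip.compress(data)
        index[name] = (len(blob), len(member))  # (offset, compressed length)
        blob += member
    return bytes(blob), index


def read_one(blob, index, name):
    """Decompress a single file without touching the other members."""
    offset, length = index[name]
    return gzip.decompress(blob[offset:offset + length])


files = {"/bin/ls": b"ls-binary", "/app.py": b"print('hi')\n"}
blob, index = pack(files)
print(read_one(blob, index, "/app.py"))
# A legacy reader sees one ordinary gzip file containing everything.
print(gzip.decompress(blob) == b"".join(files.values()))
```

The per-member index is what makes random access possible, while a reader that ignores the index still decodes the whole blob as before.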
• Only stargz.index.json is required for mounting the image
• Actual files in the archive can be fetched on demand
(when HTTP Range Requests are supported)
stargz: seekable tar.gz
[Figure: the stargz layout shown earlier; the gzip header of the trailing empty stream contains a pointer to stargz.index.json]
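Fetching one file on demand then comes down to an HTTP Range request for the byte span recorded in the index. A minimal sketch with Python's urllib; the URL, offset, and length are hypothetical, and the request is only constructed, not sent:

```python
import urllib.request


def range_request(url, offset, length):
    """Build a GET request for bytes [offset, offset + length) of a blob."""
    req = urllib.request.Request(url)
    # HTTP byte ranges are inclusive on both ends.
    req.add_header("Range", "bytes=%d-%d" % (offset, offset + length - 1))
    return req


# Hypothetical blob URL and index entry (offset 100, length 50).
req = range_request("https://registry.example.com/blob", 100, 50)
print(req.get_header("Range"))
```

A server that supports range requests answers with 206 Partial Content and only the requested bytes; a server that does not may return the whole blob, so a client has to check the status code.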
• containerd: https://github.com/ktock/stargz-snapshotter
– By Kohei Tokunaga (NTT)
– Implemented as a containerd snapshotter plugin
– stargz archives are mounted as read-only FUSE filesystems
– OverlayFS is used to support writes
– Supports more aggressive optimization (discussed later)
• Podman: https://github.com/giuseppe/crfs-plugin
– By Giuseppe Scrivano (Red Hat)
– Implemented as a fuse-overlayfs plugin
stargz adoption in the ecosystem
• Profiles actual file access patterns by running an equivalent of
docker run
– Future: static analysis using ldd(-ish)? Machine learning?
• Reorders file entries in the archive so that relevant files can be
prefetched in a single HTTP request
stargz optimizer for containerd
Before optimization:
/usr/bin/apt-get
/bin/ls
/bin/vi
/lib/libc.so
/lib/libjpeg.so
/usr/bin/python3
...
/usr/lib/python3/.../foo
/usr/lib/python3/.../bar
/app.py
After optimization:
/bin/ls
/app.py
/usr/bin/python3
/lib/libc.so
/usr/lib/python3/.../foo
/usr/lib/python3/.../bar
...
/bin/vi
/lib/libjpeg.so
/usr/bin/apt-get
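The reordering step can be sketched as follows, assuming the profiler yields a list of paths in the order they were first accessed. The function name reorder is hypothetical and this is not the optimizer's actual code; the paths are the ones from the slide, with the elided parts kept as literal "...".

```python
def reorder(archive_order, accessed):
    """Move accessed files to the front, in first-access order.

    Returns the new entry order and the number of leading entries
    that are candidates for prefetching.
    """
    known = set(archive_order)
    hot = []
    for path in accessed:
        if path in known and path not in hot:
            hot.append(path)
    cold = [p for p in archive_order if p not in hot]  # original order kept
    return hot + cold, len(hot)


archive_order = [
    "/usr/bin/apt-get", "/bin/ls", "/bin/vi", "/lib/libc.so",
    "/lib/libjpeg.so", "/usr/bin/python3",
    "/usr/lib/python3/.../foo", "/usr/lib/python3/.../bar", "/app.py",
]
accessed = [
    "/bin/ls", "/app.py", "/usr/bin/python3", "/lib/libc.so",
    "/usr/lib/python3/.../foo", "/usr/lib/python3/.../bar",
]
new_order, prefetch_count = reorder(archive_order, accessed)
print(new_order[:prefetch_count])
```

Because the hot entries end up contiguous at the start of the archive, they cover one byte range and can be requested together.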
• Registry: Docker Hub (docker.io)
• containerd host location: EC2 Oregon
• Benchmark: execute typical base images with
“compile hello world” command
Benchmark results
[Figures: benchmark result charts]
• Impl: Parallelize HTTP operations across image layers
– https://github.com/ktock/stargz-snapshotter/issues/37
• Spec: Use zstd instead of gzip (“starzstd”?)
– Proposed by Giuseppe
https://github.com/golang/go/issues/30829#issuecomment-541532402
– Suitable for images with many small files
– Not compatible with OCI Image Spec v1.0.1
– Compatible with OCI Image Spec v.Next
More optimizations are to come
• BuildKit: modern OCI image builder
– Concurrent execution
– Efficient caching
– Rootless
– (pseudo-)daemonless
– Clustering on Kubernetes
– And a lot of innovative features
• stargz support is planned; stay tuned!
– Producing stargz images
– Consuming stargz images as base images
stargz integration for BuildKit
• CernVM-FS
– Not compatible with OCI tarballs
– Already widely deployed at CERN and friends
– Implementation available for containerd:
https://github.com/ktock/remote-snapshotter/pull/27
• Unofficial “OCI v2”
– Proposed by Aleksa Sarai (SUSE)
– Not compatible with OCI v1 tarballs
– Focuses on deduplication, using the Restic algorithm
– WIP implementation available for umoci (image manipulation tool):
https://github.com/openSUSE/umoci/tree/experimental/ociv2
– No runtime implementation seems to exist
Other post-OCI formats
• IPCS
– Proposed by Edgar Lee (Netflix)
– Built on the IPFS (P2P CAS) protocol
– Not compatible with OCI tarballs
– Implementation available for containerd:
https://github.com/hinshun/ipcs
• Azure Container Registry “Project Teleport”
– Built on the SMB protocol and VHD images
– Not FLOSS
Other post-OCI formats
• Many alternative image formats have been proposed for lazy
distribution, but compatibility matters
• stargz is getting wide adoption (containerd & Podman)
• containerd supports sort+prefetch optimization for stargz
https://github.com/ktock/stargz-snapshotter
Recap
• Valid & invalid use cases?
• More efficient optimization techniques?
• Issues/PRs are welcome at
https://github.com/ktock/stargz-snapshotter
(Expected to be moved under github.com/containerd soon)
Request for comments

Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Recently uploaded (20)

Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Lazy distribution of container images with stargz

  • 1. Lazy distribution of container images: current implementation status of the containerd remote snapshotter. Akihiro Suda, FOSDEM (February 1, 2020). Credit to Kohei Tokunaga (NTT) for the containerd implementation and benchmark scripts.
  • 2. Summary • Run containers before the image download completes • Many alternative image formats have been proposed to support this • stargz is gaining wide adoption (containerd & Podman)
  • 4. The problems of the current Docker / OCI format
  • 5. Current Docker / OCI format • The Open Container Initiative (OCI) defines the standard specifications for containers – Docker/Moby, Podman, Kubernetes (containerd, CRI-O, …), Singularity… • OCI Image Spec: defines the tarball structure and the JSON metadata format – Based on Docker Image Manifest V2 Schema 2 • OCI Distribution Spec: defines the API for distributing images via HTTP – Based on Docker Registry HTTP API • Focuses on legacy rather than on innovation ☹
  • 6. TAR: Tape ARchiver • Appeared in the 1970s • Originally designed for magnetic tapes • No random access • https://en.wikipedia.org/wiki/PDP-11
  • 7. Problem 1: Requires scanning the whole "tape" • Without scanning the whole "tape", file metadata cannot be listed → Can't be mounted as a filesystem • Tar layout: Metadata 0 (file name, permission, ...), File 0 (content), Metadata 1, File 1, ..., Metadata {n-1}, File {n-1}, terminal zero bytes
  • 8. Problem 1: Requires scanning the whole "tape" • Can an external index file solve the problem? → No, because gzip is not seekable (discussed later) • Tar layout: Metadata 0, File 0, Metadata 1, File 1, ..., Metadata {n-1}, File {n-1}, terminal zero bytes • Index file: Metadata 0, Metadata 1, …, Metadata {n-1}
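Problem 1 can be illustrated with a minimal Python sketch that is not part of the talk. It uses only the standard-library `tarfile` module; the helper names `build_tar` and `list_entries` are hypothetical. The position of each header depends on the sizes of all preceding files, so listing the entries means walking the archive from the start.

```python
import io
import tarfile


def build_tar(files):
    """Return an uncompressed tar archive (bytes) for a name -> bytes mapping."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w", format=tarfile.USTAR_FORMAT) as tar:
        for name, data in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def list_entries(archive):
    """Collect (name, header_offset, size) by visiting every header in order."""
    entries = []
    with tarfile.open(fileobj=io.BytesIO(archive), mode="r:") as tar:
        for member in tar:
            entries.append((member.name, member.offset, member.size))
    return entries


archive = build_tar({"a.txt": b"x" * 700, "b.txt": b"y" * 10})
entries = list_entries(archive)
print(entries)  # [('a.txt', 0, 700), ('b.txt', 1536, 10)]
```

The second header sits at byte 1536 because the first entry occupies one 512-byte header plus 700 bytes of content padded to 1024 bytes. Nothing in the archive records that offset up front, which is why an index is needed for random access.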
  • 9. Problem 2: No deduplication • A registry might contain very similar images – Different versions – Different architectures – Different configuration files • Tarballs of these images are likely to waste storage on identical or similar files • But this is not a serious issue when you have enough budget for cloud storage
  • 10. Problems of the Docker / OCI image format • 1. Requires scanning the whole "tape" (the main focus for lazy distribution) • 2. No deduplication • https://en.wikipedia.org/wiki/Magnetic_tape
  • 11. Why do we want lazy distribution? • “pulling packages accounts for 76% of container start time, but only 6.4% of that data is read.” – Harter, Tyler, et al. "Slacker: Fast Distribution with Lazy Docker Containers." FAST 2016
  • 12. Expected use-cases • “dev stage” images of multi-stage Dockerfiles – No need to consider tolerance against remote registry failures (because `RUN apt-get install` instructions are already flaky anyway)
      FROM example.com/heavy-dev-env:lazy AS dev
      RUN apt-get update && apt-get install -y some-additional-libs
      COPY src .
      RUN ./configure && make static && cp bin/foo /foo
      # the stage switches here
      FROM scratch
      COPY --from=dev /foo /foo
      # exec form, because the scratch image has no shell for the shell form to use
      ENTRYPOINT ["/foo"]
  • 13. Expected use-cases • Other use-cases are also valid, but mind fault tolerance (until the image is 100% cached locally) – Kubernetes readinessProbe • FaaS • Web apps with a huge number of HTML and image files • Jupyter Notebooks with big data samples included • Full GNOME/KDE desktop – Will 2020 be the year of the containerized Linux desktop?
  • 15. Our first attempt (2017) … and post-mortem
  • 17. Our first attempt: FILEgrain (2017) • No tarballs • Composed of a protobuf index file (continuity manifest) + content-addressable blob files (blobs/sha256/deadbeef…, blobs/sha256/cafebabe…) • Index file: Metadata 0, Metadata 1, …, Metadata {n-1}, where each entry is:
      message Metadata {
        repeated string path;
        int64 uid;
        int64 gid;
        uint32 mode;
        uint64 size;
        repeated string sha256Digest;
        ...
      }
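The content-addressable layout can be sketched in a few lines of Python. This is an illustration only, not FILEgrain's actual code: the helper names `blob_path` and `store` are hypothetical, and the sketch assumes one digest per file, whereas the slide's `sha256Digest` field is repeated.

```python
import hashlib


def blob_path(content: bytes) -> str:
    """Name a blob after the SHA-256 digest of its content."""
    return "blobs/sha256/" + hashlib.sha256(content).hexdigest()


def store(files):
    """Split a path -> bytes mapping into an index (manifest) and a blob store.

    Files with identical content map to the same blob, so they are stored once.
    """
    manifest = {}
    blobs = {}
    for path, content in files.items():
        key = blob_path(content)
        blobs[key] = content
        manifest[path] = {"size": len(content), "sha256Digest": key}
    return manifest, blobs


manifest, blobs = store({
    "/etc/a.conf": b"same bytes",
    "/etc/b.conf": b"same bytes",
    "/bin/tool": b"different bytes",
})
print(len(manifest), len(blobs))  # 3 2
```

Three paths end up referencing two blobs. The same property means every small file needs its own fetch, which is the per-file HTTP overhead noted in the post-mortem.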
  • 18. FILEgrain post-mortem • Incompatibility with legacy tarballs • Chicken-and-egg: hard to finalize the spec when no implementation exists; hard to promote an implementation when the spec is not finalized • Use-cases were unclear; didn’t need to focus on deduplication • Performance overhead due to the huge number of HTTP requests for reading small files
  • 19. The solution in 2020: stargz
  • 20. stargz: seekable tar.gz • Proposed by Brad Fitzpatrick (Google, at that time) for accelerating the CI of the Go language project • No focus on data deduplication • Legacy tar.gz: one gzip stream wrapping Metadata 0, File 0, Metadata 1, File 1, ..., Metadata {n-1}, File {n-1}, terminal zero bytes • stargz: a separate gzip stream for each of [Metadata 0, File 0], [Metadata 1, File 1], ..., [Metadata {n-1}, File {n-1}], then [metadata for stargz.index.json, stargz.index.json (Metadata 0…{n-1})], followed by the terminal zero bytes and an empty gzip stream
  • 21. stargz: seekable tar.gz • Fully compatible with legacy tar.gz • But contains an extra “stargz.index.json” entry
  • 22. stargz: seekable tar.gz • Only stargz.index.json is required for mounting the image • Actual files in the archive can be fetched on demand (when HTTP Range Requests are supported) • The gzip header of the trailing empty stream contains a pointer to stargz.index.json
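The idea behind the seekable layout can be sketched with Python's standard `gzip` and `zlib` modules. This is a simplified illustration, not the stargz implementation: it compresses raw byte chunks rather than tar entries, keeps the index in memory instead of in a stargz.index.json entry, and the names `build_seekable`, `read_one` and `range_header` are hypothetical.

```python
import gzip
import zlib


def build_seekable(chunks):
    """Compress each (name, data) chunk as its own gzip member.

    Returns the concatenated blob and an index of member offsets and lengths.
    """
    blob = b""
    index = []
    for name, data in chunks:
        member = gzip.compress(data)
        index.append({"name": name, "offset": len(blob), "length": len(member)})
        blob += member
    return blob, index


def read_one(blob, entry):
    """Decompress a single member without touching the rest of the blob."""
    start = entry["offset"]
    end = start + entry["length"]
    # 16 + MAX_WBITS tells zlib to expect a gzip header and trailer.
    return zlib.decompressobj(16 + zlib.MAX_WBITS).decompress(blob[start:end])


def range_header(entry):
    """HTTP Range header value covering exactly one member (end is inclusive)."""
    last = entry["offset"] + entry["length"] - 1
    return "bytes=%d-%d" % (entry["offset"], last)


chunks = [("a", b"hello " * 100), ("b", b"world" * 50)]
blob, index = build_seekable(chunks)

# A legacy reader still sees one valid gzip file containing everything.
assert gzip.decompress(blob) == chunks[0][1] + chunks[1][1]
# An index-aware reader can fetch and decompress only the member it needs.
assert read_one(blob, index[1]) == chunks[1][1]
```

Concatenated gzip members form a valid gzip file, which is what keeps the format readable by legacy tools, while the recorded offsets let a client request a single member with a Range header such as the one `range_header` builds.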
  • 23. stargz adoption in the ecosystem • containerd: https://github.com/ktock/stargz-snapshotter – By Kohei Tokunaga (NTT) – Implemented as a containerd snapshotter plugin – stargz archives are mounted as read-only FUSE filesystems – OverlayFS is used to support writes – Supports more aggressive optimization (discussed later) • Podman: https://github.com/giuseppe/crfs-plugin – By Giuseppe Scrivano (Red Hat) – Implemented as a fuse-overlayfs plugin
  • 24. stargz optimizer for containerd • Profiles actual file access patterns by running an equivalent of docker run – Future: static analysis using ldd(-ish)? Machine learning? • Reorders file entries in the archive so that relevant files can be prefetched in a single HTTP request • Before: /usr/bin/apt-get, /bin/ls, /bin/vi, /lib/libc.so, /lib/libjpeg.so, /usr/bin/python3, ..., /usr/lib/python3/.../foo, /usr/lib/python3/.../bar, /app.py • After: /bin/ls, /app.py, /usr/bin/python3, /lib/libc.so, /usr/lib/python3/.../foo, /usr/lib/python3/.../bar, ..., /bin/vi, /lib/libjpeg.so, /usr/bin/apt-get
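The reordering step can be sketched as follows. This is a hypothetical illustration, not the optimizer's code: `reorder_entries` is an invented name, the access log is assumed to be a plain list of paths in first-access order, and files that were never accessed simply keep their original relative order.

```python
def reorder_entries(entries, accessed):
    """Move accessed files to the front, in the order they were first accessed.

    entries:  file paths in their original archive order
    accessed: paths observed while profiling a run (may contain repeats
              or paths that are not in the archive)
    """
    present = set(entries)
    seen = set()
    prioritized = []
    for path in accessed:
        if path in present and path not in seen:
            seen.add(path)
            prioritized.append(path)
    rest = [path for path in entries if path not in seen]
    return prioritized + rest


entries = ["/usr/bin/apt-get", "/bin/ls", "/bin/vi", "/lib/libc.so",
           "/usr/bin/python3", "/app.py"]
accessed = ["/bin/ls", "/app.py", "/usr/bin/python3", "/lib/libc.so", "/bin/ls"]
print(reorder_entries(entries, accessed))
# ['/bin/ls', '/app.py', '/usr/bin/python3', '/lib/libc.so', '/usr/bin/apt-get', '/bin/vi']
```

Because the accessed files end up contiguous at the start of the archive, one byte range covers all of them, which is what makes a single prefetch request possible.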
  • 25. Benchmark results • Registry: Docker Hub (docker.io) • containerd host location: EC2 Oregon • Benchmark: run typical base images with a “compile hello world” command
  • 26-29. Benchmark results (charts; no text in the transcript). Credit to Kohei Tokunaga (NTT) for the containerd implementation and benchmark scripts.
  • 30. More optimizations are to come • Impl: Parallelize HTTP operations across image layers – https://github.com/ktock/stargz-snapshotter/issues/37 • Spec: Use zstd instead of gzip (“starzstd”?) – Proposed by Giuseppe https://github.com/golang/go/issues/30829#issuecomment-541532402 – Suitable for images with many small files – Not compatible with OCI Image Spec v1.0.1 – Compatible with OCI Image Spec v.Next
  • 31. stargz integration for BuildKit • BuildKit: modern OCI image builder – Concurrent execution – Efficient caching – Rootless – (pseudo-)daemonless – Clustering on Kubernetes – And a lot of innovative features • stargz support is planned, stay tuned! – Producing stargz images – Consuming stargz images as base images
  • 32. Other post-OCI formats • CernVM-FS – Not compatible with OCI tarballs – Already widely deployed at CERN and its collaborators – Implementation available for containerd: https://github.com/ktock/remote-snapshotter/pull/27 • Unofficial “OCI v2” – Proposed by Aleksa Sarai (SUSE) – Not compatible with OCI v1 tarballs – Focuses on deduplication, using the Restic algorithm – WIP implementation available for umoci (image manipulation tool): https://github.com/openSUSE/umoci/tree/experimental/ociv2 – No runtime implementation seems to exist
  • 33. Other post-OCI formats • IPCS – Proposed by Edgar Lee (Netflix) – Built on the IPFS (P2P CAS) protocol – Not compatible with OCI tarballs – Implementation available for containerd: https://github.com/hinshun/ipcs • Azure Container Registry “Project Teleport” – Built on the SMB protocol and VHD images – Not FLOSS
  • 34. Recap • Many alternative image formats have been proposed for lazy distribution, but compatibility matters • stargz is gaining wide adoption (containerd & Podman) • containerd supports sort+prefetch optimization for stargz https://github.com/ktock/stargz-snapshotter
  • 35. Request for comments • Valid & invalid use cases? • More efficient optimization techniques? • Issues/PRs are welcome at https://github.com/ktock/stargz-snapshotter (Expected to be moved under github.com/containerd soon)