Activity report on Deep-learning based compression

Institut Mines-Télécom
Compression meets Deep Learning:
Breakthrough or breakdown?
Report on research in
DL+Compression @ Multimedia group
Marco Cagnazzo, Attilio Fiandrotti, Andrei Purica

Context
DL and compression
Connection with compression problems
■ Learn the best choices for classical encoders
• E.g., fast mode decision, rate allocation
■ Improve classical tasks of compression algorithms
• Probability models
• Block prediction
• Segmentation for object-based coding
• MPEG contributions
■ Paradigm shift in signal representation
• Autoencoders, GANs
[Figure: IMPACT scale with three levels: Decisive, Incremental, Disruptive]

Outline
■ Ongoing work @ MM group
• Airplane screen content video compression
• Virtual viewpoint synthesis and super-resolution
• Subjective quality comparison of DL-based compression algorithms
• Other possible applications
■ The ML-Compression working group
■ Conclusions

Airplane screen content video compression
■ Airplane screen content: critical text information overlaid on a
natural-image or synthetic background
• Sensors, navigation and positioning information, etc.
• No direct access to these data, only to the captured screens
■ Compression is required for several use cases
■ Semantic (or object-based) video coding
• Text is recognized and encoded as such
• Perfect text reconstruction at the decoder side

Semantic coding
■ Deep learning is a key component of such schemes, since it enables
reliable detection of the semantic information
■ Three NN architectures tested (complexity vs. accuracy trade-off)
■ First results: up to 90% rate reduction with respect to the state of the
art at the same quality, or a +4.6 dB PSNR improvement
■ Example:
• HEVC-SCC: 0.018 bpp, 33.1 dB
• Proposed: 0.007 bpp, 38.2 dB
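As a quick check of the example above, the relative rate change and the PSNR difference between the two quoted operating points can be computed directly. This is only arithmetic on the two numbers shown on the slide; it is a single operating point, so its values need not coincide with the "up to" figures. The function name is illustrative.

```python
# Operating points quoted in the example above.
hevc_scc = {"bpp": 0.018, "psnr_db": 33.1}
proposed = {"bpp": 0.007, "psnr_db": 38.2}


def rate_change_percent(anchor_bpp, test_bpp):
    """Relative bitrate change of the test codec w.r.t. the anchor (negative = savings)."""
    return 100.0 * (test_bpp - anchor_bpp) / anchor_bpp


rate_change = rate_change_percent(hevc_scc["bpp"], proposed["bpp"])
psnr_gain = proposed["psnr_db"] - hevc_scc["psnr_db"]

print(f"Rate change: {rate_change:.1f} %")  # roughly -61 %
print(f"PSNR gain:   {psnr_gain:+.1f} dB")  # roughly +5.1 dB
```

In this example the proposed scheme is both cheaper in rate and higher in PSNR, so the two operating points are not at "the same quality"; a rate saving at equal quality would require comparing full rate-distortion curves.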

Virtual viewpoint synthesis
[Figure: 3D scene with x, y, z axes, showing the real cameras and the virtual viewpoints]

View Synthesis Reference Software (MPEG)
[Block diagram. Inputs: two reference homography matrices and one synthesis homography matrix. Stages: 3D back-projection (one per reference view) → Merging → Filling holes]
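The last two stages of the diagram (merging, then hole filling) can be illustrated with a deliberately simplified sketch. It assumes the two reference views have already been warped to the virtual viewpoint, that pixels with no sample are marked with a `HOLE` value, and that blending is a plain average and hole filling copies the nearest valid pixel on the same row. These rules and all names are assumptions chosen for illustration; the actual reference software is more elaborate.

```python
HOLE = None  # marker for pixels that received no sample after warping


def merge_views(left, right):
    """Blend two warped views pixel by pixel; keep HOLE where neither has data."""
    merged = []
    for row_l, row_r in zip(left, right):
        row = []
        for a, b in zip(row_l, row_r):
            if a is not HOLE and b is not HOLE:
                row.append((a + b) / 2)  # both views see the pixel: average
            elif a is not HOLE:
                row.append(a)            # only the first view sees it
            else:
                row.append(b)            # only the second view, or still HOLE
        merged.append(row)
    return merged


def fill_holes(image):
    """Fill each remaining hole with the nearest valid pixel on the same row.

    Rows that contain no valid pixel at all are left unchanged.
    """
    filled = []
    for row in image:
        valid = [i for i, v in enumerate(row) if v is not HOLE]
        new_row = []
        for i, v in enumerate(row):
            if v is HOLE and valid:
                nearest = min(valid, key=lambda j: abs(j - i))
                v = row[nearest]
            new_row.append(v)
        filled.append(new_row)
    return filled


# Two tiny warped views (grey levels); values are made up for the example.
left = [[10, 12, HOLE, HOLE], [20, HOLE, 24, 26]]
right = [[14, HOLE, HOLE, 18], [HOLE, HOLE, 28, 30]]

merged = merge_views(left, right)
synth = fill_holes(merged)
print(synth)
```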

Proposed scheme
[Block diagram. Inputs: two reference homography matrices and one synthesis homography matrix. Stages: 3D back-projection (one per reference view) → CNN-based merge]

CNN-based merge
Architecture derived from a video super-resolution technique
[Diagram: Concatenate → Convolutional Layer 1 → Convolutional Layer 2]
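The three blocks of the diagram can be sketched in plain Python to show the data flow: the two warped views are stacked as channels, passed through a first convolution, then a second one that outputs a single image. Only the block order comes from the slide. The kernel size, the ReLU between the layers, and the hand-set weights (which simply average the two views) are assumptions made for illustration; in a real network the weights would be learned.

```python
def conv2d(channels, kernels, biases):
    """Zero-padded 'same' 2-D convolution (cross-correlation form).

    channels: list of C_in images, each a list of rows
    kernels:  kernels[o][c] is the k x k kernel from input channel c to output o
    biases:   one bias per output channel
    """
    h, w = len(channels[0]), len(channels[0][0])
    k = len(kernels[0][0])
    pad = k // 2
    out = []
    for o, bias in enumerate(biases):
        plane = [[bias] * w for _ in range(h)]
        for c, img in enumerate(channels):
            ker = kernels[o][c]
            for y in range(h):
                for x in range(w):
                    acc = 0.0
                    for dy in range(k):
                        for dx in range(k):
                            yy, xx = y + dy - pad, x + dx - pad
                            if 0 <= yy < h and 0 <= xx < w:
                                acc += ker[dy][dx] * img[yy][xx]
                    plane[y][x] += acc
        out.append(plane)
    return out


def relu(channels):
    """Element-wise max(0, v); an assumed activation, not stated on the slide."""
    return [[[max(0.0, v) for v in row] for row in plane] for plane in channels]


def cnn_merge(view_a, view_b, layer1, layer2):
    stacked = [view_a, view_b]               # "Concatenate": stack as channels
    hidden = relu(conv2d(stacked, *layer1))  # Convolutional Layer 1
    return conv2d(hidden, *layer2)[0]        # Convolutional Layer 2, one output image


def centre_kernel(value):
    """3x3 kernel that is zero everywhere except at the centre."""
    return [[0.0, 0.0, 0.0], [0.0, value, 0.0], [0.0, 0.0, 0.0]]


# Hand-set (untrained) weights: layer 1 averages the two views, layer 2 is identity.
layer1 = ([[centre_kernel(0.5), centre_kernel(0.5)]], [0.0])
layer2 = ([[centre_kernel(1.0)]], [0.0])

view_a = [[10.0, 20.0], [30.0, 40.0]]
view_b = [[14.0, 20.0], [26.0, 44.0]]
merged = cnn_merge(view_a, view_b, layer1, layer2)
print(merged)
```

With these particular weights the network reduces to a pixel-wise average, which makes the output easy to check by hand; a learned merge would instead adapt its kernels to handle occlusions and holes.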

CNN-based view synthesis: results
[Figures: view synthesized with VSRS; ground truth; view synthesized with the proposed scheme]

Subjective quality evaluation of DL-compression methods
■ Deep generative models try to learn the latent distribution generating images
■ A typical architecture is based on auto-encoders, i.e. networks trained to
reproduce their input
■ Autoencoders include an information bottleneck, which is what achieves compression
■ Very low-bitrate compression can also be obtained with GANs
─ Training process stability?
─ Naturalness vs. fidelity
[Diagram: $x$ → Encoder → Code → Decoder → $y$, with loss $L = \|x - y\|^2$]
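The encoder / code / decoder chain and its reconstruction loss can be illustrated with a toy example. Here the encoder and decoder are fixed, hand-written functions standing in for the neural networks (an assumption made to keep the sketch short): the encoder halves the number of samples and rounds them, which plays the role of the bottleneck, and the decoder repeats each code value. In an actual autoencoder both mappings are learned by minimizing the loss.

```python
def encode(x):
    """Toy encoder: average neighbouring pairs, then round (the bottleneck)."""
    return [round((x[i] + x[i + 1]) / 2) for i in range(0, len(x), 2)]


def decode(code):
    """Toy decoder: repeat each code value twice to restore the original length."""
    return [c for c in code for _ in range(2)]


def loss(x, y):
    """Squared error ||x - y||^2 between the input and its reconstruction."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


x = [10, 12, 40, 44, 7, 7]  # made-up input signal
code = encode(x)            # half as many values as x
y = decode(code)            # reconstruction
L = loss(x, y)

print("code:", code)
print("y:   ", y)
print("L:   ", L)
```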

Subjective quality evaluation of DL-compression methods
■ Subjective quality evaluation (PSNR is not reliable enough)
■ 6 images, 113 compressed stimuli (uniform span of the
impairment scale)
■ 23 participants
■ Double stimulus impairment scale
■ Four compression methods:
1. Ballé et al.: 3-layer autoencoder with a biologically inspired
nonlinearity and an approximation of rate-distortion optimization
2. Toderici et al.: progressive RNN-based encoder working on
32×32-pixel patches
3. JP2K (JPEG 2000): wavelet transform, RDO, arithmetic coding
4. BPG: spatial prediction, variable-size prediction and transform
units, DCT and arithmetic coding
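Votes collected in a test like the one described above are usually summarized per stimulus as a mean opinion score (MOS) with a confidence interval. The sketch below assumes a five-grade impairment scale (5 = imperceptible, 1 = very annoying) and a normal-approximation 95% interval. The votes are invented for illustration and are not data from the study; the function name is also illustrative.

```python
import math
import statistics

# Hypothetical votes from 23 observers for ONE stimulus (not data from the study).
votes = [4, 5, 4, 3, 4, 4, 5, 3, 4, 4, 3, 5, 4, 4, 3, 4, 5, 4, 3, 4, 4, 4, 5]


def mos_with_ci(scores, z=1.96):
    """Return (mean opinion score, half-width of the approximate 95% interval)."""
    n = len(scores)
    mean = statistics.mean(scores)
    half_width = z * statistics.stdev(scores) / math.sqrt(n)
    return mean, half_width


mos, ci = mos_with_ci(votes)
print(f"MOS = {mos:.2f} +/- {ci:.2f} (n = {len(votes)})")
```

Two codecs can then be compared at a given bitrate by checking whether their intervals overlap, which is a rough but common first reading of subjective results.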

Subjective quality evaluation of DL-compression methods
[Figure: the test images Image 1 and Image 2]

Subjective quality evaluation of DL-compression methods - Image 1
[Figure: Image 1 compressed with Ballé et al. at 0.38 bpp and with JP2K at 0.43 bpp]

Subjective quality evaluation of DL-compression methods - Image 2
[Figure: Image 2 compressed with Toderici et al. at 0.125 bpp and with JP2K at 0.1 bpp]
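For readers less familiar with the bpp (bits per pixel) unit used in these captions, the conversion to and from a file size is shown below. The 768×512 image size is a hypothetical value chosen for the example; the slides do not state the resolution of the test images.

```python
def compressed_size_bytes(width, height, bpp):
    """File size in bytes implied by a bitrate in bits per pixel."""
    return width * height * bpp / 8


def bits_per_pixel(file_size_bytes, width, height):
    """Bitrate in bits per pixel implied by a compressed file size."""
    return 8 * file_size_bytes / (width * height)


WIDTH, HEIGHT = 768, 512  # assumed resolution, for illustration only

for label, bpp in [("Toderici", 0.125), ("JP2K", 0.1)]:
    size = compressed_size_bytes(WIDTH, HEIGHT, bpp)
    print(f"{label}: {bpp} bpp -> {size:.0f} bytes")
```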

Other applications
■ The flexibility of learning methods makes them suitable for
several other problems in the field of compression and
streaming
• Spatial image prediction
• Probability distribution estimation for lossless coding
• Digital Hologram Compression
• HTTP Adaptive streaming (Q-learning)
[Labels on the slide: Digicosme post-doc; BCOM PhD; HUAWEI (CIFRE) PhD]
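To see why probability distribution estimation matters for lossless coding, recall that an ideal entropy coder spends about -log2 p(s) bits on a symbol s to which the model assigns probability p(s). The sketch below compares two fixed models on a toy string; the data and both models are invented for illustration, and a learned model would replace the hand-written dictionaries.

```python
import math


def ideal_code_length_bits(symbols, model):
    """Total ideal code length, in bits, of a sequence under a probability model."""
    return sum(-math.log2(model[s]) for s in symbols)


data = "aaaaaaab"  # toy source: seven 'a' and one 'b'

uniform = {"a": 0.5, "b": 0.5}      # model that ignores the statistics
adapted = {"a": 0.875, "b": 0.125}  # model matching the symbol frequencies

uniform_bits = ideal_code_length_bits(data, uniform)
adapted_bits = ideal_code_length_bits(data, adapted)

print(f"uniform model: {uniform_bits:.2f} bits")
print(f"adapted model: {adapted_bits:.2f} bits")
```

The better the model predicts the data, the shorter the ideal code, which is the gain a learned probability estimator aims for.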

ML-Compression working group
http://mlcompr.wp.imt.fr
■ Started 4 months ago
■ People
• 2 associate professors (MdC), 2 post-docs, 3 PhD students, interns @ MM
• Possible recruitment of 1-3 PhD students (CIFRE) in the coming months
• Researchers from L2S (with PhDs and one post-doc)
• Contributions from other groups (talks, discussions, …)
■ Regular seminars with contributions from
─ IMAGES and S2A groups
─ Former IDS members
• Paris 5, L2S
─ Other universities (Paris 13, Poitiers, CentraleSupélec, …)
─ Companies (Orange, Zodiac, …)
■ Make it an “official research topic” (aka “theme”)?
■ Connection with the Learning theme?

Conclusions
■ Deep learning has triggered a revolution in many
fields: will it be the same for compression?
• Possibly, given its impact on closely related fields
(e.g. computer vision)
• But not sure: traditional methods still have very
important properties that cannot (yet) be guaranteed
by DL-based methods (robustness, progressivity,
rate-control, low-complexity decoders, …)
■ Will DL provide decisive gains within traditional
architectures?
• Possible, but many difficulties have to be faced
■ Or will DL just be used for incremental
improvements in traditional architectures?
• This minimal target can almost certainly be achieved
[Figure: IMPACT scale with three levels: Decisive, Incremental, Disruptive]

Perspectives
■ Increasing activity of the ML-Compression group
• Seminars from AI experts
• Growing network of collaborations
■ Industrial activity
• 2 or 3 CIFRE PhD proposals for next autumn
■ Critical mass?
• Intra-department? NewUni? New recruitment?

Thank you!
Working group: http://mlcompr.wp.imt.fr
Next seminar on July 19th
Subscribe to the mailing list:
https://listes.telecom-paristech.fr/mailman/listinfo/mlcompr
