In recent years, significant advances in deep neural networks have enabled the creation
of groundbreaking technologies such as self-driving cars and voice-enabled
personal assistants. Almost all successes of deep neural networks concern prediction,
whereas the initial breakthroughs came from generative models. Today,
although we have very powerful deep generative modeling techniques, these techniques
are essentially used for prediction or for generating known objects
(i.e., good-quality images of known classes): any generated object that is a priori
unknown is considered a failure mode (Salimans et al., 2016) or spurious
(Bengio et al., 2013b). In other words, when prediction seems to be the only
possible objective, novelty is seen as an error that researchers have been trying hard
to eliminate. This thesis defends the point of view that, instead of trying to eliminate
these novelties, we should study them and the generative potential of deep nets
to create useful novelty, especially given the economic and societal importance of
creating new objects in contemporary societies. The thesis sets out to study novelty
generation in relation to data-driven knowledge models produced by
deep generative neural networks. Our first key contribution is the clarification of
the importance of representations and their impact on the kinds of novelty that
can be generated: a key consequence is that a creative agent might need to re-represent
known objects to access various kinds of novelty. We then demonstrate
that traditional objective functions of statistical learning theory, such as maximum
likelihood, are not necessarily the best theoretical framework for studying novelty
generation, and we propose several alternatives at the conceptual level. A second
key result is the confirmation that current models, trained with traditional objective
functions, can indeed generate unknown objects. This shows that even though
objectives like maximum likelihood are designed to eliminate novelty, practical
implementations do generate it. Through a series of experiments, we study
the behavior of these models and the novelty they generate. In particular, we propose
a new task setup and metrics for selecting good generative models. Finally,
the thesis concludes with a series of experiments clarifying the characteristics of
models that can exhibit novelty. Experiments show that sparsity, noise level, and
restricting the capacity of the net eliminate novelty, and that models that are better
at recognizing novelty are also better at generating it.
Design theory
• Early work: design as 'problem solving', i.e. moving from an initial state to a desired state (Simon, 1969, 1973)
• C-K theory: design as joint expansion of knowledge and concepts (Hatchuel et al., 2003)
• Various formalisms of knowledge: set theory (Hatchuel et al., 2007), graphs (Kazakci et al., 2010), matroids (Le Masson et al., 2017)
• Through C-K theory, design theory acknowledges that knowledge is central
• But it lacks computer-based experimental tools
Computational creativity
• Generation as optimization with evolutionary algorithms
• Enables experimentation, but the end goal is the object itself rather than studying the generative process
• Fitness function barrier
• No representation learning
• Generation and evaluation are disconnected
But these powerful models are used to regenerate objects that we can easily relate to known objects…
• Although trained to generate what we know, some models can generate unrecognizable objects
• However, these models and samples are considered spurious (Bengio et al., 2013) or a failure (Salimans et al., 2016)
Instead of ignoring or eliminating novelty, we should study it.
• Goal of the thesis: study the generative potential of deep generative networks (DGNs) for novelty generation
• Research questions:
• What novelty can be generated by a DGN?
• How to evaluate the generative potential of a DGN?
• What are the general characteristics of DGNs that can generate novelty?
• Method: we use computer-based simulations with deep generative models because
• they offer a rich and powerful set of existing techniques
• they can learn representations of objects
• their generative potential has not been studied systematically
Outline
1. Introduction
2. The impact of representations on novelty generation
3. Results
3.a. Studying the generative potential of a deep net
3.b. Evaluating the generative potential of deep nets
3.c. Characteristics of models that can generate novelty
4. Conclusion and perspectives
2. The impact of representations on novelty generation

In the design literature, it has been acknowledged that objects can be represented in multiple ways (Reich, 1995).
What effect do representations have on novelty generation?
• Suppose we have a dataset of 16 letters
• Suppose we represent images in pixel space
• We generate pixels uniformly at random
• Everything is new, but there is no structure
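The pixel-space baseline above can be sketched in a few lines (a minimal illustration; the 16×16 image size is an arbitrary choice):

```python
import numpy as np

# Draw every pixel independently and uniformly at random:
# each sample is new, but carries no structure at all.
rng = np.random.default_rng(0)
image = rng.uniform(0.0, 1.0, size=(16, 16))
```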
• Suppose we re-represent each letter using strokes
(Figure: generated samples in pixel space vs. stroke space)
Representations change what you can generate
• How do we choose a "useful" representation for novelty generation?
• Machine learning, and deep generative models in particular, provides ways to learn representations from data

Q: Can we use those learned representations to generate novelty even if these models are not designed to do so?
Summary:
• Noise vs. novelty
• Likelihood
• Compression of representations
Research questions:
• What novelty can be generated by deep generative nets (DGNs)?
• How to evaluate the generative potential of a DGN?
• What are the general characteristics of DGNs that can generate novelty?
3.a. Studying the generative potential of a deep net
• We observed that some models could generate novelty although they were not designed to do so
• Thus, deep generative models have an unused generative potential
• Can we demonstrate this more systematically?
(Diagram: train data → learn → generative model → generate → ??; Kazakci, Cherti, Kégl, 2016)
We use a convolutional sparse auto-encoder as a model. The training objective is to minimize the reconstruction error, with sparsity imposed on the hidden representation.
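A minimal sketch of such a training objective, assuming an L1 penalty as the sparsity mechanism (the exact sparsity constraint used in the thesis may differ; `sparse_ae_loss` and its arguments are hypothetical names):

```python
import numpy as np

def sparse_ae_loss(x, x_rec, h, sparsity_coef=0.1):
    """Reconstruction error plus a sparsity penalty on the code h.

    x      : input batch
    x_rec  : auto-encoder reconstruction dec(enc(x))
    h      : hidden representation enc(x)
    """
    reconstruction = np.mean((x - x_rec) ** 2)     # mean squared error
    sparsity = sparsity_coef * np.mean(np.abs(h))  # L1 penalty on the code
    return reconstruction + sparsity
```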
• We use an iterative method to generate new images
• Start with a random image x
• Force the network to construct (i.e. interpret) it by repeatedly applying f(x) = dec(enc(x)) until convergence
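The iterative procedure above can be sketched as follows (`encode`/`decode` stand for the trained auto-encoder's two halves; the helper name and its parameters are illustrative assumptions):

```python
import numpy as np

def iterative_generate(encode, decode, shape, n_iters=100, tol=1e-6, seed=None):
    """Start from a random image and repeatedly apply
    f(x) = decode(encode(x)) until x stops changing."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(size=shape)          # random starting image
    for _ in range(n_iters):
        x_new = decode(encode(x))        # one construction step
        delta = np.max(np.abs(x_new - x))
        x = x_new
        if delta < tol:                  # reached a fixed point of f
            break
    return x
```

Starting from noise, the iteration converges to a fixed point of f, which need not coincide with any training example.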
Our interpretation of the results:
• Known: training digits
• Representable: "combinations of strokes"
• Known: training digits
• Representable: all digits that the model can generate
• Valuable: all recognizable digits
• Known: training digits
• Representable: "combinations of strokes"
• Valuable: human selection
3.b. Evaluating the generative potential of deep nets
• We have one example of a deep generative model that can indeed generate novelty
• Can we go further by automatically finding models that can generate novelty?
We designed a new setup and set of metrics to find models that are capable of generating novelty.
Idea: simulate the unknown by
• training on known classes
• testing on classes known to the experimenter but unknown to the model
Proposed setup: train on digits and test on letters, where letters are used as a proxy for evaluating the capacity of models to generate novelty.
To count letters, we learn a discriminator with 36 classes = 10 for digits + 26 for letters.
We then use the discriminator to score the models by predicting the number of letters among the generated samples.
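A minimal sketch of this scoring step, assuming the discriminator outputs probabilities over 36 classes with digits at indices 0–9 and letters at 10–35 (this class layout and the `letter_count` helper are assumptions for illustration):

```python
import numpy as np

def letter_count(probs):
    """Count generated samples whose most likely class is a letter.

    probs : (n_samples, 36) array of discriminator probabilities;
            indices 0-9 = digits, 10-35 = letters (assumed layout).
    """
    predictions = np.argmax(probs, axis=1)
    return int(np.sum(predictions >= 10))
```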
The "number of letters" score is a proxy for finding models that generate images that are:
• non-trivial
• not recognizable as digits
• We run a large-scale experiment where we train ~1000 models (autoencoders, GANs) by varying their hyperparameters.
• From each model, we generate 1000 images, then evaluate the model using our proposed metrics.
• Question we tried to answer: can we find models that can generate novelty?
• Selecting models by letter count leads to models that can generate novelty
• Selecting models by digit count leads to models that memorize the training classes
• Known: training digits
• Representable: "combinations of strokes"
• Valuable: letters
We have shown that we can automatically find models that can generate novelty, as well as other models that cannot.
• Can we characterize the difference between models that can generate novelty and models that cannot?
• We study a particular model architecture through a series of experiments
3.c. Characteristics of models that can generate novelty
• We study the effect of different ways of restricting the capacity of the representation on the same architecture
• We find that restricting the capacity of the representation hurts the models' ability to generate novelty
(Plot: more capacity → more novelty)
Conclusion
Main contributions:
• Importance of representations for novelty generation
• Current models can generate novelty even though they are not designed for that
• We propose a new setup and a set of metrics to assess the capacity of models to generate novelty
• We show that constraining the capacity of the representation can be harmful for novelty generation
Perspectives: immediate next steps
• Explain why existing models can generate novelty
• Propose an explicit training criterion to learn a representation suitable for novelty generation
• Propose alternative generative procedures to random sampling
• Experiment on more complex datasets and domains
Perspectives: future
• Agent evolving over time: dynamic knowledge and value function
• Multi-agent system so that agents get/give feedback and cooperate