GAN, with Mathematics
Yonsei University, Image and Video Pattern Recognition Lab
Hyeongmin Lee
GAN??
Common misconceptions about GAN
GAN??
MAGIC!!
An algorithm that generates images
An algorithm that augments your data
GAN??
???
Discovering the Distribution!!
Discriminative Model: Sample → Which Label??
Generative Model: Sample → New Sample
GAN is a Generative Model.
Kullback – Leibler Divergence
How similar are two distributions?
https://www.countbayesie.com/blog/2017/5/9/kullback-leibler-divergence-explained
VS
How much information is lost?
Information content: $I_i = -\log p(x_i)$
Entropy: $H = E[I_i] = -\sum_{i=1}^{N} p(x_i) \log p(x_i)$
$$D_{KL}(p \,\|\, q) = E[\log p(x) - \log q(x)] = \sum_{i=1}^{N} p(x_i) \log \frac{p(x_i)}{q(x_i)}$$
The expected amount of information lost!!
$D_{KL} = 0.338 \qquad D_{KL} = 0.477$
But!! KL-Divergence is an asymmetric function!!
So we use the Jensen–Shannon Divergence instead.
$$JSD(p \,\|\, q) = \frac{1}{2} KL\left(p \,\middle\|\, \frac{p+q}{2}\right) + \frac{1}{2} KL\left(q \,\middle\|\, \frac{p+q}{2}\right)$$
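The three quantities above are easy to check numerically. Below is a minimal sketch (not from the slides) that computes entropy, both directions of KL, and JSD for two small discrete distributions; the example distributions themselves are hypothetical, not the ones from the linked blog post.

```python
# Minimal numerical sketch: entropy, KL divergence, and JSD for discrete p, q.
import numpy as np

def entropy(p):
    """H(p) = -sum_i p_i * log p_i"""
    p = np.asarray(p, dtype=float)
    return -np.sum(p * np.log(p))

def kl(p, q):
    """D_KL(p || q) = sum_i p_i * log(p_i / q_i) -- asymmetric in p and q."""
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    return np.sum(p * np.log(p / q))

def jsd(p, q):
    """JSD(p || q) = 0.5*KL(p || m) + 0.5*KL(q || m), with m = (p + q)/2 -- symmetric."""
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    m = 0.5 * (p + q)
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

p = np.array([0.36, 0.48, 0.16])   # hypothetical example distribution
q = np.array([1/3, 1/3, 1/3])      # uniform distribution over 3 outcomes

print(entropy(p))                  # Shannon entropy of p
print(kl(p, q), kl(q, p))          # the two directions generally differ
print(jsd(p, q), jsd(q, p))        # identical values: JSD is symmetric
```

Running it shows that kl(p, q) and kl(q, p) generally differ while jsd(p, q) equals jsd(q, p), which is exactly the asymmetry issue raised above.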
Go back to GAN…
$z \sim p_z(z) \;\xrightarrow{\;G\;}\; x \sim p_g(x), \qquad x \sim p_{data}(x)$ (real data)
Fix G first! Let's optimize D!
$$V(G, D) = E_{x \sim p_{data}(x)}[\log D(x)] + E_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$
$$= E_{x \sim p_{data}(x)}[\log D(x)] + E_{x \sim p_g(x)}[\log(1 - D(x))] \qquad (\because G(z) \sim p_g(x))$$
$$= \int_x p_{data}(x) \log D(x) + p_g(x) \log(1 - D(x)) \, dx$$
Differentiate with respect to D!!
$$D_G^*(x) = \frac{p_{data}(x)}{p_{data}(x) + p_g(x)}$$
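The jump from the integral to $D_G^*$ is a pointwise maximization, as in the original GAN paper's proof; a short worked version of that step, with $a$ and $b$ introduced here as shorthand:

```latex
% For a fixed x, let a = p_data(x), b = p_g(x), and y = D(x) in (0, 1).
% The integrand is f(y) = a*log(y) + b*log(1 - y).
\[
f'(y) = \frac{a}{y} - \frac{b}{1 - y} = 0
\quad\Longrightarrow\quad
y^{*} = \frac{a}{a + b}
      = \frac{p_{data}(x)}{p_{data}(x) + p_g(x)} = D_G^{*}(x).
\]
```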
Now, assuming D is optimal, let's optimize G!!
$$C(G) = V(G, D^*)$$
$$= E_{x \sim p_{data}(x)}[\log D^*(x)] + E_{x \sim p_g(x)}[\log(1 - D^*(x))]$$
$$= E_{x \sim p_{data}(x)}\left[\log \frac{p_{data}(x)}{p_{data}(x) + p_g(x)}\right] + E_{x \sim p_g(x)}\left[\log \frac{p_g(x)}{p_{data}(x) + p_g(x)}\right]$$
$$= -\log 4 + E_{x \sim p_{data}(x)}\left[\log p_{data}(x) - \log \frac{p_{data}(x) + p_g(x)}{2}\right] + E_{x \sim p_g(x)}\left[\log p_g(x) - \log \frac{p_{data}(x) + p_g(x)}{2}\right]$$
$$= -\log 4 + KL\left(p_{data} \,\middle\|\, \frac{p_{data} + p_g}{2}\right) + KL\left(p_g \,\middle\|\, \frac{p_{data} + p_g}{2}\right)$$
$$= -\log 4 + 2 \times JSD(p_{data} \,\|\, p_g)$$
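One consequence worth spelling out (a standard fact about JSD, not stated on the slide): since $JSD \ge 0$ with equality exactly when the two distributions coincide,

```latex
\[
C(G) = -\log 4 + 2\,JSD(p_{data} \,\|\, p_g) \;\ge\; -\log 4,
\qquad
C(G) = -\log 4 \;\iff\; p_g = p_{data},
\]
```

so the optimal generator exactly reproduces the data distribution.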
The theoretical GAN training scenario…
Optimize D → Optimize G → Optimize D → …
https://www.youtube.com/watch?v=RlAgB0Ooxaw&list=PLlMkM4tgfjnJhhd4wn5aj8fVTYJwIpWkS&index=49
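A minimal sketch of that alternating loop in PyTorch (chosen because the code links below include a PyTorch collection). The toy 1-D Gaussian data, the tiny MLPs, the batch size, and the learning rates are all assumptions for illustration; the generator update uses the common non-saturating variant (maximize $\log D(G(z))$) rather than the $\log(1 - D(G(z)))$ form written above.

```python
# Minimal sketch of the alternating "Optimize D -> Optimize G -> ..." loop
# for the value function V(G, D) above. Toy data and hyperparameters are
# assumptions, not the configuration of any referenced paper.
import torch
import torch.nn as nn

latent_dim = 8
G = nn.Sequential(nn.Linear(latent_dim, 32), nn.ReLU(), nn.Linear(32, 1))
D = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 1), nn.Sigmoid())
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCELoss()

for step in range(1000):
    real = torch.randn(64, 1) * 0.5 + 3.0      # x ~ p_data(x): toy 1-D Gaussian
    z = torch.randn(64, latent_dim)            # z ~ p_z(z)
    fake = G(z)                                # x ~ p_g(x)

    # --- Optimize D: maximize log D(x) + log(1 - D(G(z))) ---
    opt_d.zero_grad()
    d_loss = bce(D(real), torch.ones(64, 1)) + bce(D(fake.detach()), torch.zeros(64, 1))
    d_loss.backward()
    opt_d.step()

    # --- Optimize G: non-saturating variant, maximize log D(G(z)) ---
    opt_g.zero_grad()
    g_loss = bce(D(fake), torch.ones(64, 1))
    g_loss.backward()
    opt_g.step()
```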
However…
Real data is not spread widely across the space
Measure: "the size of a set"
Ex: On the real line, the Measure of (0,1) = 1
In $R^2$, the Measure of (0,1)×(0,1) = 1
But in $R^2$, the Measure of {(x, y) : 0 < x < 1, y = 0} = 0
In short, a region of lower dimension than the whole space has Measure 0.
Real data is not spread widely across the space
Both the Real and Fake data regions have Measure zero
→ So the intersection of the two regions also has Measure zero
On $\mathrm{Supp}(P_r)$: $D(x) = 1$, $\nabla D(x) = 0$. On $\mathrm{Supp}(P_g)$: $D(x) = 0$, $\nabla D(x) = 0$.
So a perfect Discriminator that separates Real from Fake always exists.
And on both supports, $\nabla D(x) = 0$.
Vanishing Gradient
If D's gradient is 0, then G's gradient, which flows through D by the chain rule, is also 0.
→ Training becomes impossible.
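A one-line worked version of that statement, writing $\theta$ for the generator parameters (notation assumed here):

```latex
\[
\nabla_{\theta} \log\bigl(1 - D(G_{\theta}(z))\bigr)
= -\frac{1}{1 - D(G_{\theta}(z))}\,
  \nabla_{x} D(x)\big|_{x = G_{\theta}(z)}\,
  \frac{\partial G_{\theta}(z)}{\partial \theta}
= 0
\quad\text{wherever}\quad \nabla_{x} D(x) = 0.
\]
```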
Solution 1.
Add noise to both the Real and Fake data, so that $\mathrm{Supp}(P_r)$ and $\mathrm{Supp}(P_g)$ spread out and overlap.
But since noise is added, output quality naturally degrades!!
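A minimal sketch of Solution 1 in code ("instance noise" style); the noise scale and where it is applied are assumptions for illustration:

```python
# Solution 1 sketch: add Gaussian noise to both real and fake samples before
# they reach D, so the two (widened) supports overlap. The scale sigma and its
# schedule are hypothetical; in practice sigma is often annealed toward 0.
import torch

sigma = 0.1  # hypothetical noise scale

def noisy(x, sigma):
    """Return x plus Gaussian noise -- applied to real AND fake batches before D."""
    return x + sigma * torch.randn_like(x)

# inside the training loop sketched earlier:
# d_loss = bce(D(noisy(real, sigma)), ones) + bce(D(noisy(fake.detach(), sigma)), zeros)
```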
Solution 2.
Use a different distance instead of JSD.
Infimum = Greatest Lower Bound
$\inf(0,1) = 0$, $\inf[0,1] = 0$: the infimum need not be attained inside the set.
→ Impossible to implement programmatically as written → Wasserstein GAN
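A minimal WGAN-style sketch under the same toy setup as the earlier loop. The critic estimates the Wasserstein distance through its dual form (a supremum over 1-Lipschitz functions) instead of the intractable infimum over joint distributions; weight clipping crudely enforces the Lipschitz constraint, as in the WGAN paper. Network sizes, clip range, and learning rate are assumptions.

```python
# Solution 2 sketch: Wasserstein GAN with weight clipping on a toy 1-D problem.
import torch
import torch.nn as nn

latent_dim = 8
critic = nn.Sequential(nn.Linear(1, 32), nn.ReLU(), nn.Linear(32, 1))  # no sigmoid
G = nn.Sequential(nn.Linear(latent_dim, 32), nn.ReLU(), nn.Linear(32, 1))
opt_c = torch.optim.RMSprop(critic.parameters(), lr=5e-5)
opt_g = torch.optim.RMSprop(G.parameters(), lr=5e-5)

for step in range(1000):
    # --- Critic: maximize E[f(x_real)] - E[f(x_fake)] ---
    for _ in range(5):                              # several critic steps per G step
        real = torch.randn(64, 1) * 0.5 + 3.0
        fake = G(torch.randn(64, latent_dim)).detach()
        loss_c = -(critic(real).mean() - critic(fake).mean())
        opt_c.zero_grad()
        loss_c.backward()
        opt_c.step()
        for p in critic.parameters():               # weight clipping crudely enforces
            p.data.clamp_(-0.01, 0.01)              # the 1-Lipschitz constraint

    # --- Generator: maximize E[f(G(z))] ---
    fake = G(torch.randn(64, latent_dim))
    loss_g = -critic(fake).mean()
    opt_g.zero_grad()
    loss_g.backward()
    opt_g.step()
```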
Solution 3.
Progressive GAN (jokingly, "a monster born of capitalism")
Research on how to train GANs is still very much ongoing…
GAN/VAE comparison and code
Tensorflow: https://github.com/hwalsuklee/tensorflow-generative-model-collections
PyTorch: https://github.com/znxlwm/pytorch-generative-model-collections
GAN Loss Function family (training stabilization)
• GAN (NIPS 2014)
• Least Square GAN (LSGAN, ICCV 2017)
• Wasserstein GAN (WGAN, ICML 2017)
• Improved Training of Wasserstein GANs (WGAN_GP, NIPS 2017)
• Improving the Improved Training of Wasserstein GANs (CT-GAN, ICLR 2018)
cGAN / Application family (Conditional Generation)
• Conditional GAN (cGAN, 2014)
• Image to Image Translation with Conditional Adversarial Networks (Pix2Pix, CVPR 2017)
• Learning to Discover Cross-Domain Relations with GAN (DiscoGAN, ICML 2017)
• Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (CycleGAN, ICCV 2017)
→ Uses the LSGAN loss
• Toward Multimodal Image-to-Image Translation (BiCycleGAN, NIPS 2017)
→ Uses the LSGAN loss / adds diversity to the image-to-image translation
• High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs (Pix2PixHD, 2017)
→ Uses the LSGAN loss / generates high-resolution images
• StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation (CVPR 2018)
→ Uses the WGAN_GP loss / learns translation across multiple domains simultaneously
• Progressive Growing of GANs for Improved Quality, Stability and Variation (ProGAN, ICLR 2018)
→ Uses the WGAN_GP loss / maximizes generation quality through experimental methods
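To make the "which loss does each paper use" annotations concrete, here is a sketch of the three discriminator objectives that recur in the lists above (vanilla GAN, LSGAN, WGAN-GP), written for a discriminator/critic that outputs raw scores; the target values and the penalty weight follow common usage and are assumptions here, not exact reproductions of any one paper.

```python
# Sketch of the three discriminator-side objectives referenced above.
import torch
import torch.nn.functional as F

def vanilla_gan_d_loss(d_real, d_fake):
    # Original GAN: binary cross-entropy on sigmoid(D); d_* are raw logits.
    return F.binary_cross_entropy_with_logits(d_real, torch.ones_like(d_real)) + \
           F.binary_cross_entropy_with_logits(d_fake, torch.zeros_like(d_fake))

def lsgan_d_loss(d_real, d_fake):
    # LSGAN: least-squares targets (1 for real, 0 for fake) instead of log loss.
    return ((d_real - 1) ** 2).mean() + (d_fake ** 2).mean()

def wgan_gp_d_loss(D, x_real, x_fake, lam=10.0):
    # WGAN-GP: Wasserstein critic loss plus a gradient penalty on interpolated points.
    eps = torch.rand(x_real.size(0), 1)
    x_hat = (eps * x_real + (1 - eps) * x_fake).requires_grad_(True)
    grad = torch.autograd.grad(D(x_hat).sum(), x_hat, create_graph=True)[0]
    gp = ((grad.norm(2, dim=1) - 1) ** 2).mean()
    return D(x_fake).mean() - D(x_real).mean() + lam * gp
```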
Conclusion?
If we understand GAN only as a convenient, magical tool that produces whatever images we want, there is a limit to how deep our research can go.
Many areas in our field also demand a substantial mathematical background, and being good at the math makes your papers look that much cooler.