GAN Framework

This document discusses generative adversarial networks (GANs) and techniques for training them. It provides an outline that covers why generative modeling is useful, existing generative models, properties and framework of GANs, challenges in GAN training, and tricks to improve GAN training. Some key points include: GANs use a minimax game between a generator and discriminator, GAN training is difficult due to non-convergence and mode collapse issues, and techniques like feature matching and unrolling can help stabilize GAN training.


CS – 3750

Machine Learning

Generative Adversarial
Network
Khushboo Thaker
[email protected]
Exponential Growth in GAN Papers

(Figure omitted: growth of GAN papers over time; credit Ian Goodfellow)

Khushboo Thaker 2
Outline
• Why Generative Modeling?
• Existing Generative Models – A Review
• Properties of GANs
• GAN Framework
• Minimax Play for GANs
• Why is GAN Training Hard?
• Tricks to Train GANs
• Examples of Some Common Extensions to GANs
• Conclusion and Future Reading

Khushboo Thaker 3
Generative Modeling
• Input is a set of training examples; the output is some representation of the probability distribution that defines this example space.

• Unsupervised
Data – X
Goal – learn the hidden structure of the data
(Figure: training examples vs. generated samples, from Dr. Fei-Fei Li's slides)

• Supervised
Data – X, y
Goal – learn the mapping X -> Y

Khushboo Thaker 4


Sample Generation
• Noisy input → simulated data
• Features representative of the data
• Prediction of future state
• Missing data
• Semi-supervised learning

Khushboo Thaker 5
Maximum Likelihood based Models

θ* = argmax_θ E_{x ~ p_data} [ log p_model(x; θ) ]

Maximum likelihood tries to increase the likelihood of the data given the parameters.
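A minimal sketch of what "increase the likelihood of the data given the parameters" looks like in code: choose parameters that minimize the negative log-likelihood of the training examples. The single-Gaussian model and all hyperparameters below are hypothetical stand-ins for illustration, not anything from the slides.

```python
import math
import torch

data = 2.0 * torch.randn(1000) + 3.0             # toy training examples
mu = torch.zeros(1, requires_grad=True)           # model parameters theta
log_sigma = torch.zeros(1, requires_grad=True)
opt = torch.optim.Adam([mu, log_sigma], lr=0.05)

for step in range(500):
    sigma = log_sigma.exp()
    # log p_model(x; theta) for a univariate Gaussian
    log_px = -0.5 * ((data - mu) / sigma) ** 2 - log_sigma - 0.5 * math.log(2 * math.pi)
    loss = -log_px.mean()                         # maximizing likelihood = minimizing NLL
    opt.zero_grad()
    loss.backward()
    opt.step()
```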

Khushboo Thaker 6
Khushboo Thaker 7
Tractable Models – PixelRNN / PixelCNN / WaveNet
Fully Visible Belief Networks
• Generate image pixels starting from a corner
• Training is fast
• Generation is slow / sequential
• Cannot generate samples based on some latent code
• Chain rule: p(x) = ∏_i p(x_i | x_1, …, x_{i-1})
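A toy sketch of why the chain-rule factorization makes generation slow and sequential: each dimension is sampled from a conditional on the dimensions already filled in, one forward pass at a time. The tiny MLP "conditional" below is a hypothetical stand-in (unfilled positions are simply still zero), not PixelRNN's actual architecture.

```python
import torch
import torch.nn as nn

D = 8                                             # number of binary "pixels"
conditional = nn.Sequential(nn.Linear(D, 32), nn.ReLU(), nn.Linear(32, 1))

def sample():
    x = torch.zeros(D)
    for i in range(D):                            # one forward pass per pixel
        logit = conditional(x)                    # p(x_i = 1 | x_1..x_{i-1}); later entries are still 0
        x[i] = torch.bernoulli(torch.sigmoid(logit)).item()
    return x

print(sample())
```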
Khushboo Thaker 8
Maximum Likelihood based Training
Non-Tractable Models – Variational Approximation
Variational Auto-encoder
• The model is able to achieve high likelihood
• The model is not asymptotically consistent unless q is perfect
• Samples tend to have lower quality

Khushboo Thaker 9
Non-Tractable Models – MCMC Approximation
Boltzmann Machine
• Energy-function-based models
• Markov chains don't work for long sequences
• Hard to scale to large datasets

Khushboo Thaker 10
Khushboo Thaker 11
Where do GANs fall?
• Can use latent information during sample generation
• Asymptotically consistent (claimed to recover the true distribution)
• No Markov chain assumption
• Samples produced are high quality

Khushboo Thaker 12
Generated Samples - GAN

Khushboo Thaker 13
Next Video Frame Prediction

• Sharper image
• Better estimation of ear position
• Much crisper eyes

Khushboo Thaker 14
Generative Adversarial Networks

Generator Discriminator

Khushboo Thaker 15
Generative Adversarial Networks

Khushboo Thaker 16
Classic GAN Framework

Z – random noise
(latent representation of the data)
dim(Z) <= dim(X)

https://www.slideshare.net/xavigiro/deep-learning-for-computer-vision-generative-models-and-adversarial-training-upc-2016
Khushboo Thaker 17
Training Discriminator

https://www.slideshare.net/xavigiro/deep-learning-for-computer-vision-generative-models-and-adversarial-training-upc-2016
Khushboo Thaker 18
Training Generator

https://www.slideshare.net/xavigiro/deep-learning-for-computer-vision-generative-models-and-adversarial-training-upc-2016
Khushboo Thaker 19
Mini-max Game Approach

min_G max_D V(D, G) = E_{x ~ p_data} [ log D(x) ] + E_{z ~ p_z} [ log(1 − D(G(z))) ]

D(x) – discriminator output for real data x; D(G(z)) – discriminator output for fake data G(z)

• The generator minimizes the log-probability of the discriminator being correct
• The objective resembles the Jensen-Shannon divergence
• The solution is a saddle point of the discriminator's loss
(A training-loop sketch follows.)
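A minimal sketch of the alternating minimax game described above. Every name here (the tiny fully-connected G and D, the optimizers, the random "data loader") is an illustrative placeholder, not the architecture from the slides; the point is only the alternation between a discriminator ascent step and a generator descent step on the same objective.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

n_x, n_z = 784, 100                                       # toy data / noise dimensions
G = nn.Sequential(nn.Linear(n_z, 256), nn.ReLU(), nn.Linear(256, n_x), nn.Tanh())
D = nn.Sequential(nn.Linear(n_x, 256), nn.LeakyReLU(0.2), nn.Linear(256, 1))
g_opt = torch.optim.Adam(G.parameters(), lr=2e-4)
d_opt = torch.optim.Adam(D.parameters(), lr=2e-4)
data_loader = [torch.randn(64, n_x) for _ in range(100)]  # placeholder for real data

for real in data_loader:
    # Discriminator step: ascend E[log D(x)] + E[log(1 - D(G(z)))]
    z = torch.randn(real.size(0), n_z)
    fake = G(z).detach()                                  # do not backprop into G here
    d_real, d_fake = D(real), D(fake)
    d_loss = (F.binary_cross_entropy_with_logits(d_real, torch.ones_like(d_real))
              + F.binary_cross_entropy_with_logits(d_fake, torch.zeros_like(d_fake)))
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # Generator step: descend E[log(1 - D(G(z)))], the original minimax loss
    z = torch.randn(real.size(0), n_z)
    g_loss = torch.log(1.0 - torch.sigmoid(D(G(z))) + 1e-8).mean()
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()
```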
Khushboo Thaker 20
Mini-max Game Approach

Nash Equilibrium / Saddle Point

• The generator minimizes the log-probability of the discriminator being correct
• The objective resembles the Jensen-Shannon divergence
• The solution is a saddle point of the discriminator's loss
Khushboo Thaker 21
Vanishing Gradient Problem with the Generator

The gradient goes to 0 if D is confident, i.e. D(G(z)) -> 0

Whenever the discriminator becomes very confident, the generator's loss log(1 − D(G(z))) goes to zero and provides almost no gradient.

Nothing left for the generator to improve on.

Khushboo Thaker 22
Heuristic Non-Saturating Game

The generator maximizes the log-probability of the discriminator's mistake, i.e. maximizes log D(G(z))

The generator can still learn even when the discriminator successfully rejects its samples


Khushboo Thaker 23
Comparison of Generator Losses

• The generator's cost is a function of D(G(z))
• The non-saturating cost is able to learn even when the gradient signal is low
(A numeric comparison follows.)
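A small numeric check of the comparison above: when the discriminator confidently rejects a fake sample (D(G(z)) close to 0), the minimax loss log(1 − D(G(z))) yields an almost-flat gradient, while the non-saturating −log D(G(z)) still gives a strong signal. The probe value 1e-4 is arbitrary.

```python
import torch

d_out = torch.tensor([1e-4], requires_grad=True)    # D(G(z)) when D is very confident

minimax = torch.log(1.0 - d_out).sum()              # generator minimizes this
minimax.backward()
print(d_out.grad)                                    # about -1: hardly any signal

d_out.grad = None
non_saturating = (-torch.log(d_out)).sum()           # heuristic: minimize -log D(G(z))
non_saturating.backward()
print(d_out.grad)                                    # about -10000: strong signal even when D "wins"
```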

Khushboo Thaker 24
Outline
• Why Generative Modeling?
• Existing Generative Models – A Review
• Properties of GANs
• GAN Framework
• Minimax Play for GANs
• Why is GAN Training Hard?
• Tricks to Train GANs
• Examples of Some Common Extensions to GANs
• Conclusion and Future Reading

https://www.youtube.com/watch?v=mObnwR-u8pc
Khushboo Thaker 25
Why are GANs hard to train?

Khushboo Thaker 26
Non-Convergence
• D & G nullify each other's learning in every iteration
• Training for a long time does not guarantee good-quality samples
• The differential equation's solution has sinusoidal terms
• Even with a small learning rate, it will not converge
• Discrete-time gradient descent can spiral outward for a large step size
(A tiny numeric illustration follows.)
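A tiny numeric illustration of the point above: simultaneous gradient steps on the standard bilinear toy game min_x max_y (x·y) do not settle at the equilibrium (0, 0); in discrete time the iterates spiral outward for any fixed step size. The game is a textbook example, not something from the slides.

```python
x, y, lr = 1.0, 1.0, 0.1
for _ in range(200):
    gx, gy = y, x                        # dV/dx = y and dV/dy = x for V(x, y) = x * y
    x, y = x - lr * gx, y + lr * gy      # x descends, y ascends, simultaneously
print(x, y)                              # the magnitude has grown instead of shrinking
```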

Khushboo Thaker 27
Mode Collapse
• Sample coverage vs. sample accuracy
http://www.youtube.com/watch?v=ktxhiKhWoEE&t=0m30s

The generator excels in a subspace but does not cover the entire real distribution.

(Figure: standard GAN vs. Unrolled GAN, Metz et al. 2016)

Khushboo Thaker 28
Why are GANs hard to train?

• The generator keeps generating similar images – so there is nothing new to learn from

• A trade-off must be maintained between generating more accurate samples and covering the distribution

• The two learning tasks need to be balanced to achieve stability

• If the discriminator is not sufficiently trained, it can worsen the generator

• If the discriminator is over-trained, it will produce no useful gradients

Khushboo Thaker 29
Tricks to Train GAN
• One-sided label smoothing
• Historical generated batches
• Feature Matching
• Batch Normalization
• Regularizing discriminator gradient in region around real data
(DRAGAN)

Khushboo Thaker 30
One-Sided Label Smoothing
• The generator is very sensitive to the discriminator's output
• Prevents the discriminator from giving very large gradients
• Does not reduce classification accuracy, only confidence
• Smooth only the positive (real) samples (see the sketch below)
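A minimal sketch of one-sided label smoothing in the spirit of Salimans et al. 2016: replace the real-data target 1.0 with a softer value such as 0.9 while leaving fake targets at 0.0. The function name, the logit inputs, and the smoothing value are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def discriminator_loss(d_real, d_fake, smooth=0.9):
    # d_real, d_fake: raw discriminator logits for real and generated batches
    real_targets = torch.full_like(d_real, smooth)    # smoothed positives only
    fake_targets = torch.zeros_like(d_fake)           # negatives stay at 0
    return (F.binary_cross_entropy_with_logits(d_real, real_targets)
            + F.binary_cross_entropy_with_logits(d_fake, fake_targets))
```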

Salimans, Tim, et al. "Improved techniques for training gans." Advances in Neural
Information Processing Systems. 2016.

Khushboo Thaker 31
Historical Generated Batches
• Helps stabilize discriminator training in the early stages
• Don't let the discriminator forget what it has already learned (see the sketch below)
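A rough sketch of a history buffer of generated images, in the spirit of Shrivastava et al. 2017: keep a pool of past fakes and build discriminator batches partly from fresh fakes and partly from the pool, so the discriminator does not forget earlier generator behaviour. The class name, buffer size, and 50/50 mixing rule are illustrative choices, not values from the slides.

```python
import random
import torch

class FakePool:
    def __init__(self, capacity=512):
        self.capacity = capacity
        self.images = []

    def query(self, fakes):
        out = []
        for img in fakes:
            if len(self.images) < self.capacity:
                self.images.append(img.detach().clone())   # fill the pool first
                out.append(img)
            elif random.random() < 0.5:
                idx = random.randrange(self.capacity)       # swap a stored fake for a fresh one
                out.append(self.images[idx].clone())
                self.images[idx] = img.detach().clone()
            else:
                out.append(img)
        return torch.stack(out)                             # batch mixing old and new fakes
```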

Shrivastava, Ashish, et al. "Learning from Simulated and Unsupervised Images through Adversarial Training." CVPR. Vol. 2. No. 4. 2017.

Khushboo Thaker 32
Feature Matching
• Generated images must match the statistics of real images
• The discriminator defines those statistics
• The generator is trained so that the expected value of its statistics matches the expected value of the real statistics
• The generator tries to minimize the L2 distance between expected values in some arbitrary space
• The discriminator defines that arbitrary space (see the sketch below)
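A minimal sketch of the feature-matching loss: match the expected value of an intermediate discriminator activation on real vs. generated data instead of fooling the final output. `discriminator_features` is an assumed callable returning that intermediate layer's activations; it is not named in the slides.

```python
import torch

def feature_matching_loss(discriminator_features, real_batch, fake_batch):
    real_stats = discriminator_features(real_batch).mean(dim=0)   # E[f(x)] over real data
    fake_stats = discriminator_features(fake_batch).mean(dim=0)   # E[f(G(z))] over fakes
    return torch.sum((real_stats - fake_stats) ** 2)              # L2 distance of expectations
```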

Khushboo Thaker 33
Batch Normalization
• Construct different mini-batches for real and fake data
• Each mini-batch needs to contain only real images or only generated images
• Makes samples within a batch less dependent on each other (see the sketch below)
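A sketch of the separate-mini-batches advice: pass real and generated images through the discriminator in two separate forward calls, so each BatchNorm layer computes its batch statistics over a purely real or purely fake batch instead of a mixture. The tiny discriminator and the random tensors are placeholders.

```python
import torch
import torch.nn as nn

D = nn.Sequential(nn.Linear(784, 128), nn.BatchNorm1d(128), nn.LeakyReLU(0.2), nn.Linear(128, 1))

real = torch.randn(64, 784)        # stand-in for a batch of real images
fake = torch.randn(64, 784)        # stand-in for a batch of generated images

d_real = D(real)                   # BN statistics come from real images only
d_fake = D(fake)                   # BN statistics come from generated images only
# as opposed to D(torch.cat([real, fake])), which would mix the two populations
```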

Khushboo Thaker 34
DRAGAN
• Failed GANs typically have extreme gradients / sharp peaks around real data
• Regularize GANs to reduce the gradient of the discriminator in a region around the real data (see the sketch below)
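A rough sketch of a DRAGAN-style regularizer: penalize the discriminator's gradient norm at points perturbed around the real data, discouraging the sharp peaks mentioned above. The noise scale and penalty weight are illustrative, and `real` is assumed to be a batched tensor (batch, features...).

```python
import torch

def dragan_penalty(D, real, noise_std=0.5, weight=10.0):
    perturbed = real + noise_std * torch.randn_like(real)      # points near the data manifold
    perturbed.requires_grad_(True)
    scores = D(perturbed)
    grads = torch.autograd.grad(scores.sum(), perturbed, create_graph=True)[0]
    grad_norm = grads.flatten(1).norm(2, dim=1)                # per-example gradient norm
    return weight * ((grad_norm - 1.0) ** 2).mean()            # keep gradients close to 1
```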

Khushboo Thaker 35
Few variations of GAN
• Conditional GAN
• LapGAN
• DCGAN
• CatGAN
• InfoGAN
• AAE
• DRAGAN
• IRGAN

Khushboo Thaker 36
Conditional GAN
• The generator learns P(X | Z, Y)
• The discriminator learns P(L | X, Y)
• Produces much better samples (see the sketch below)
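A minimal sketch of the conditioning idea from Mirza & Osindero 2014: the label Y is fed to both networks, here by simple concatenation with the noise z and the sample x. Layer sizes and the one-hot label choice are placeholders, not the paper's architecture.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

n_z, n_y, n_x = 100, 10, 784

G = nn.Sequential(nn.Linear(n_z + n_y, 256), nn.ReLU(), nn.Linear(256, n_x), nn.Tanh())
D = nn.Sequential(nn.Linear(n_x + n_y, 256), nn.LeakyReLU(0.2), nn.Linear(256, 1))

z = torch.randn(64, n_z)
y = F.one_hot(torch.randint(0, n_y, (64,)), n_y).float()   # one-hot class labels
fake = G(torch.cat([z, y], dim=1))                         # generator models P(X | Z, Y)
score = D(torch.cat([fake, y], dim=1))                     # discriminator sees (X, Y) pairs
```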

Mirza, M. and Osindero, S., 2014. Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784.

Khushboo Thaker 37
Khushboo Thaker 38
DCGAN
• Multiple Convolutional Layers
• Batch Normalization
• Strides with Convolution
• Leaky ReLUs

Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." (2015).

Khushboo Thaker 39
DCGAN

Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." (2015).

Khushboo Thaker 40
InfoGAN
• Rewards disentanglement (individual dimensions capturing key attributes of the images)

• The input is partitioned into two parts:
z – captures slight variations in the images
y – captures the main attributes of the images

• Mutual information – maximize the mutual information between the code and the generator's output (see the sketch below)
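A sketch of the InfoGAN idea: split the input into noise z and a structured code y, and add an auxiliary head Q that reconstructs y from the generated sample; maximizing log Q(y | G(z, y)) is the usual variational lower bound on the mutual information. Shapes, the categorical-code choice, and the tiny networks are assumptions for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

n_z, n_code, n_x = 62, 10, 784
G = nn.Sequential(nn.Linear(n_z + n_code, 256), nn.ReLU(), nn.Linear(256, n_x))
Q = nn.Sequential(nn.Linear(n_x, 128), nn.ReLU(), nn.Linear(128, n_code))   # Q(y | x)

z = torch.randn(32, n_z)
code_idx = torch.randint(0, n_code, (32,))
y = F.one_hot(code_idx, n_code).float()

fake = G(torch.cat([z, y], dim=1))
q_logits = Q(fake)
# negative lower bound on the mutual information; minimized alongside the GAN loss
mi_loss = F.cross_entropy(q_logits, code_idx)
```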

Khushboo Thaker 41
InfoGAN

Khushboo Thaker 42
BiGANs
• Encoder
• Decoder
• Discriminator

Khushboo Thaker 43
LapGANs
• To scale GANs to large images

• A Laplacian pyramid is used to generate the image at different scales

Denton EL, Chintala S, Fergus R. Deep generative image models using a Laplacian pyramid of adversarial networks. NIPS 2015 (pp. 1486-1494).

Khushboo Thaker 44
LapGAN

Denton EL, Chintala S, Fergus R. Deep generative image models using a Laplacian pyramid of adversarial networks. In Advances in Neural Information Processing Systems 2015 (pp. 1486-1494).

Khushboo Thaker 45
DCGAN
• Multiple Convolutional Layers
• Batch Normalization
• Strides with Convolution
• Leaky ReLUs

Radford, Alec, Luke Metz, and Soumith Chintala. "Unsupervised representation learning with deep convolutional generative adversarial networks." (2015).

Khushboo Thaker 46
Khushboo Thaker 47
Adversarial Autoencoder (GAN + VAE)

Khushboo Thaker 48
Khushboo Thaker 49
GAN for Text
• GANs for Language Generation (Yu et al. 2017)
• GANs for MT (Yang et al. 2017)
• GANs for Dialogue Generation (Li et al. 2016)
• GANs for fake news detection (Yang et al. 2017)
• GANs for Information Retrieval

Khushboo Thaker 50
GAN and RL Connection
• GANs – inverse reinforcement learning
• GANs – imitation learning
• GANs – actor-critic framework

• REINFORCE – policy-gradient-based learning
• Gumbel-Softmax (see the sketch below)
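A small sketch of the Gumbel-Softmax trick mentioned above: it gives a differentiable, continuous relaxation of sampling from a categorical distribution, which is one way to backpropagate a discriminator's signal through discrete token choices (the alternative being REINFORCE-style policy gradients). The logit shape and temperature are illustrative.

```python
import torch
import torch.nn.functional as F

logits = torch.randn(4, 1000, requires_grad=True)             # unnormalized token scores
soft_tokens = F.gumbel_softmax(logits, tau=0.5, hard=False)   # relaxed one-hot samples
# hard=True would return straight-through one-hot vectors while keeping gradients
```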

Khushboo Thaker 51
Conclusion
• GANs are an active area of research
• The GAN architecture is flexible enough to support a variety of learning problems
• GANs are not guaranteed to converge
• GANs are able to capture perceptual similarity and generate better images than VAEs
• A lot of work is still needed on the theoretical foundations of GANs
• Evaluation of GANs is still open research (Theis et al.)

Khushboo Thaker 52
Important Papers to dig into GAN
• NIPS 2016 Tutorial – Ian Goodfellow
• Arjovsky, Martin, and Léon Bottou. "Towards principled methods for training generative adversarial
networks." arXiv preprint arXiv:1701.04862 (2017).
• Roth, Kevin, et al. "Stabilizing training of generative adversarial networks through regularization." Advances
in Neural Information Processing Systems. 2017.
• Li, Jerry, et al. "Towards understanding the dynamics of generative adversarial networks." arXiv preprint
arXiv:1706.09884 (2017).
• Kodali, Naveen, et al. "On convergence and stability of GANs." arXiv preprint arXiv:1705.07215 (2017).
• Fedus, William, et al. "Many Paths to Equilibrium: GANs Do Not Need to Decrease a Divergence At Every Step." arXiv preprint arXiv:1710.08446 (2017).
• https://github.com/soumith/ganhacks#authors
• http://www.inference.vc/instance-noise-a-trick-for-stabilising-gan-training/
• https://www.araya.org/archives/1183

Khushboo Thaker 53
Startup code, Tools and Tricks
• https://github.com/soumith/ganhacks#authors

• https://medium.com/@utk.is.here/keep-calm-and-train-a-gan-pitfalls-and-tips-on-training-generative-adversarial-networks-edd529764aa9

• https://jhui.github.io/2017/03/05/Generative-adversarial-models/

Khushboo Thaker 54
References
• Deep Learning Book
• GAN tutorial paper: https://arxiv.org/abs/1701.00160
• GAN slides: http://slazebni.cs.illinois.edu/spring17/lec11_gan.pd
• GAN tutorial video: https://www.youtube.com/watch?v=HGYYEUSm-0Q
• GAN for text: http://www.phontron.com/class/nn4nlp2017/assets/slides/nn4nlp-17-adversarial.pdf

Khushboo Thaker 55
Not the end..

Khushboo Thaker 56
Thank You for Listening
Questions ?

Khushboo Thaker 57
