2002.12164v2
Abstract
Extracting large amounts of data from biological samples is often not feasible due to radiation damage, and image processing in this small-data regime is one of the critical challenges when only a limited number of labeled samples is available. In this work, we apply an existing algorithm, the Variational Auto Encoder (VAE), to pre-train a latent space representation of the data that captures its features in a lower dimension. The weights of the pre-trained encoder are then fixed, and the latent space is used as the feature-extraction segment of a neural network that is fine-tuned for classification. We present a performance analysis of the VAE algorithm with various latent space sizes in a semi-supervised setting on the CIFAR-10 dataset.
1 Introduction
Artificial neural networks (ANNs), specifically Convolutional Neural Networks (CNNs), have become popular in recent years due to their success in image classification, feature extraction, and object recognition and detection [1]. CNNs leverage the huge amount of labeled data available to train networks that outperform humans in image recognition tasks. However, in the small-data regime, the accuracy of networks trained with a limited number of labeled samples is low [2]. This is a typical situation when working with biological samples, where exposure to radiation (in order to capture an image) is detrimental to the well-being of the sample. More images can be derived from the initial data by augmentation methods, but this is of limited help because labeled images remain scarce.
To address this problem, there exists a framework called the “Auto Encoder” (AE) [3] that uses all the input data, labeled and unlabeled, to train a low-dimensional embedding. An AE is a neural network that takes unlabeled images as input and regards the input itself as the label. As illustrated in Figure 1, an AE comprises two parts: the encoder and the decoder. The encoder embeds the features of the original image in a latent space, and the decoder tries to restore the image from this latent representation back to the original image. The process of training the weights of both the encoder and the decoder is called “pre-training”. Once trained, the encoder part of the AE provides a representation of all the labeled and unlabeled data, which increases the amount of usable information extracted from the images.
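As a minimal sketch of this pre-training step, the snippet below shows a simple convolutional auto encoder in PyTorch trained to reconstruct its own (unlabeled) input; the layer sizes are illustrative assumptions and not the dense-net architecture used later in this work.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AutoEncoder(nn.Module):
    """Minimal convolutional auto encoder: the encoder maps an image to a
    low-dimensional latent map, the decoder reconstructs the image."""
    def __init__(self):
        super().__init__()
        # Encoder: 3x32x32 image -> 32x8x8 latent map (illustrative sizes)
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),   # -> 16x16x16
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),  # -> 32x8x8
            nn.ReLU(),
        )
        # Decoder: latent map -> reconstructed 3x32x32 image
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(32, 16, kernel_size=2, stride=2),    # -> 16x16x16
            nn.ReLU(),
            nn.ConvTranspose2d(16, 3, kernel_size=2, stride=2),     # -> 3x32x32
            nn.Sigmoid(),
        )

    def forward(self, x):
        z = self.encoder(x)
        return self.decoder(z)

# Pre-training uses only images (no labels): the input is also the target.
model = AutoEncoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
images = torch.rand(16, 3, 32, 32)            # a dummy unlabeled batch
optimizer.zero_grad()
reconstruction = model(images)
loss = F.mse_loss(reconstruction, images)     # reconstruction error
loss.backward()
optimizer.step()
```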
Traditional semi-supervised models consist of pre-training using a Restricted Boltzmann Machine (RBM) [4] or a Gaussian-Restricted Boltzmann Machine (G-RBM) [4]. An RBM is an energy-based model represented by an undirected graph containing a layer of observable variables and a single layer of latent variables (similar to hidden units in a multi-layer perceptron) [1, 5].
This energy-based model was first introduced in the 1980s [6] and has been applied to diverse datasets, including image [4] and medical data [7]. Hinton and Salakhutdinov [8] showed that RBMs can be stacked and trained in a greedy manner. The resulting deep learning model, the Deep Belief Network (DBN), which uses the RBM as its building block, has been applied to various unsupervised and supervised learning problems. Later, Bengio et al. [9] showed that pre-trained undirected graphical models in a semi-supervised setting perform well with deep architectures. However, a challenge when working with RBMs is that they use the sigmoid function as the activation between the input and the hidden layer, and a major drawback of the sigmoid activation function is the vanishing gradient problem. Hence, in this work, we pre-train the model using a “Variational Auto Encoder” (VAE) [3].

∗ Varun Mannam is with the Department of Electrical Engineering, University of Notre Dame, Notre Dame, IN 46556 USA (e-mail: [email protected]).
After pre-training, the encoder can take images similar to the training images and extract their salient features in a lower dimension than the initial image. It is then possible to couple the encoder with a small neural network and train that network for classification tasks. This is similar to transfer learning, where a model trained on one dataset provides the feature-extraction part of a model trained for a different dataset. It is important to note that the encoder weights are fixed and cannot be changed; only the small neural network is trained, and its input is the small set of labeled data. This step is called “fine-tuning”.
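A minimal sketch of this fine-tuning setup in PyTorch is shown below; the stand-in encoder, the flattened latent size, and the single-layer classifier head are illustrative assumptions. In practice the encoder weights would be loaded from the pre-trained model rather than initialized at random.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Stand-in for the pre-trained encoder (weights would be loaded, not random).
encoder = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1), nn.ReLU(),
    nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
)
latent_dim = 32 * 8 * 8  # flattened latent size for this illustrative encoder

# Freeze the encoder weights: only the classifier head will be trained.
for p in encoder.parameters():
    p.requires_grad = False

# Small classification network operating on the latent representation.
classifier = nn.Sequential(
    nn.Flatten(),
    nn.Linear(latent_dim, 10),   # CIFAR-10 has 10 classes
)

# Only the classifier parameters are handed to the optimizer.
optimizer = torch.optim.Adam(classifier.parameters(), lr=1e-4)

labeled_images = torch.rand(16, 3, 32, 32)     # small labeled batch (dummy data)
labels = torch.randint(0, 10, (16,))
optimizer.zero_grad()
with torch.no_grad():
    z = encoder(labeled_images)                # frozen feature extraction
logits = classifier(z)
loss = F.cross_entropy(logits, labels)
loss.backward()
optimizer.step()
```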
In this project we implement a VAE that captures not only a compressed representation of the images but also the parameters of a probability distribution representing the data. We examine how different latent space sizes affect the accuracy of the model, and then analyze the performance of the semi-supervised model with the optimal latent space.
2 Background
2.1 AE
An autoencoder is a type of ANN used to learn efficient data encodings in an unsupervised manner. The aim of an
autoencoder is to learn a representation of a set of data, typically for dimensionality reduction, by training the network
to ignore signal noise. Along with the reduction side, a reconstructing side is learned, where the autoencoder tries to
generate from the reduced encoding a representation as close as possible to its original input. An autoencoder always
consists of two parts, the encoder and the decoder, which can be defined as transitions φ and ψ such that [3]:
$\phi : \mathcal{X} \rightarrow \mathcal{F}$ (1)
$\psi : \mathcal{F} \rightarrow \mathcal{Y}$ (2)
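In the standard formulation of [3], the two transitions are chosen jointly to minimize the reconstruction error between the input and its reconstruction:

$\phi, \psi = \operatorname{arg\,min}_{\phi, \psi} \, \lVert X - (\psi \circ \phi)\, X \rVert^{2}$ (3)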
Figure 2: Network architecture: pre-training with VAE (first row). In the second row, the parameter θ is underlined and bold to indicate that these parameters are frozen when fine-tuning the network.
2.2 VAE
Extending the idea of the AE, the VAE is a variant that uses the KL-divergence between the predicted probability distribution and the actual posterior distribution in the latent space, whereas traditional AEs try to find accurate mapping functions in the encoder (between the input and the latent space) and in the decoder (between the latent space and the output). Using a VAE, we effectively generate a larger dataset by adding noise in the latent space, which is similar to input data augmentation (adding noise to images to increase the number of examples in the input dataset).
VAE uses a variational approach for latent representation learning, which results in an additional loss component. It assumes that the data are generated by a directed graphical model $p(X|Z)$ and that the encoder learns an approximation $q_\phi(Z|X)$ of the posterior distribution $p_\theta(Z|X)$, where $\phi$ and $\theta$ denote the parameters of the encoder (recognition model) and decoder (generative model), respectively. We can write the conditional (posterior) distribution as

$p(z|x) = \dfrac{p(z, x)}{p(x)} = \dfrac{p(x|z)\, p(z)}{p(x)}$ (4)
The denominator of the above equation is the marginal distribution of the observations and is calculated by marginalizing out the latent variables from the joint distribution, i.e.,

$p(x) = \int_{z} p(z, x)\, dz$ (5)
In many cases of interest this integral is not available in closed form or is intractable (it requires exponential time to compute). Hence, we consider a variational approximation as follows: consider a tractable distribution $q(z)$. The goal is to find the best approximation, i.e., the one that solves the following optimization problem:

Minimize: $D_{KL}\left[\, q_\phi(z|x) \,\|\, p_\theta(z|x) \,\right]$ (6)
Therefore, the objective of the variational autoencoder in this case has the following form:

$\mathcal{L}(\phi, \theta, x) = D_{KL}\left( q_\phi(z|x) \,\|\, p_\theta(z) \right) - \mathbb{E}_{q_\phi(z|x)}\left[ \log p_\theta(x|z) \right]$ (7)

where $D_{KL}$ denotes the KL-divergence. In the VAE, the principle is to minimize the loss between the input and the restored image along with the loss incurred by the latent space in representing the features of the input images.
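A minimal sketch of this loss in PyTorch is given below, assuming a Gaussian approximate posterior parameterized by `mu` and `logvar` and a pixel-wise mean-squared reconstruction term; these names and choices are illustrative assumptions, not the exact implementation used in this work.

```python
import torch
import torch.nn.functional as F

def vae_loss(recon_x, x, mu, logvar):
    """Equation 7: reconstruction term plus KL divergence between the
    approximate posterior q_phi(z|x) = N(mu, exp(logvar)) and the
    standard-normal prior p_theta(z)."""
    # -E_q[log p_theta(x|z)]: pixel-wise reconstruction error (illustrative choice)
    recon = F.mse_loss(recon_x, x, reduction="sum")
    # Closed-form KL divergence between N(mu, sigma^2) and N(0, 1)
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + kl

def reparameterize(mu, logvar):
    """Sample z = mu + sigma * eps so the sampling step stays differentiable
    (this is how noise is added in the latent space during training)."""
    std = torch.exp(0.5 * logvar)
    eps = torch.randn_like(std)
    return mu + eps * std
```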
3 Methodology
Consider a surrogate model $y = f(x, \theta)$ which is trained using limited simulation data $\mathcal{D} = \{x^i, y^i\}_{i=1}^{N} \cup \{x^j\}_{j=N+1}^{D}$. Here the input data $x^i \in \mathbb{R}^{d_x \times H \times W}$ are images from the CIFAR-10 dataset, where $H$ and $W$ are the height and width, respectively, and $d_x$ is the number of dimensions of the input $x$ at one location. The $x^j$ are the additional data utilized for pre-training the model, and $y^i \in \mathbb{R}^1$ is the classification result. $\theta$ denotes the model parameters, $N$ is the total number of training data utilized during fine-tuning, and $D$ is the total number of data utilized for pre-training. In the semi-supervised model, we pre-train the model on the input data in $\mathbb{R}^{d_x \times H \times W}$ and then perform the image classification task $\mathbb{R}^{d_x \times H \times W} \rightarrow \mathbb{R}^1$.
For both pre-training and fine-tuning, we used stochastic gradient descent with the Adam optimizer to update the network weights and biases. The simulations were performed using the PyTorch machine learning package in Python.
We implemented a dense-net [10], [11] version of the VAE for the pre-training part. The dense-net contains the encoder and decoder blocks along with dense layers that capture both simple and complex features.
For classification of the input images from the CIFAR-10 [12] data, we implemented a simple fully connected layer. This is because the latent space is expected to be small (either 4 × 4 or 8 × 8, compared with the 32 × 32 image size). If the number of channels at the latent space is large, we add more fully connected layers for the classification of images.
4 Data
We perform the simulations on the CIFAR-10 dataset, which has ten image classes with three input channels (C = 3) and images of size 32 × 32 (W × H). The CIFAR-10 dataset has 50,000 training images and 10,000 test images.
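As a sketch, the dataset can be loaded with the torchvision package as shown below; the `ToTensor` preprocessing and batch size are assumptions for illustration.

```python
import torch
from torchvision import datasets, transforms

transform = transforms.ToTensor()  # assumed preprocessing; not stated in the paper

train_set = datasets.CIFAR10(root="./data", train=True, download=True, transform=transform)
test_set = datasets.CIFAR10(root="./data", train=False, download=True, transform=transform)

train_loader = torch.utils.data.DataLoader(train_set, batch_size=16, shuffle=True)
test_loader = torch.utils.data.DataLoader(test_set, batch_size=16, shuffle=False)

print(len(train_set), len(test_set))  # 50000 training and 10000 test images
```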
5 Results
In this section we present the results obtained using the CIFAR-10 data. We consider the following latent dimensions: 6400, 10000, and 14400. In order to evaluate the performance of the model for the above three latent spaces, we consider the distribution estimated for the values at various pixel locations.
5.1 Pre-training
For the results presented in this section, we have a dataset with 50,000 examples $\{x^k\}_{k=1}^{50000}$ for pre-training, and the test set consists of 10,000 examples $\{x^k\}_{k=1}^{10000}$. The Adam optimizer was used for training over 100 epochs, with a learning rate of 1e-4 and a plateau scheduler on the test RMSE. The batch size is always smaller than the number of training data; in this work, a batch size of 16 was used for pre-training. Weight decay was set to 1e-3 for pre-training.
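A sketch of this pre-training configuration in PyTorch follows; the stand-in model, the placeholder test metric, and the scheduler defaults are assumptions, while the learning rate, weight decay, number of epochs, and batch size follow the values stated above.

```python
import torch
import torch.nn as nn

# Stand-in module; in this work it is the dense-net VAE described in Section 3.
model = nn.Sequential(nn.Conv2d(3, 16, kernel_size=3, padding=1))

# Adam with the learning rate and weight decay stated above.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4, weight_decay=1e-3)

# Reduce the learning rate when the monitored test metric (RMSE) plateaus.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, mode="min")

for epoch in range(100):
    # ... one pre-training pass over mini-batches of size 16 goes here ...
    test_rmse = 1.0 / (epoch + 1)        # placeholder value for the test RMSE
    scheduler.step(test_rmse)            # plateau scheduler steps on the test RMSE
```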
We use the loss function of Equation 7 to evaluate the trained model on the test data and also to monitor convergence. From Figure 3, we observe that the solution converges after 50 epochs and, most importantly, that the loss is similar for the three latent spaces.
From Figure 4, we observe that even when the latent size is small (Batch × 100 (channels) × 8 × 8 and Batch × 100 (channels) × 10 × 10), the reconstructed density estimate is close to the actual input data. The PDF with latent size 10000 is closer to the actual input than those for the 6400 and 14400 latent spaces. Since all the latent spaces yield similar outputs, we fine-tune and compare the classification accuracy in the next section.
5.2 Fine-tuning
In this section we freeze the parameters (weights and biases) learned in the pre-training stage and fine-tune the parameters (weights and biases) of the classification network. For this problem, we consider a small labeled subset of the CIFAR-10 dataset and use fully connected layers to perform classification. Since the cross-entropy loss function is commonly used for classification problems, we use cross-entropy to measure the performance of the classification model. From Figure 5, we observe that the latent space of size 10000 yields better accuracy than the other two latent spaces (6400 and 14400). The smaller the latent space, the lower the test accuracy; this is due to insufficient features to classify the data. For the large latent space, the test accuracy is also low; this is due to model complexity [1].
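The test accuracy reported for each latent size can be computed with a standard evaluation loop; the sketch below reuses the hypothetical `encoder`, `classifier`, and `test_loader` names from the earlier snippets.

```python
import torch

def test_accuracy(encoder, classifier, test_loader, device="cpu"):
    """Fraction of correctly classified test images for a frozen encoder
    and a trained classifier head (names follow the earlier sketches)."""
    correct, total = 0, 0
    classifier.eval()
    with torch.no_grad():
        for images, labels in test_loader:
            images, labels = images.to(device), labels.to(device)
            logits = classifier(encoder(images))
            predictions = logits.argmax(dim=1)
            correct += (predictions == labels).sum().item()
            total += labels.size(0)
    return correct / total
```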
6 Conclusion
The present document outlines the development of a surrogate model for a semi-supervised problem. In this work, we implemented a VAE as the pre-training model and a feed-forward deep learning model for classification. The results obtained for differently sized latent spaces were presented. It was observed that there is a slight improvement in the test accuracy when the latent space is 10000 in comparison with latent spaces of 6400 and 14400.
Figure 3: Error vs. epoch for the 6400 latent space (top), the 10000 latent space (middle), and the 14400 latent space (bottom)
Figure 4: Distribution estimate for the values at various locations of the square domain for the 6400, 10000, and 14400 latent spaces
Figure 5: Fine-tuning results with the three different latent space models
For future work, a Bayesian approach can be explored. Due to the limited amount of data, it is necessary to build an appropriate surrogate model, and it is important to quantify the epistemic uncertainty induced by the limited data [13], [14]; hence, a Bayesian probabilistic approach is a natural way of addressing this challenge.
References
[1] Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep learning. MIT press, 2016.
[2] Nitesh V Chawla et al. Learning from labeled and unlabeled data: An empirical study across techniques and
domains. Journal of Artificial Intelligence Research, 23:331–366, 2005.
[3] Wikipedia contributors. Autoencoder — Wikipedia, The Free Encyclopedia, 2019.
[4] Geoffrey E Hinton and Ruslan R Salakhutdinov. Reducing the dimensionality of data with neural networks.
Science, 313(5786):504–507, 2006.
[5] Kevin P Murphy. Machine learning: a probabilistic perspective. MIT press, 2012.
[6] Paul Smolensky. Information processing in dynamical systems: Foundations of harmony theory. Technical report,
Colorado Univ at Boulder Dept of Computer Science, 1986.
[7] Tu Dinh Nguyen, Truyen Tran, Dinh Phung, and Svetha Venkatesh. Latent patient profile modelling and
applications with mixed-variate restricted boltzmann machine. In Pacific-Asia conference on knowledge discovery
and data mining, pages 123–135. Springer, 2013.
[8] G Hinton and R Salakhutdinov. An efficient learning procedure for deep boltzmann machines. Neural Computation,
24(8):1967–2006, 2012.
[9] Yoshua Bengio, Pascal Lamblin, Dan Popovici, and Hugo Larochelle. Greedy layer-wise training of deep networks.
In Advances in neural information processing systems, pages 153–160, 2007.
[10] Gao Huang, Zhuang Liu, Laurens Van Der Maaten, and Kilian Q Weinberger. Densely connected convolutional
networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4700–4708,
2017.
[11] Yide Zhang, Yinhao Zhu, Evan Nichols, Qingfei Wang, Siyuan Zhang, Cody Smith, and Scott Howard. A
Poisson-Gaussian denoising dataset with real fluorescence microscopy images. arXiv preprint arXiv:1812.10366,
2018.
[12] Alex Krizhevsky and Geoffrey Hinton. Learning multiple layers of features from tiny images. Technical report,
Citeseer, 2009.
[13] Rohit K Tripathy and Ilias Bilionis. Deep uq: Learning deep neural network surrogate models for high dimensional
uncertainty quantification. Journal of Computational Physics, 375:565–588, 2018.
[14] Yinhao Zhu and Nicholas Zabaras. Bayesian deep convolutional encoder–decoder networks for surrogate modeling
and uncertainty quantification. Journal of Computational Physics, 366:415–447, 2018.