Introduction To Deep Learning: TA: Drew Hudson May 8, 2020
Slides credits: Atharva Parulekar, Jingbo Yang, Drew Hudson, Guanzhi Wang
Overview
● Motivation for deep learning
● Convolutional neural networks
● Recurrent neural networks
● Deep learning tools
But we learned the multi-layer perceptron in class?
Expensive to learn. Will not generalize well.
Does not exploit the order and local relations in the data!
A 64x64x3 image has 12288 inputs, so every neuron in the first fully connected layer needs 12288 weights.
We also want many layers.
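To make the cost argument concrete, here is a rough parameter count. It is only a sketch: the hidden width of 100 units and the 3x3 kernel with 100 filters are assumed values chosen for illustration and do not come from the slides, and `dense_params` and `conv_params` are hypothetical helper names.

```python
def dense_params(n_inputs, n_units):
    """Weights plus one bias per unit in a fully connected layer."""
    return n_inputs * n_units + n_units


def conv_params(kernel_h, kernel_w, in_channels, n_filters):
    """Weights plus one bias per filter; independent of the image size."""
    return kernel_h * kernel_w * in_channels * n_filters + n_filters


n_inputs = 64 * 64 * 3                    # flattened 64x64 RGB image
print(n_inputs)                           # 12288
print(dense_params(n_inputs, 100))        # 1228900 (assumed 100 hidden units)
print(conv_params(3, 3, 3, 100))          # 2800 (assumed 100 filters of size 3x3)
```

Under these assumptions, a convolutional layer shares the same small filter across all positions, so its parameter count does not grow with the image size.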
What are areas of deep learning?
● Convolutional NN: Image
● Recurrent NN: Time Series
● Graph NN: Networks/Relational
● Deep RL: Control System
Convolutional Neural Network
Learnable filters!!
Convolution on multiple channels
Images are generally RGB !!
Images that maximize filter outputs at certain layers. We observe that the images get more complex as filters are situated deeper.
How deeper layers can learn deeper embeddings. How an eye is made up of multiple curves and a face is made up of two eyes.
How do we use convolutions?
Let convolutions extract features and let ordinary fully connected layers decide on them.
Image credit: LeCun et al. (1998)
Convolution really is just a linear operation. In fact, convolution is a giant matrix multiplication.
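The matrix view can be checked on a small case. The sketch below uses a 1-D signal rather than an image to stay short, and it uses the sliding-window form common in CNNs (no kernel flip). The helper names `conv1d_valid`, `conv_matrix`, and `matvec` are made up for this example.

```python
def conv1d_valid(x, w):
    """Slide the kernel w over x with no padding and no kernel flip."""
    k = len(w)
    return [sum(w[j] * x[i + j] for j in range(k))
            for i in range(len(x) - k + 1)]


def conv_matrix(w, n):
    """Build the (n-k+1) x n matrix whose rows are shifted copies of w."""
    k = len(w)
    return [[0.0] * i + list(w) + [0.0] * (n - k - i)
            for i in range(n - k + 1)]


def matvec(m, v):
    """Plain matrix-vector product."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


x = [1.0, 2.0, 3.0, 4.0, 5.0]
w = [1.0, 0.0, -1.0]

direct = conv1d_valid(x, w)
via_matrix = matvec(conv_matrix(w, len(x)), x)
print(direct)       # [-2.0, -2.0, -2.0]
print(via_matrix)   # [-2.0, -2.0, -2.0]
```

The matrix is mostly zeros and repeats the same few weights in every row, which is why real implementations avoid building it explicitly.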
Memory size
Compute time
highway.
ResNet (2015)
Image credit: He et al. (2015)
Initialization
Can we initialize all neurons to zero?
If all the weights are the same, we will not be able to break the symmetry of the network, and all filters will end up learning the same thing.
Once ReLU units are knocked out and their output is zero, their gradient flow also becomes zero.
We need small random numbers at initialization.
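The symmetry argument can be seen on a toy network. This is a sketch under assumptions that are not in the slides: a network with 2 inputs, 2 tanh hidden units, and 1 linear output, a squared-error loss, a single training example, and plain gradient descent. The names `grads` and `train` are hypothetical.

```python
import math
import random


def grads(W1, w2, x, y):
    """Gradients of 0.5 * (pred - y)**2 for the toy tanh network."""
    h = [math.tanh(sum(wi * xi for wi, xi in zip(row, x))) for row in W1]
    pred = sum(a * b for a, b in zip(w2, h))
    err = pred - y
    g_w2 = [err * hj for hj in h]
    g_W1 = [[err * w2[j] * (1 - h[j] ** 2) * xi for xi in x]
            for j in range(len(W1))]
    return g_W1, g_w2


def train(W1, w2, x, y, lr=0.1, steps=50):
    """Run a few steps of gradient descent and return the final weights."""
    for _ in range(steps):
        g_W1, g_w2 = grads(W1, w2, x, y)
        W1 = [[w - lr * g for w, g in zip(row, g_row)]
              for row, g_row in zip(W1, g_W1)]
        w2 = [w - lr * g for w, g in zip(w2, g_w2)]
    return W1, w2


x, y = [1.0, -2.0], 1.0

# All-zero init: in this toy network every gradient is zero, so nothing learns.
zero_g_W1, zero_g_w2 = grads([[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0], x, y)
print(all(g == 0 for row in zero_g_W1 for g in row))   # True

# Constant init: both hidden units get identical gradients and stay identical.
W1_same, _ = train([[0.5, 0.5], [0.5, 0.5]], [0.5, 0.5], x, y)
print(W1_same[0] == W1_same[1])                        # True

# Small random init: the hidden units start different and can specialize.
rng = random.Random(0)
W1_init = [[rng.uniform(-0.1, 0.1) for _ in range(2)] for _ in range(2)]
w2_init = [rng.uniform(-0.1, 0.1) for _ in range(2)]
W1_rand, _ = train(W1_init, w2_init, x, y)
print(W1_rand[0] == W1_rand[1])
```

With equal starting weights the two hidden units are interchangeable, so the second unit adds no capacity. The random start is what lets them learn different features.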
Recurrent NN
Time Series
I love CS !
There are 2 types of RNN cells
● LSTM: store in “long term memory”; response to current input
● GRU: reset gate; update gate; response to current input
No hard dependency
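The reset and update gates named above are the gates of a GRU cell. Below is a minimal scalar sketch of one GRU step. The parameter names in `p` are hypothetical, real cells use weight matrices rather than single floats, and libraries differ on which side of the blend the update gate multiplies, so the convention here is an assumption.

```python
import math


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def gru_step(x, h, p):
    """One step of a scalar GRU cell; p maps parameter names to floats."""
    z = sigmoid(p["wz"] * x + p["uz"] * h + p["bz"])   # update gate
    r = sigmoid(p["wr"] * x + p["ur"] * h + p["br"])   # reset gate
    # Response to the current input; the reset gate scales the old state.
    h_cand = math.tanh(p["wh"] * x + p["uh"] * (r * h) + p["bh"])
    # The update gate blends the old state with the candidate state.
    return (1.0 - z) * h + z * h_cand


zero = {k: 0.0 for k in ("wz", "uz", "bz", "wr", "ur", "br", "wh", "uh", "bh")}
print(gru_step(1.0, 0.8, zero))                    # 0.4

# A strongly negative update-gate bias keeps the old state almost unchanged.
keep = dict(zero, bz=-50.0)
print(abs(gru_step(1.0, 0.8, keep) - 0.8) < 1e-9)  # True
```

When the update gate stays near zero, the state passes through many steps nearly untouched, which is how such cells carry information over long sequences.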
More on convolutional and recurrent NNs? Take CS230, CS236, CS231N, CS224N
Deep RL and Graph NN: not today, but take CS234 and CS224W
Tools for deep learning
Specialized Groups
Popular Tools
$50 not enough! Where can I get free stuff?
● Google Colab: free (limited-ish) GPU access, works nicely with TensorFlow, links to Google Drive
● Azure Notebook
● Kaggle kernel???
● Amazon SageMaker?