Lec10 Handout
University of Toronto
CSC411 Lec10 1 / 41
Today
Multi-layer Perceptron
Forward propagation
Backward propagation
CSC411 Lec10 2 / 41
Motivating Examples
CSC411 Lec10 3 / 41
Are You Excited about Deep Learning?
CSC411 Lec10 4 / 41
Limitations of Linear Classifiers
Figure: Four inputs (0,0), (0,1), (1,0), (1,1) labelled output = 0 or output = 1 (the XOR pattern); no linear classifier can separate the two classes.
CSC411 Lec10 6 / 41
Inspiration: The Brain
Many machine learning methods are inspired by biology, e.g., the (human) brain
Our brain has ∼ 10^11 neurons, each of which communicates with (is connected to) ∼ 10^4 other neurons
Sigmoid: σ(z) = 1/(1 + exp(−z))
Tanh: tanh(z) = (exp(z) − exp(−z))/(exp(z) + exp(−z))
CSC411 Lec10 9 / 41
Neural Network Architecture (Multi-Layer Perceptron)
Figure: Two different visualizations of a 2-layer neural network. In this example: 3 input units, 4 hidden units and 2 output units.
Each unit computes its value by applying an activation function to a linear combination of the values of the units that point into it
[http://cs231n.github.io/neural-networks-1/]
CSC411 Lec10 10 / 41
Neural Network Architecture (Multi-Layer Perceptron)
Network with one layer of four hidden units:
Figure: Two different visualizations of a 2-layer neural network. In this example: 3 input units, 4 hidden units and 2 output units.
Going deeper: a 3-layer neural network with two layers of hidden units
Figure: A 3-layer neural net with 3 input units, 4 hidden units in each of the first and second hidden layers, and 1 output unit
[http://cs231n.github.io/neural-networks-1/]
CSC411 Lec10 12 / 41
Representational Power
A neural network with at least one hidden layer is a universal approximator (it can approximate any continuous function arbitrarily well, given enough hidden units).
Proof in: Approximation by Superpositions of a Sigmoidal Function, Cybenko, 1989
The capacity of the network increases with more hidden units and more
hidden layers
Why go deeper (still largely an open theoretical question)? A single hidden layer may need an exponential number of neurons to represent some functions; a deep network can be much more compact.
[http://cs231n.github.io/neural-networks-1/]
CSC411 Lec10 13 / 41
Demo
CSC411 Lec10 14 / 41
Neural Networks
CSC411 Lec10 15 / 41
Forward Pass: What does the Network Compute?
Output of the network can be
written as:
hj(x) = f(vj0 + Σ_{i=1}^{D} xi vji)
ok(x) = g(wk0 + Σ_{j=1}^{J} hj(x) wkj)
σ(z) = 1/(1 + exp(−z)),  tanh(z) = (exp(z) − exp(−z))/(exp(z) + exp(−z)),  ReLU(z) = max(0, z)
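A minimal NumPy sketch of this forward pass, assuming one hidden layer with weight matrix V (J×D), output weights W (K×J), and bias vectors v0, w0 (all names are mine):

```python
import numpy as np

def forward(x, v0, V, w0, W, f=np.tanh, g=lambda z: z):
    # h_j(x) = f(v_j0 + sum_i x_i v_ji)    -- hidden activities
    h = f(v0 + V @ x)
    # o_k(x) = g(w_k0 + sum_j h_j(x) w_kj) -- network outputs
    o = g(w0 + W @ h)
    return h, o
```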
CSC411 Lec10 16 / 41
Special Case
What is a single-layer network (no hidden units) with a sigmoid activation function?
Network:
ok(x) = 1/(1 + exp(−zk))
zk = wk0 + Σ_{j=1}^{J} xj wkj
Logistic regression!
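As a quick check, a sketch of this special case in NumPy (names mine); it computes exactly the logistic regression model:

```python
import numpy as np

def single_layer_sigmoid(x, w0, W):
    z = w0 + W @ x                   # z_k = w_k0 + sum_j x_j w_kj
    return 1.0 / (1.0 + np.exp(-z))  # o_k = sigma(z_k): logistic regression
```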
CSC411 Lec10 17 / 41
Feedforward network
CSC411 Lec10 18 / 41
How do we train?
CSC411 Lec10 19 / 41
Training Neural Networks
CSC411 Lec10 20 / 41
Training Neural Networks: Back-propagation
Given any error function E and activation functions g(·) and f(·), we just need to derive the gradients
CSC411 Lec10 21 / 41
Key Idea behind Backpropagation
We don’t have targets for a hidden unit, but we can compute how fast the
error changes as we change its activity
▶ Instead of using desired activities to train the hidden units, use error derivatives w.r.t. hidden activities
▶ Each hidden activity can affect many output units and can therefore have many separate effects on the error. These effects must be combined
▶ We can compute error derivatives for all the hidden units efficiently
▶ Once we have the error derivatives for the hidden activities, it's easy to get the error derivatives for the weights going into a hidden unit
This is just the chain rule!
CSC411 Lec10 22 / 41
Useful Derivatives
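For the activations defined earlier, the derivatives needed below are standard (a reference sketch; these are textbook facts, not taken from the slide):

```python
import numpy as np

def sigmoid_prime(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1.0 - s)            # sigma'(z) = sigma(z) * (1 - sigma(z))

def tanh_prime(z):
    return 1.0 - np.tanh(z) ** 2    # tanh'(z) = 1 - tanh(z)^2

def relu_prime(z):
    return (z > 0).astype(float)    # ReLU'(z) = 1 if z > 0 else 0
```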
CSC411 Lec10 23 / 41
Computing Gradients: Single Layer Network
Let’s take a single layer network and draw it a bit differently
CSC411 Lec10 24 / 41
Computing Gradients: Single Layer Network
The error gradient is computable for any smooth activation function g(·) and any smooth error function
CSC411 Lec10 25 / 41
Computing Gradients: Single Layer Network
∂E/∂wki = (∂E/∂ok) · (∂ok/∂zk) · (∂zk/∂wki),   where δk^o := ∂E/∂ok
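A sketch of these three factors for one training case with a sigmoid output and squared error (squared error is introduced on the next slide; all names are mine):

```python
import numpy as np

def single_layer_grad(x, t, w0, W):
    z = w0 + W @ x                      # z_k = w_k0 + sum_i x_i w_ki
    o = 1.0 / (1.0 + np.exp(-z))        # o_k = g(z_k), sigmoid here
    dE_do = o - t                       # delta^o_k = dE/do_k (squared error)
    do_dz = o * (1.0 - o)               # g'(z_k) for the sigmoid
    # dz_k/dw_ki = x_i, so dE/dw_ki = delta^o_k * g'(z_k) * x_i
    return np.outer(dE_do * do_dz, x)   # shape (K, D)
```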
CSC411 Lec10 26 / 41
Gradient Descent for Single Layer Network
Assuming the error function is mean-squared error (MSE), on a single
training example n, we have
∂E/∂ok^(n) = ok^(n) − tk^(n) := δk^o
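Putting the pieces together, one stochastic gradient descent step for this single-layer sigmoid network with squared error might look like this (a sketch, assuming a learning rate `lr`; names mine):

```python
import numpy as np

def sgd_step(x, t, w0, W, lr=0.1):
    z = w0 + W @ x
    o = 1.0 / (1.0 + np.exp(-z))       # sigmoid outputs
    delta_o = o - t                    # dE/do_k = o_k - t_k
    delta_z = delta_o * o * (1.0 - o)  # dE/dz_k = delta_o_k * g'(z_k)
    W -= lr * np.outer(delta_z, x)     # dE/dw_ki = delta_z_k * x_i
    w0 -= lr * delta_z                 # bias gradients
    return w0, W
```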
CSC411 Lec10 30 / 41
Multi-layer Neural Network
CSC411 Lec10 31 / 41
Back-propagation: Sketch on One Training Case
Convert discrepancy between each output and its target value into an error
derivative
E = (1/2) Σ_k (ok − tk)^2 ;   ∂E/∂ok = ok − tk
Compute error derivatives in each hidden layer from error derivatives in layer
above. [assign blame for error at k to each unit j according to its influence
on k (depends on wkj )]
Use error derivatives w.r.t. activities to get error derivatives w.r.t. the weights, as in the sketch below.
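A minimal sketch of these three steps for the one-hidden-layer network from the forward pass above, with tanh hidden units, linear outputs, and squared error (all names are mine):

```python
import numpy as np

def backprop_one_example(x, t, v0, V, w0, W):
    # Forward pass
    h = np.tanh(v0 + V @ x)                  # hidden activities
    o = w0 + W @ h                           # linear outputs
    # 1. Error derivative at the outputs: dE/do_k = o_k - t_k
    delta_o = o - t
    # 2. Assign blame to each hidden unit j: dE/dh_j = sum_k delta_o_k * w_kj
    delta_h = W.T @ delta_o
    # 3. Error derivatives w.r.t. the weights
    dW = np.outer(delta_o, h)                # dE/dw_kj = delta_o_k * h_j
    dV = np.outer(delta_h * (1 - h**2), x)   # tanh'(z) = 1 - h_j^2
    return dV, dW
```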
CSC411 Lec10 32 / 41
Gradient Descent for Multi-layer Network
The output weight gradients for a multi-layer network are the same as for a single layer network:
∂E/∂wkj = Σ_{n=1}^{N} (∂E/∂ok^(n)) (∂ok^(n)/∂zk^(n)) (∂zk^(n)/∂wkj) = Σ_{n=1}^{N} δk^{z,(n)} hj^(n)
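Over a batch of N examples this sum is a single matrix product; a sketch, assuming `delta_z` stacks δk^{z,(n)} row-wise and `H` stacks the hidden activities hj^(n):

```python
import numpy as np

def output_weight_grad(delta_z, H):
    # delta_z: (N, K), H: (N, J)
    # returns dE/dw_kj = sum_n delta_k^{z,(n)} * h_j^{(n)}, shape (K, J)
    return delta_z.T @ H
```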
CSC411 Lec10 35 / 41
Backprop in deep networks
The exact same ideas (and math) apply when we have multiple hidden layers: compute ∂E/∂hj^L and use it to compute ∂E/∂wij^L and ∂E/∂hj^{L−1}
Two phases:
▶ Forward: Compute the outputs layer by layer (in order)
▶ Backward: Compute the gradients layer by layer (in reverse order)
Modern software packages (Theano, TensorFlow, PyTorch) do this automatically.
▶ You define the computation graph, and it takes care of the rest, as in the sketch below.
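For example, a minimal PyTorch sketch (layer sizes chosen to match the earlier figure; not taken from the slides): you only write the forward computation, and `backward()` fills in every gradient.

```python
import torch
import torch.nn as nn

# 3 inputs -> 4 hidden (sigmoid) -> 2 outputs, as in the earlier figure
model = nn.Sequential(nn.Linear(3, 4), nn.Sigmoid(), nn.Linear(4, 2))

x = torch.randn(5, 3)                  # a small batch of 5 examples
t = torch.randn(5, 2)                  # targets
loss = nn.functional.mse_loss(model(x), t)

loss.backward()                        # backward phase: all gradients, in reverse order
print(model[0].weight.grad.shape)      # gradient w.r.t. the first Linear layer's weights
```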
CSC411 Lec10 37 / 41
Training neural networks
CSC411 Lec10 38 / 41
Activation functions
CSC411 Lec10 39 / 41
Initialization
CSC411 Lec10 40 / 41
Momentum
CSC411 Lec10 41 / 41