
Lab 8: Feed Forward Neural Network

CLO-2

Part 1

Representing the feed-forward neural network using Python

Let us create the sample weights and biases that are applied at the input layer and at the first and second hidden layers.
import numpy as np
from sklearn import datasets
#
# Generate a dataset and plot it
#
np.random.seed(0)
X, y = datasets.make_moons(200, noise=0.20)
#
# Neural network architecture
# No of nodes in input layer = 2 (make_moons produces 2 features per sample)
# No of nodes in output layer = 3
# No of nodes in each hidden layer = 6
#
input_dim = 2 # input layer dimensionality; must match the number of features in X
output_dim = 3 # output layer dimensionality
hidden_dim = 6 # hidden layer dimensionality
#
# Weights and bias element for layer 1
# These weights are applied for calculating
# weighted sum arriving at neurons in 1st hidden layer
#
W1 = np.random.randn(input_dim, hidden_dim)
b1 = np.zeros((1, hidden_dim))
#
# Weights and bias element for layer 2
# These weights are applied for calculating
# weighted sum arriving at neurons in 2nd hidden layer
#
W2 = np.random.randn(hidden_dim, hidden_dim)
b2 = np.zeros((1, hidden_dim))
#
# Weights and bias element for layer 3
# These weights are applied for calculating
# weighted sum arriving at neurons in the final / output layer
#
W3 = np.random.randn(hidden_dim, output_dim)
b3 = np.zeros((1, output_dim))

Python code implementation for the propagation of the input signal through the different layers towards the output layer
#
# Forward propagation of input signals
# to 6 neurons in first hidden layer
# activation is calculated based on the tanh function
#
z1 = X.dot(W1) + b1
a1 = np.tanh(z1)
#
# Forward propagation of activation signals from first hidden layer
# to 6 neurons in second hidden layer
# activation is calculated based on the tanh function
#
z2 = a1.dot(W2) + b2
a2 = np.tanh(z2)
#
# Forward propagation of activation signals from second hidden layer
# to 3 neurons in output layer
#
z3 = a2.dot(W3) + b3
#
# Probability is calculated as an output
# of softmax function
#
probs = np.exp(z3) / np.sum(np.exp(z3), axis=1, keepdims=True)
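
We can verify the output of the softmax step: each row of probs sums to 1, and np.argmax gives the predicted class for every sample. A minimal check might look like this:

#
# Optional check on the forward pass above
#
print(probs.shape)                       # (200, 3): one probability vector per sample
print(probs.sum(axis=1)[:5])             # each row sums to 1 because of softmax
predictions = np.argmax(probs, axis=1)   # predicted class index for every sample
print(predictions[:10])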

Part 2

Now we will train a deep Artificial Neural Network (ANN) to better classify the datasets that the logistic regression model struggled with: Moons and Circles. We will also classify an even harder Sine Wave dataset to demonstrate that an ANN can form really complex decision boundaries.

1. Complex Data - Moons

While building Keras models for logistic regression above, we performed the following steps (a minimal sketch follows the list):

Step 1: Define a Sequential model.

Step 2: Add a Dense layer with sigmoid activation function. This was the
only layer we needed.

Step 3: Compile the model with an optimizer and loss function.

Step 4: Fit the model to the dataset.

Step 5: Analyze the results: plotting loss/accuracy curves, plotting the decision boundary, looking at the classification report, and understanding the confusion matrix.
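
For reference, a minimal sketch of that logistic regression workflow in tf.keras might look like the following; the optimizer choice and number of epochs are assumptions, since the lab text does not show them:

import numpy as np
from sklearn import datasets
from tensorflow import keras
from tensorflow.keras import layers

X, y = datasets.make_moons(200, noise=0.20)                     # toy dataset (Moons)
model = keras.Sequential()                                      # Step 1: Sequential model
model.add(layers.Dense(1, input_dim=2, activation='sigmoid'))   # Step 2: single Dense layer with sigmoid
model.compile(optimizer='adam',                                 # Step 3: optimizer + loss ('adam' assumed)
              loss='binary_crossentropy',
              metrics=['accuracy'])
history = model.fit(X, y, epochs=100, verbose=0)                # Step 4: fit the model
# Step 5: analyze results, e.g. history.history['loss'] and history.history['accuracy']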

While building a deep neural network, we only need to change step 2: we add several Dense layers one after another. The output of
one layer becomes the input of the next. Keras again does most of the
heavy lifting by initializing the weights and biases, and by connecting the
output of one layer to the input of the next. We only need to specify how
many nodes we want in a given layer and the activation function. It's as
simple as that.
We first add a layer with 4 nodes and tanh activation function. Tanh is a
commonly used activation function. We then add another layer with 2
nodes again using tanh activation. We finally add the last layer with 1
node and sigmoid activation. This is the final layer that we also used in
the logistic regression model.
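
A minimal sketch of this network, reusing the imports and the Moons data from the sketch above; the layer sizes follow the description (4 tanh, 2 tanh, 1 sigmoid), while the optimizer and epoch count are assumptions:

model = keras.Sequential()
model.add(layers.Dense(4, input_dim=2, activation='tanh'))    # first hidden layer: 4 nodes, tanh
model.add(layers.Dense(2, activation='tanh'))                 # second hidden layer: 2 nodes, tanh
model.add(layers.Dense(1, activation='sigmoid'))              # output layer: 1 node, sigmoid
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
model.fit(X, y, epochs=200, verbose=0)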

This is not a very deep ANN; it only has 3 layers: 2 hidden layers and the output layer. But notice a couple of patterns:

Output layer still uses the sigmoid activation function since we’re working
on a binary classification problem.

Hidden layers use the tanh activation function. If we added more hidden
layers, they would also use tanh activation. We have a couple of options
for activation functions: sigmoid, tanh, relu, and variants of relu.

We have fewer nodes in each subsequent layer. It's common to have fewer nodes as we stack layers on top of one another, giving a sort of triangular shape.

We didn’t build a very deep ANN here because it wasn’t necessary. We already achieve 100% accuracy with this configuration.

The ANN is able to come up with a perfect separator to distinguish the classes: 100% precision, nothing misclassified.

2. Complex Data - Circles

Now let’s look at the Circles dataset, where the LR model achieved only
50% accuracy. The model is the same as above; we only change the input
to the fit function to use the current dataset. And we again achieve 100%
accuracy.
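
A sketch of that change, assuming the same model-building code as above; the noise and factor values passed to make_circles are assumptions:

X, y = datasets.make_circles(200, noise=0.05, factor=0.3)     # new dataset: concentric circles
model.fit(X, y, epochs=200, verbose=0)                        # only the data passed to fit changes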
Similarly the decision boundary looks just like the one we would draw
by hand ourselves. The ANN was able to figure out an optimal
separator.

Just like above we get 100% accuracy.

3. Complex Data - Sine Wave


Let’s try to classify one final toy dataset. In the previous sections, the
classes were separable by one continuous decision boundary. The
boundary had a complex shape, it wasn’t linear, but one continuous
decision boundary was still enough. An ANN can draw an arbitrary number of
complex decision boundaries, and we will demonstrate that.

Let’s create a sinusoidal dataset that looks like the sine function, with every up
and down belonging to an alternating class. As we can see in the figure, a
single decision boundary won’t be able to separate out the classes. We
will need a series of non-linear separators.
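
The lab does not show the data-generation code for this dataset, so the helper below is only an illustrative sketch; the name make_sine_wave, the point count, and the noise level are assumptions:

def make_sine_wave(n_points=2400, noise=0.2):
    # Points scattered around a sine curve; every "up" and "down" hump
    # (each half-period) is assigned to an alternating class.
    x = np.random.uniform(0, 4 * np.pi, n_points)
    y_coord = np.sin(x) + np.random.normal(0, noise, n_points)
    labels = (np.floor(x / np.pi) % 2).astype(int)            # class alternates every half-period
    return np.column_stack([x, y_coord]), labels

X, y = make_sine_wave()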

Now we need a more complex model for accurate classification. So we
have 3 hidden layers and an output layer. The number of nodes per layer
has also increased to improve the learning capacity of the model.
Choosing the right number of hidden layers and nodes per layer is more of
an art than a science, usually decided by trial and error.
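
A sketch of such a model; the exact node counts per layer are not given in the lab, so the values below are assumptions:

model = keras.Sequential()
model.add(layers.Dense(32, input_dim=2, activation='tanh'))   # hidden layer 1 (node count assumed)
model.add(layers.Dense(32, activation='tanh'))                # hidden layer 2
model.add(layers.Dense(32, activation='tanh'))                # hidden layer 3
model.add(layers.Dense(1, activation='sigmoid'))              # output layer for the binary classes
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
model.fit(X, y, epochs=300, verbose=0)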
The ANN was able to model a pretty complex set of decision boundaries.

Precision is 99%; we only have 14 misclassified points out of 2400. Pretty good.

4. Multiclass Classification
In the previous sections we worked on binary classification. Now we will
take a look at a multi-class classification problem, where the number of
classes is more than 2. We will pick 3 classes for demonstration, but our
approach generalizes to any number of classes.

Here’s what our dataset looks like: spiral data with 3 classes, generated by a make_multiclass helper function (this is not one of scikit-learn's built-in generators).
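
The exact generator is not shown in the lab; an illustrative spiral generator (adapted from the classic CS231n spiral example, with the parameter values as assumptions) could look like this:

def make_multiclass(points_per_class=500, num_classes=3):
    # Spiral data: each class forms one arm of a spiral.
    X = np.zeros((points_per_class * num_classes, 2))
    y = np.zeros(points_per_class * num_classes, dtype=int)
    for c in range(num_classes):
        ix = range(points_per_class * c, points_per_class * (c + 1))
        r = np.linspace(0.0, 1.0, points_per_class)                   # radius
        t = np.linspace(c * 4, (c + 1) * 4, points_per_class) + np.random.randn(points_per_class) * 0.2  # angle + noise
        X[ix] = np.c_[r * np.sin(t), r * np.cos(t)]
        y[ix] = c
    return X, y

X, y = make_multiclass()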

Softmax Regression

As we know, Logistic Regression (LR) is a classification method for 2 classes. It works with binary labels 0/1. Softmax Regression (SR) is a
generalization of LR where we can have more than 2 classes. In our
current dataset we have 3 classes, represented as 0/1/2.

Activation function: SR uses softmax. Softmax scales the values of the output nodes such that they represent probabilities and sum up to 1. So in
our case P(class=0) + P(class=1) + P(class=2) = 1. It doesn’t do this
naively by dividing each value by the sum, though; it uses
the exponential function, so higher values get emphasized more and
lower values get squashed more. We will talk in detail about what softmax does
in another tutorial. For now you can simply think of it as a normalization
function which lets us interpret the output values as probabilities.
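
For example, raw output values (logits) of [2.0, 1.0, 0.1] are mapped by softmax to approximately [0.66, 0.24, 0.10], which sum to 1:

z = np.array([2.0, 1.0, 0.1])             # raw output values (logits) for 3 classes
softmax = np.exp(z) / np.sum(np.exp(z))   # exponentiate, then normalize
print(softmax)                            # approx. [0.66 0.24 0.10], sums to 1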

Loss function: In a binary classification problem, the loss function is binary_crossentropy. In the multiclass case, the loss function is
categorical_crossentropy. Categorical crossentropy is the generalization of
binary crossentropy to more than 2 classes.
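
A minimal Keras sketch of Softmax Regression on this dataset: a single Dense layer with 3 output nodes and softmax activation, compiled with categorical_crossentropy (the labels must be one-hot encoded for this loss); the optimizer and epoch count are assumptions:

from tensorflow.keras.utils import to_categorical

y_cat = to_categorical(y, num_classes=3)                            # one-hot labels for the 3 classes
sr_model = keras.Sequential()
sr_model.add(layers.Dense(3, input_dim=2, activation='softmax'))    # a single layer: linear model + softmax
sr_model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
sr_model.fit(X, y_cat, epochs=50, verbose=0)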

Training the model gives us an accuracy of around 50%. The most naive
method, which always predicts class 1 no matter what the input is, would
have an accuracy of 33%. The SR model is not much of an improvement
over it, which is expected because the dataset is not linearly separable.

Looking at the decision boundary confirms that we still have a linear classifier. The lines look jagged due to floating point rounding, but in reality
they’re straight.

Here’s the precision and recall corresponding to the 3 classes. And the
confusion matrix is all over the place. Clearly this is not an optimal
classifier.
5. Deep ANN

Now let’s build a deep ANN for multiclass classification. We will do the
same thing again, adding a couple of Dense layers with the tanh activation
function.

Note that the output layer still has 3 nodes, and uses the softmax
activation. The loss function also didn’t change; it's still
categorical_crossentropy. These won’t change going from a linear model
to a deep ANN, since the problem definition hasn’t changed. We’re still
working on multiclass classification, but now we are using a more powerful
model, and that power comes from adding more layers to our neural net.
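
A sketch of such a model, reusing the one-hot labels from the Softmax Regression sketch above; the hidden layer sizes and epoch count are assumptions:

deep_model = keras.Sequential()
deep_model.add(layers.Dense(32, input_dim=2, activation='tanh'))    # hidden layer 1 (node count assumed)
deep_model.add(layers.Dense(32, activation='tanh'))                 # hidden layer 2
deep_model.add(layers.Dense(3, activation='softmax'))               # output layer: 3 nodes, softmax
deep_model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
deep_model.fit(X, y_cat, epochs=50, verbose=0)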

We achieve 99% accuracy in just a couple of epochs.


The decision boundary is non-linear.

We got almost 100% accuracy; in total we misclassified only 5 points out of 1500.
