
Neural Networks

Artificial Neural Networks

• Vaguely inspired by the biological neural networks that constitute animal brains

• The human brain is made of close to 100 billion neurons interconnected by synapses

• A neuron processes and transmits information through electrical and chemical signals that are carried via the synapses

• Neurons can connect to each other to form neural networks – each neuron can be connected to about 5,000 other neurons

Artificial NNs vs. Biological NNs

                    ANN                             BNN
Size                10–1000 neurons,                86 billion neurons,
                    1000s of synapses               >100 trillion synapses
Network topology    Usually feed-forward,           Complex network,
                    computed layer by layer         computed asynchronously
Calculation speed   Nanoseconds                     Milliseconds
Power               ~100 watts                      ~20 watts
Others              Not fault tolerant;             Fault tolerant;
                    learning                        learning?

• Size: our brain contains about 86 billion neurons and more than 100 trillion (or, according to some estimates, 1,000 trillion) synapses (connections). The number of “neurons” in artificial networks is much smaller than that (usually in the ballpark of 10–1000), but comparing their numbers this way is misleading.

Applications

• Neural nets have done exceptionally well at tasks like:

  • Image recognition, character recognition, face recognition

  • Feature extraction, fingerprint processing, signature matching

  • Speech recognition

• Other, more modest successes in: stock market prediction, combinatorial optimization, medicine, etc.

The Perceptron
• Perceptron: the main building block

[Figure: a single perceptron – inputs are weighted, summed together with a bias term, and passed through an activation function]
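As a concrete illustration, here is a minimal sketch of a perceptron's forward computation in NumPy. The weights, bias, and step activation are arbitrary example values, not taken from the slides.

    import numpy as np

    def step(z):
        # Heaviside step activation: 1 if z >= 0, else 0
        return np.where(z >= 0, 1, 0)

    def perceptron(x, w, b):
        # Weighted sum of the inputs plus a bias, passed through the activation
        return step(np.dot(w, x) + b)

    # Example with made-up weights: this perceptron behaves like an AND gate
    w = np.array([1.0, 1.0])
    b = -1.5
    print(perceptron(np.array([1, 1]), w, b))  # 1
    print(perceptron(np.array([1, 0]), w, b))  # 0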

The Artificial Neural Net

• Number of layers

  • Single vs. multi-layer

• Number of nodes in each layer

• Weights/connections

• Activation or transfer function
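To make these pieces concrete, here is a minimal sketch of a small two-layer feed-forward network in NumPy; the layer sizes, random weights, and activations are illustrative assumptions, not the course's architecture.

    import numpy as np

    rng = np.random.default_rng(0)

    # Architecture choices (hyperparameters): 3 inputs, 4 hidden nodes, 1 output
    W1, b1 = rng.normal(size=(4, 3)), np.zeros(4)   # weights/connections, layer 1
    W2, b2 = rng.normal(size=(1, 4)), np.zeros(1)   # weights/connections, layer 2

    def forward(x):
        h = np.tanh(W1 @ x + b1)                    # hidden layer with tanh activation
        return 1 / (1 + np.exp(-(W2 @ h + b2)))     # sigmoid output layer

    print(forward(np.array([0.5, -1.0, 2.0])))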

Examples of Activation functions

• ReLU (with Softmax/Linear output)

• Sigmoid (Logistic)

• Hyperbolic Tangent (tanh)

• Step function (Heaviside)

• Softmax (Generalized Logistic)

• Linear

• Which one do we use?

  • There is no set procedure or rule

  • ReLU has become very popular
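The activations listed above can each be written in a line or two of NumPy; this is a generic sketch, not code from the course.

    import numpy as np

    def relu(z):    return np.maximum(0, z)
    def sigmoid(z): return 1 / (1 + np.exp(-z))
    def tanh(z):    return np.tanh(z)
    def step(z):    return np.where(z >= 0, 1, 0)
    def linear(z):  return z
    def softmax(z):
        # Subtract the max for numerical stability before exponentiating
        e = np.exp(z - np.max(z))
        return e / e.sum()

    z = np.array([-2.0, 0.0, 3.0])
    print(relu(z), sigmoid(z), softmax(z))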

An Example

[email protected]
3DVB5QRZ69
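As an illustrative stand-in for the slide's example (all numbers here are assumed), consider one forward pass through a single sigmoid neuron with two inputs:

    inputs x = (0.5, 0.8), weights w = (0.4, 0.6), bias b = -0.3
    weighted sum: z = 0.4·0.5 + 0.6·0.8 − 0.3 = 0.38
    output: sigmoid(z) = 1 / (1 + e^(−0.38)) ≈ 0.594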

Learning

• How do we determine the weights?

  • Start with guess values for the weights

  • Calculate outputs from the inputs

  • Compare the outputs to the desired outputs: calculate the errors

  • Training algorithms update the weights in a way that minimizes the errors (cost function)

• Cost (loss) functions measure how close an output is to the desired output. Preferably they have:

  • Non-negativity

  • Global continuity and differentiability
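One cost function with these properties is the mean squared error; a minimal sketch (example values assumed):

    import numpy as np

    def mse(y_pred, y_true):
        # Mean squared error: non-negative, continuous, and differentiable
        return np.mean((y_pred - y_true) ** 2)

    print(mse(np.array([0.9, 0.2]), np.array([1.0, 0.0])))  # 0.025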

Training: Backpropagation

• Backpropagation of errors is a common algorithm for training artificial neural networks, used in conjunction with an optimization method such as gradient descent.

• The method calculates the gradient of a loss function with respect to all the weights in the network.

• The gradient is fed to the optimization method, which in turn uses it to update the weights in an attempt to minimize the loss function.
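A minimal sketch of the idea (a single sigmoid neuron with a squared-error loss; all values are illustrative assumptions): the chain rule is applied from the loss back to each weight.

    import numpy as np

    x, y_true = np.array([0.5, 0.8]), 1.0
    w, b = np.array([0.4, 0.6]), -0.3

    # Forward pass
    z = w @ x + b
    y = 1 / (1 + np.exp(-z))          # sigmoid output
    loss = (y - y_true) ** 2

    # Backward pass: chain rule from the loss back to the weights
    dloss_dy = 2 * (y - y_true)
    dy_dz = y * (1 - y)               # derivative of the sigmoid
    grad_w = dloss_dy * dy_dz * x     # gradient w.r.t. the weights
    grad_b = dloss_dy * dy_dz         # gradient w.r.t. the bias
    print(grad_w, grad_b)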

Gradient Descent

• A first-order optimization algorithm.

• Essentially equivalent to sliding down a slope to eventually find the minimum (the lowest point in the valley).

• To find a minimum (valley) of a function, take a small step along the steepest descent direction, and keep iterating.

• To find maxima, we would instead do gradient ascent.
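A minimal gradient-descent sketch for a one-variable function (the function and learning rate are arbitrary examples):

    # Minimize f(x) = (x - 3)^2 by repeatedly stepping against the gradient
    def grad(x):
        return 2 * (x - 3)

    x, lr = 0.0, 0.1
    for _ in range(100):
        x -= lr * grad(x)      # small step along the steepest descent direction
    print(x)                   # close to 3, the minimum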

Learning rate

• Choosing the learning rate:

  • Too small, and we will need too many iterations to converge

  • Too large, and we may skip over the optimal solution

• Adaptive learning rate: start with a high learning rate and gradually reduce it with each iteration

• Can also be chosen by trial and error – try a range of learning rates (e.g. 1, 0.1, 0.001, 0.0001) and use the results as a guide
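One simple adaptive scheme is exponential decay of the learning rate; a sketch with made-up constants:

    initial_lr, decay = 0.5, 0.99
    for step in range(1000):
        lr = initial_lr * (decay ** step)   # gradually reduce the learning rate
        # w -= lr * grad(w)                 # the weight update would use the current lr
    print(lr)                               # much smaller than the initial 0.5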

Epochs, Batch size, Iterations

• When the dataset is too large, passing all of the data through the neural net before making a weight update is computationally expensive.

• Instead, we create batches of data with a smaller batch size.

• After each batch is passed through and the weights are updated, we count it as one iteration.

• When the entire dataset has been passed forward and backward (with weight updates) through the neural network, we count it as one epoch.

• Too few epochs: underfitting. Too many: overfitting.

• Batch training: all of the training samples pass through the neural net before the weights are updated.

• Sequential training: the weights are updated after each training vector is passed through the neural net.
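For example (all numbers assumed): with 10,000 training samples and a batch size of 100, one epoch consists of 10,000 / 100 = 100 iterations; training for 20 epochs therefore performs 2,000 weight updates.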
Scaling
• Scaling the variables

  • The non-linearities in the activation function and numerical rounding errors make input scaling quite important

  • Scaling can accelerate learning and improve performance
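A common choice is to standardize each input feature to zero mean and unit variance; a NumPy sketch with made-up data:

    import numpy as np

    X = np.array([[180.0, 0.002],
                  [160.0, 0.010],
                  [175.0, 0.004]])                    # features on very different scales

    X_scaled = (X - X.mean(axis=0)) / X.std(axis=0)   # zero mean, unit variance per column
    print(X_scaled)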

Overfitting

• Neural network models are susceptible to overfitting

  • Large numbers of weights and biases

  • Excessive learning (too many epochs) on the training data

• Ways to avoid overfitting

  • Increase the sample size

  • Early stopping

  • Reduce the network size

  • Regularization
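Early stopping amounts to monitoring a validation loss and halting once it stops improving. A sketch of the logic, using a made-up sequence of validation losses and an assumed patience value:

    # Hypothetical validation losses per epoch (made-up numbers)
    val_losses = [0.9, 0.7, 0.6, 0.55, 0.56, 0.57, 0.58, 0.60, 0.61, 0.62]

    best_val, patience, wait, stop_epoch = float("inf"), 3, 0, None
    for epoch, val in enumerate(val_losses):
        if val < best_val:
            best_val, wait = val, 0          # validation loss improved; keep training
        else:
            wait += 1
            if wait >= patience:             # no improvement for `patience` epochs
                stop_epoch = epoch
                break                        # stop early to avoid overfitting
    print(stop_epoch, best_val)              # stops at epoch 6 with best loss 0.55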

Regularization (weight decay)

• Regularization is a technique used to avoid this overfitting problem.

• The idea behind regularization is that models that overfit the data are complex models that have, for example, too many parameters.

• Regularization penalizes the usual loss function by adding a complexity term that gives a bigger loss for more complex models.

• Types of regularization:

  • LASSO (L1)

  • Ridge (L2)

• The optimal value of λ, the decay rate or penalty coefficient, is determined through cross-validation.
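A sketch of a ridge (L2) penalized loss; the λ value and weights below are assumed for illustration:

    import numpy as np

    def ridge_loss(y_pred, y_true, weights, lam=0.01):
        mse = np.mean((y_pred - y_true) ** 2)     # usual data-fit loss
        penalty = lam * np.sum(weights ** 2)      # complexity term: large weights cost more
        return mse + penalty

    w = np.array([0.5, -1.2, 2.0])
    print(ridge_loss(np.array([0.9]), np.array([1.0]), w))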
Hyperparameters and tuning

• Hyperparameters are the variables that determine the network structure and the variables that determine how the network is trained:

  • Number of hidden layers

  • Number of neurons in each hidden layer

  • Decay factor

  • Number of epochs

  • Learning rate

  • The activation function

• Tuning these to ensure that you don’t overfit is an art.
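One common (if brute-force) way to tune is a small grid search over a few hyperparameter values, keeping the setting with the best validation score. A sketch with assumed candidate values and a placeholder scoring function:

    from itertools import product

    # Candidate values to try (illustrative choices, not recommendations)
    grid = {
        "hidden_layers": [1, 2],
        "neurons": [16, 64],
        "learning_rate": [0.1, 0.01],
    }

    def validation_score(hidden_layers, neurons, learning_rate):
        # Placeholder standing in for "train the network and evaluate on validation data"
        return -(abs(neurons - 64) + abs(learning_rate - 0.01) + hidden_layers)

    best = max(
        product(*grid.values()),
        key=lambda combo: validation_score(*combo),
    )
    print(dict(zip(grid.keys(), best)))   # the combination with the best validation score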
