Artificial Intelligence and
Machine Learning for
Business (AIMLB)
Mukul Gupta
(Information Systems Area)
Biological Neurons
• The neural system of the human body consists of
three stages:
• Receptors,
• The receptors receive stimuli either internally or from the
external world, then pass the information to the neurons in
the form of electrical impulses.
• A neural network,
• The neural network then processes the inputs and makes
the appropriate decisions about the outputs.
• Effectors
• Finally, the effectors translate electrical impulses from the
neural network into responses to the outside environment.
2
Biological Neurons
3
Biological Neurons
• The brain works like a big computer.
• It processes information that it receives from the senses
and body and sends messages back to the body.
• But the brain can do much more than a machine
can:
• humans think and experience emotions with their brain,
and it is the root of human intelligence.
4
Biological Neurons
5
Biological Neurons
• A neuron mainly consists of three parts:
• Dendrites
• Dendrites are tree-like structures that receive signals
from surrounding neurons, where each branch is
connected to one neuron.
• Soma
• Cell body, contains the nucleus.
• Axon
• The axon is a thin cylinder that transmits the signal from one
neuron to others.
• At the end of the axon, contact with the dendrites of other
neurons is made through a synapse.
6
Biological Neurons: Working
7
Biological neurons
https://www.kdnuggets.com/2019/10/introduction-artificial-neural-networks.html
8
Biological neurons - Firing
Captured using two-photon calcium imaging, this video depicts neurons firing in the brain of a mouse in response to stimulation of its whiskers.
Neurons firing in the brain of a mouse | UCLA Health Newsroom
https://www.youtube.com/watch?v=4GleKfxW288
9
Perceptron
• Frank Rosenblatt, an American psychologist,
proposed the classical perceptron model in 1958.
10
Artificial Neuron - Perceptron: History
• Frank Rosenblatt created the first perceptron, first
simulating it on an IBM® 704 computer and later
implementing it as custom hardware (called the
Mark I Perceptron), with an array of 400 photocells
for vision applications.
• The photocells were randomly connected to
neurons, and the weights were implemented as
potentiometers (variable resistors) that could be
adjusted by attached motors as part of the learning
process.
11
Perceptron
• Mark I Perceptron machine, the first implementation
of the perceptron algorithm.
12
Artificial Neuron: Perceptron
13
Artificial Neuron: Perceptron
14
Neuron as a linear classifier
15
Perceptron: Final Look
16
Boolean Functions Using Perceptron
• OR Function — Can Do!
17
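• As a concrete illustration of the slide above, here is a minimal sketch (not from the slides) of a single perceptron with hand-picked weights that computes the OR function; the weights [1, 1] and bias −0.5 are an illustrative choice, since any separating line works.
```python
# Minimal perceptron sketch: hand-picked weights that realize the OR function.
# The weight/bias values are illustrative assumptions, not taken from the slides.

def perceptron(x, w, b):
    """Weighted sum of the inputs followed by a step (threshold) activation."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if z > 0 else 0

w, b = [1.0, 1.0], -0.5
for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, "->", perceptron(x, w, b))   # prints 0, 1, 1, 1 (the OR truth table)
```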
Boolean Functions Using Perceptron
• XOR Function — Cannot Do!
• There is no perceptron solution for data that are not linearly
separable. So, the key takeaway is that a single perceptron
cannot learn to separate data that are non-linear in nature.
18
BNN vs ANN
19
ANN: Perceptron [video]
20
A non-linear classifier?
21
Non-linearity = activation function
• A smooth (differentiable) nonlinear function that is
applied after the inner product with the weights
22
Activation Function
• The purpose of the activation function is to introduce
non-linearity into the output of a neuron.
• This is important because most real-world data is
nonlinear, and we want neurons to learn these
nonlinear representations.
• Every activation function (or non-linearity) takes a
single number and performs a certain fixed
mathematical operation on it.
23
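• To make this concrete, below is a small sketch (assuming NumPy; the input, weight, and bias values are made up for illustration) of a neuron output computed as a nonlinear activation applied after the inner product with the weights.
```python
import numpy as np

# Common activations: each takes a single number (applied element-wise)
# and performs a fixed nonlinear operation on it.
def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def tanh(z):
    return np.tanh(z)

def relu(z):
    return np.maximum(0.0, z)

x = np.array([1.0, -1.0])   # illustrative input
w = np.array([0.5, 2.0])    # illustrative weights
b = 0.1                     # illustrative bias

z = np.dot(w, x) + b        # inner product with the weights, plus bias
print(sigmoid(z), tanh(z), relu(z))   # non-linearity applied after the inner product
```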
Activation Function
24
Activation Function: Softmax
• A generalization of the logistic function that maps a 𝐾-dimensional
vector of real values (e.g., a set of 𝐾 scores) to values
in the range (0, 1) such that all values of the vector
add up to 1
• ML (Machine Learning) NNs (Neural Networks)
often use the Softmax function in the output layer of
classifiers to turn raw scores into class probabilities.
25
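• A minimal NumPy sketch of the softmax mapping described above (the max-subtraction is a standard numerical-stability trick, not something shown on the slide):
```python
import numpy as np

def softmax(z):
    """Map a K-dimensional vector of real values to values in (0, 1) that sum to 1."""
    e = np.exp(z - np.max(z))   # subtract the max for numerical stability
    return e / e.sum()

scores = np.array([2.0, 1.0, 0.1])   # illustrative K = 3 scores
p = softmax(scores)
print(p, p.sum())                    # e.g., [0.659 0.242 0.099], sums to 1.0
```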
Activation Function: Softmax
• Example:
26
Activation Function: Softmax
• Example:
27
Activation Function: Softmax
• Advantages:
• Softmax is optimal for maximum-likelihood estimation of
the model parameters.
• The properties of softmax (all output values in the range
(0, 1) and sum up to 1.0) make it suitable for a probabilistic
interpretation that’s very useful in machine learning.
• Softmax normalization is a way of reducing the influence
of extreme values or outliers in the data without removing
data points from the set.
28
Single Layer Perceptron: Feedforward
• Because the SLP (single-layer perceptron) is a linear classifier,
if the cases are not linearly separable the learning process will
never reach a point where all the cases are classified
properly.
29
Multi-Class: Non-Linearly Separable
Solution: MLP
30
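• To show why an MLP overcomes the XOR limitation from the earlier slide, here is a small sketch with hand-picked weights (the specific values are an illustrative assumption, since many choices work): one hidden unit fires for OR, one for AND, and the output combines them as "OR and not AND".
```python
def step(z):
    return 1 if z > 0 else 0

def xor_mlp(x1, x2):
    # Hidden layer: h1 fires for OR(x1, x2), h2 fires for AND(x1, x2).
    h1 = step(1.0 * x1 + 1.0 * x2 - 0.5)
    h2 = step(1.0 * x1 + 1.0 * x2 - 1.5)
    # Output layer: XOR = OR and not AND.
    return step(1.0 * h1 - 2.0 * h2 - 0.5)

for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, "->", xor_mlp(*x))   # prints 0, 1, 1, 0 (the XOR truth table)
```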
Multi Layer Perceptron
• Neurons are arranged into networks of neurons.
• A row of neurons is called a layer and one network
can have multiple layers. The architecture of the
neurons in the network is often called the network
topology.
31
Multi Layer Perceptron
• Input Layer
• The bottom (first) layer that takes input from your dataset is
called the visible (or input) layer, because it is the exposed part
of the network.
• Its units are not true neurons; they simply pass the input values
on to the next layer.
• Hidden Layer(s)
• Layers after the input layer are called hidden layers because
they are not directly exposed to the input.
• Deep learning can refer to having many hidden layers in your
neural network.
• Output Layer
• The final layer is called the output layer and it is responsible for
outputting a value or vector of values that correspond to the
format required for the problem.
32
Output activation
● Usually, a non-linear activation after each layer
● Typically, ReLU between the layers
● At the output layer we need to consider the task, i.e.,
what kinds of outputs we want, e.g.,
○ Multi-label classification
each of the K outputs is a separate probability (values 0.0-1.0)
→ sigmoid
○ Multi-class classification
probability distribution over K classes (sums to 1.0)
→ softmax
○ Regression
free range of values → linear
33
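● A brief NumPy sketch (the logit values are made up) contrasting the three output choices listed above:
```python
import numpy as np

logits = np.array([2.0, -1.0, 0.5])   # illustrative raw outputs of the last layer

# Multi-label classification: each output is an independent probability -> sigmoid
multi_label = 1.0 / (1.0 + np.exp(-logits))

# Multi-class classification: probability distribution over K classes -> softmax
e = np.exp(logits - logits.max())
multi_class = e / e.sum()             # sums to 1.0

# Regression: free range of values -> linear (identity), logits are used as-is
regression = logits

print(multi_label, multi_class, regression)
```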
NN: A toy example
• A simple network, toy example
• Inputs x = [1, −1]; the first neuron has weights [1, −2] and bias 1, the second neuron has weights [−1, 1] and bias 0
• First neuron: 1·1 + (−1)·(−2) + 1 = 4, and σ(4) ≈ 0.98
• Second neuron: 1·(−1) + (−1)·1 + 0 = −2, and σ(−2) ≈ 0.12
• Sigmoid function: σ(z) = 1 / (1 + e^(−z))
34
NN: A toy example
• A simple network, toy example (cont’d)
• For an input vector [1, −1], the output is [0.62, 0.83]
• Layer by layer, the activations are [0.98, 0.12] → [0.86, 0.11] → [0.62, 0.83]
• The network defines a function f: R² → R², with f([1, −1]) = [0.62, 0.83]
35
NN: Matrix Operations
• Matrix operations are helpful when working with multidimensional inputs and
outputs
σ(W x + b) = a
For the first layer of the toy example:
σ( [[1, −2], [−1, 1]] · [1, −1]ᵀ + [1, 0]ᵀ ) = σ([4, −2]ᵀ) ≈ [0.98, 0.12]ᵀ
36
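• The first-layer computation above can be reproduced in a few lines (a sketch assuming NumPy, using the weights and biases read off the toy example):
```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

W = np.array([[1.0, -2.0],
              [-1.0, 1.0]])   # weight matrix of the first layer
b = np.array([1.0, 0.0])      # bias vector
x = np.array([1.0, -1.0])     # input vector

a = sigmoid(W @ x + b)        # sigma(W x + b) = a
print(a)                      # approximately [0.98, 0.12]
```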
Deep Learning
• DL applies a multi-layer process for learning rich hierarchical features (i.e., data representations)
• For images: input image pixels → edges → textures → parts → objects
• Pipeline: low-level features → mid-level features → high-level features → trainable classifier → output
37
Deep NN
• Deep NNs have many hidden layers
Fully-connected (dense) layers (a.k.a. Multi-Layer Perceptron or MLP)
Each neuron is connected to all neurons in the succeeding layer
[Figure: inputs x1 … xN pass through hidden Layers 1, 2, …, L to produce outputs y1 … yM]
38
What is a model? Recap
• A model is a function specified by a set of parameters 𝜃
f_θ(x)
• Example: linear predictor
f_θ(x) = wᵀ · x + b    (parameters θ = (w, b))
39
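• As a one-line reminder of the recap above, a linear predictor in NumPy (the parameter values are made up):
```python
import numpy as np

w = np.array([0.5, -1.0])   # parameters theta = (w, b); values are illustrative
b = 0.2

def f(x):
    """Linear predictor f_theta(x) = w^T x + b."""
    return np.dot(w, x) + b

print(f(np.array([1.0, 2.0])))   # 0.5*1 - 1*2 + 0.2 = -1.3
```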
Training NNs
• The network parameters 𝜃 include the weight matrices and bias vectors from all
layers
𝜃 = {W1, b1, W2, b2, ⋯, WL, bL}
Often, the model parameters 𝜃 are referred to as weights
• Training a model to learn a set of parameters 𝜃 that are optimal (according to a
criterion) is one of the greatest challenges in ML
[Figure: a 16 × 16 = 256-pixel digit image is fed to the network (inputs x1 … x256); a softmax output layer over y1 … y10 gives class scores, e.g., 0.1 for "is 1", 0.7 for "is 2", 0.2 for "is 0"]
40
Training NNs
• Define a loss function/objective function/cost function ℒ(𝜃) that
calculates the difference (error) between the model prediction and the
true label
E.g., ℒ(𝜃) can be mean-squared error, cross-entropy, etc.
[Figure: the network's outputs y1 … y10 (e.g., 0.2, 0.3, …, 0.5) are compared with the true label "1" to compute the cost ℒ(𝜃)]
41
Loss Functions
• Neural networks are trained using an optimizer, and
we are required to choose a loss function when
configuring the model.
• During optimization, a function is used to evaluate a
candidate set of weights, and we try to minimize the
resulting error.
• This objective function is our loss function, and the
evaluation score calculated by it is called the loss.
• In simple words, the loss is a quantity computed from
the model's predictions that we try to minimize during
model training.
42
Loss Functions: Examples
• Regression Loss Functions
• Mean Squared Error Loss
• Mean Squared Logarithmic Error Loss
• Mean Absolute Error Loss
• Binary Classification Loss Functions
• Binary Cross-Entropy
• Hinge Loss
• Squared Hinge Loss
• Multi-Class Classification Loss Functions
• Multi-Class Cross-Entropy Loss
• Sparse Multiclass Cross-Entropy Loss
• Kullback Leibler (KL) Divergence Loss
43
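• A short NumPy sketch (the predictions and labels are made up) of two of the losses listed above, Mean Squared Error and Binary Cross-Entropy:
```python
import numpy as np

y_true = np.array([1.0, 0.0, 1.0])   # illustrative true labels
y_pred = np.array([0.9, 0.2, 0.6])   # illustrative model predictions

# Mean Squared Error Loss (regression)
mse = np.mean((y_true - y_pred) ** 2)

# Binary Cross-Entropy (binary classification); eps avoids log(0)
eps = 1e-12
bce = -np.mean(y_true * np.log(y_pred + eps) +
               (1 - y_true) * np.log(1 - y_pred + eps))

print(mse, bce)
```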
Thank You