classification and prediction informs successive tweaks

Network Structure
●Multiple layers
●Input layer (raw observations)
●Hidden layers
●Output layer
●Nodes
●Weights (like coefficients, subject to iterative adjustment)
●Bias values (also subject to iterative adjustment)

Schematic Diagram

Tiny Example
Predict consumer opinion of a cheese product based on fat and salt content
Obs.  Fat Score  Salt Score  Opinion
1     0.2        0.9         like
2     0.1        0.1         dislike
3     0.2        0.4         dislike
4     0.2        0.5         dislike
5     0.4        0.5         like
6     0.3        0.8         like

Example – Using fat & salt content to predict consumer acceptance of cheese
Rectangles are nodes, the wij on arrows are weights, and the θj are node bias values

Moving Through the Network

The Input Layer
●For input layer, input = output
●E.g., for record #1: Fat input = output = 0.2 Salt input = output = 0.9
●Output of input layer = input into hidden layer

The Hidden Layer
●In this example, it has 3 nodes
●Each node receives as input the output of all input nodes
●Output of each hidden node is some function of the weighted sum of inputs

The Weights
●The weights θ (theta) and w are typically initialized to random values in the range -0.05 to +0.05
●Equivalent to a model with random prediction (in other words, no predictive value)
●These initial weights are used in the first round of training

Output of Node 3 if g is a Logistic Function
output3 = g(θ3 + w13·(fat) + w23·(salt)), where g(s) = 1 / (1 + e^(-s))

Initial Pass of the Network
Node outputs (shown at right within each node) using the first record in the tiny example, and a logistic function
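This first-pass calculation can be sketched in Python. The weight and bias values below (w13, w23, θ3) are illustrative, not an actual random initialization:

```python
import math

def logistic(s):
    """Logistic activation: g(s) = 1 / (1 + e^(-s))."""
    return 1.0 / (1.0 + math.exp(-s))

def node_output(inputs, weights, bias):
    """Node output: g(bias + weighted sum of inputs)."""
    return logistic(bias + sum(w * x for w, x in zip(weights, inputs)))

# Record #1 from the tiny example: fat = 0.2, salt = 0.9
fat, salt = 0.2, 0.9

# Illustrative weights/bias for hidden node 3 (in practice these are
# initialized to small random values before the first pass)
w13, w23, theta3 = 0.05, 0.01, -0.3

out3 = node_output([fat, salt], [w13, w23], theta3)
print(round(out3, 2))  # g(-0.3 + 0.05*0.2 + 0.01*0.9) = g(-0.281) ≈ 0.43
```

Because g squashes any weighted sum into (0, 1), every node's output stays in the 0-1 range regardless of the weights.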
Calculations at hidden node 3:

Output Layer
The output of the last hidden layer becomes input for the output layer, which has one node per class.

Mapping the output to a classification
Output for “like” = 0.506, just slightly greater than that for “dislike,” so the classification, at this early stage, is “like”

Relation to Linear Regression
A net with a single output node and no hidden layers, where g is the identity function, takes the same form as a linear regression model

Training the Model
●Categorical variables
●If equidistant categories, map to equidistant interval points in the 0-1 range
●Otherwise, create dummy variables
●Transform (e.g., log) skewed variables

Initial Pass Through Network
●Goal: Find weights that yield best predictions
●The process described above is repeated for all records
●At each record, compare prediction to actual
●The difference is the error for the output node
●The error is propagated back and distributed to all the hidden nodes and used to update their weights

Back Propagation (“back-prop”)
●Output from output node k:
  outputk = g(θk + Σj wjk·outputj)
●Error associated with that node:
  errk = outputk (1 − outputk)(actualk − outputk)
Note: this is like the ordinary error (actualk − outputk), multiplied by a correction factor, outputk (1 − outputk)

Error is Used to Update Weights
new weight = old weight + l × err
●l = a constant between 0 and 1, reflecting the “learning rate” or “weight decay parameter”

Why It Works
●Big errors lead to big changes in weights
●Small errors leave weights relatively unchanged
●Over thousands of updates, a given weight keeps changing until the error associated with that weight is negligible, at which point weights change little

RapidMiner Process

Tiny Example - Final Weights

Tiny Example - Final Propensities and Classifications
And Confusion Matrix
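The error-correction step for an output node can be sketched as follows. This is a generic back-prop sketch, not the book's exact procedure: each weight's correction is additionally scaled by that weight's input (the bias by 1), and all numeric values below are made up for illustration:

```python
import math

def logistic(s):
    return 1.0 / (1.0 + math.exp(-s))

def backprop_step(weights, bias, hidden_outputs, actual, l=0.5):
    """One update for an output node.

    err = output * (1 - output) * (actual - output); each weight is
    nudged by l * err * (its input), and the bias by l * err.
    """
    output = logistic(bias + sum(w * h for w, h in zip(weights, hidden_outputs)))
    err = output * (1 - output) * (actual - output)
    new_weights = [w + l * err * h for w, h in zip(weights, hidden_outputs)]
    new_bias = bias + l * err
    return new_weights, new_bias, err

# Illustrative values: 3 hidden-node outputs feeding the "like" node;
# actual = 1 ("like"), so err is positive and the weights move up
w, b, err = backprop_step([0.01, 0.05, 0.015], -0.015, [0.43, 0.51, 0.52], actual=1)
```

A big gap between actual and output produces a big err and a big weight change; a small gap leaves the weights nearly unchanged, which is exactly the "why it works" behavior described above.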
Common Criteria to Stop the Updating
●When weights change very little from one iteration to the next
●When the misclassification rate reaches a required threshold
●When a limit on runs is reached
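These three criteria can be combined in a training loop. The sketch below trains a single logistic node on the tiny example; the learning rate, tolerance, and run limit are illustrative settings, not recommended values:

```python
import math

def logistic(s):
    return 1.0 / (1.0 + math.exp(-s))

# Tiny example: ([fat, salt], opinion) with like = 1, dislike = 0
data = [([0.2, 0.9], 1), ([0.1, 0.1], 0), ([0.2, 0.4], 0),
        ([0.2, 0.5], 0), ([0.4, 0.5], 1), ([0.3, 0.8], 1)]

w, b = [0.0, 0.0], 0.0               # start from neutral weights
l, tol, max_runs = 0.5, 1e-6, 2000   # illustrative settings

for run in range(max_runs):          # criterion 3: limit on runs
    biggest_change = 0.0
    for x, actual in data:
        out = logistic(b + sum(wi * xi for wi, xi in zip(w, x)))
        err = out * (1 - out) * (actual - out)
        for i, xi in enumerate(x):
            w[i] += l * err * xi
            biggest_change = max(biggest_change, abs(l * err * xi))
        b += l * err
        biggest_change = max(biggest_change, abs(l * err))
    misclassified = sum(
        (logistic(b + sum(wi * xi for wi, xi in zip(w, x))) > 0.5) != (actual == 1)
        for x, actual in data)
    if biggest_change < tol:         # criterion 1: weights barely changing
        break
    if misclassified == 0:           # criterion 2: misclassification threshold met
        break
```

Here the misclassification threshold is set to zero on the training data purely for illustration; in practice the threshold (and the error tracking) should be based on validation data, as discussed next.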
Avoiding Overfitting
With sufficient iterations, a neural net can easily overfit the data

To avoid overfitting:
●Track error in validation data or via cross-validation
●Limit iterations
●Limit complexity of network

User Inputs

Specify Network Architecture
Number of hidden layers
●Most popular – one hidden layer
Size (number of nodes) of hidden layer(s)
●More nodes capture complexity, but increase the chance of overfitting
Learning Rate
●Low values “downweight” the new information from errors at each iteration
●This slows learning, but reduces the tendency to overfit to local structure
Momentum
●Helps avoid getting stuck in a local max or min

Advantages
●Good predictive ability
●Can capture complex relationships
●No need to specify a model
●Complex networks are good with large numbers of “low-level” features, like pixel values in an image, or words in a text (see “deep learning”)

Disadvantages
●Considered a “black box” prediction machine, with no insight into relationships between predictors and outcome
●No variable-selection mechanism, so you have to exercise care in selecting variables
●Heavy computational requirements if there are many variables (additional variables dramatically increase the number of weights to calculate)

Deep Learning
●The statistical and machine learning models in this book - including standard neural nets - work where you have informative predictors (purchase information, bank account information, # of rooms in a house, etc.)
●In rapidly-growing applications such as voice and image recognition, you have high numbers of “low-level” granular predictors - pixel values, wave amplitudes - that are uninformative at this low level

Deep Learning
The most active application area for neural nets

RapidMiner extensions

Image Handling and Deep Learning
• In image recognition, pixel values are the predictors, and there might be 100,000+ predictors – big data! (voice recognition is similar)
• Deep neural nets with many layers (“neural nets on steroids”) have facilitated revolutionary breakthroughs in image/voice recognition, and in artificial intelligence (AI)
• Key is the ability to self-learn features (“unsupervised”)
• For example, clustering could separate the pixels in this 1” by 1” football field image into the “green field” and “yard marker” areas without knowing that those concepts exist
• From there, the concept of a boundary, or “edge,” emerges
• Successive stages move from identification of local, simple features to more global & complex features

Convolutional Neural Net example in image recognition
●A popular deep learning implementation is a convolutional neural net (CNN)
●Need to aggregate predictors (pixels)
●Rather than have weights for each pixel, group pixels together and apply the same operation: “convolution”
●A common aggregation is a 3 x 3 pixel area, for example the small area around this man’s lower chin
Enlargement of area - pixel values (higher number = darker):
25 200 25
25 225 25
25 225 25

Filter matrix that is good at identifying vertical lines (we will see why shortly):
0 1 0
0 1 0
0 1 0

Apply the Convolution
The convolution operation is “multiply the pixel matrix by the filter matrix,” then sum:
0*25 + 1*200 + 0*25 +
0*25 + 1*225 + 0*25 +
0*25 + 1*225 + 0*25 = 650

Sum = 650; this is higher than for any other arrangement of the filter matrix, because the pixel values are highest in the central column

Continue the Convolution
●The filter matrix moves across the image, storing its result, yielding a smaller matrix whose values indicate the presence or absence of a vertical line
●Similar filters can detect horizontal lines, curves, borders - hyper-local features
●Further convolutions can be applied to these local features
●Result: a multi-dimensional matrix, or tensor, of higher-level features

The Learning Process
How does the net learn which convolutions to do?
●In supervised learning, the net retains those convolutions and features which are successful in labeling (tagging) images
●Note that the feature-learning process yields a reduced (simpler) set of features than the original set of pixel values
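The convolution and the sliding step can be sketched in Python. The 3 x 3 patch and filter are the ones from the example; the small 4 x 4 test image is made up for illustration:

```python
# 3 x 3 enlargement of the image area: pixel values (higher number = darker)
patch = [[25, 200, 25],
         [25, 225, 25],
         [25, 225, 25]]

# Filter matrix that is good at identifying vertical lines
vertical_filter = [[0, 1, 0],
                   [0, 1, 0],
                   [0, 1, 0]]

def convolve(patch, filt):
    """Multiply the pixel matrix by the filter matrix element-wise, then sum."""
    return sum(p * f
               for prow, frow in zip(patch, filt)
               for p, f in zip(prow, frow))

print(convolve(patch, vertical_filter))  # 0*25 + 1*200 + ... = 650

def feature_map(image, filt):
    """Slide the filter across the image, storing one result per position."""
    k = len(filt)
    rows, cols = len(image), len(image[0])
    return [[convolve([row[c:c + k] for row in image[r:r + k]], filt)
             for c in range(cols - k + 1)]
            for r in range(rows - k + 1)]

# A made-up 4 x 4 image containing a dark vertical stripe in column 1
image = [[25, 200, 25, 25],
         [25, 225, 25, 25],
         [25, 225, 25, 25],
         [25, 200, 25, 25]]
print(feature_map(image, vertical_filter))  # high values mark the stripe
```

The resulting smaller matrix has high values exactly where the vertical stripe sits, which is what "indicate the presence or absence of a vertical line" means in practice.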
(In supervised learning, the training data has known labels.)

Unsupervised Learning: Autoencoding
●Deep learning nets can learn higher-level features even when there are no labels to guide the process
●The net adds a process to take the high-level features and generate an image
●The generated image is compared to the original image, and the net retains the architecture that produces the best matches

Deep learning networks have many settings

Summary
●Neural nets can capture flexible/complicated relationships between outcome and predictors
●The network “learns” and updates its model iteratively as more data are fed into it
●Major danger: overfitting
●Requires large amounts of data
●Good predictive performance, yet it’s a “black box”
●Deep learning (very complex neural nets) is effective in learning higher-level features from a multitude of lower-level ones
●Deep learning is the key to image recognition and many AI applications