
Unit IV: Ensemble Learning & Unsupervised Learning – Study Material

Ensemble Learning

Ensemble Learning is a technique where multiple models are combined to improve overall performance. It reduces errors, increases accuracy, and handles data variability better than individual models.

### **Key Features**:


1. Combines multiple weak learners to create a strong learner.
2. Improves generalization and reduces overfitting.
3. Works well for both classification and regression tasks.

### **Types of Ensemble Learning**:


- **Bagging**: Reduces variance by training multiple models on random subsets (e.g.,
Random Forest).
- **Boosting**: Reduces bias by training models sequentially, giving more weight to
misclassified instances (e.g., AdaBoost, Gradient Boosting).
- **Stacking**: Combines multiple models using a meta-learner for final predictions.
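Stacking does not get its own section later, so a minimal sketch is included here. It assumes scikit-learn is available; the base learners, meta-learner, and synthetic dataset are illustrative choices, not part of the study material.

```python
# Stacking sketch: base learners' predictions are combined by a meta-learner.
from sklearn.datasets import make_classification
from sklearn.ensemble import StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

stack = StackingClassifier(
    estimators=[("tree", DecisionTreeClassifier(random_state=42)),
                ("knn", KNeighborsClassifier())],
    final_estimator=LogisticRegression(),  # meta-learner for the final prediction
)
stack.fit(X_train, y_train)
print("Stacking accuracy:", stack.score(X_test, y_test))
```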

Model Combination Schemes

Different strategies exist for combining multiple models in ensemble learning.

1. **Voting**: In classification, multiple models vote, and the majority class is selected.
2. **Error-Correcting Output Codes (ECOC)**: Decomposes multi-class problems into
multiple binary classifications.
3. **Bagging (Bootstrap Aggregating)**: Trains models independently on different
subsets of data and averages results.
4. **Boosting**: Models are trained sequentially, correcting errors from previous
models.
5. **Stacking**: Outputs from base learners are combined using another model (a meta-learner) for final predictions.
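As an example of the voting scheme above, here is a minimal hard-voting sketch, assuming scikit-learn; the synthetic dataset and the choice of base models are illustrative.

```python
# Hard voting: each model casts one vote and the majority class wins.
from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

vote = VotingClassifier(
    estimators=[("lr", LogisticRegression(max_iter=1000)),
                ("dt", DecisionTreeClassifier(random_state=0)),
                ("nb", GaussianNB())],
    voting="hard",
)
vote.fit(X_train, y_train)
print("Majority-vote accuracy:", vote.score(X_test, y_test))
```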

Bagging: Random Forest

Bagging is a technique that improves stability and accuracy by reducing overfitting.


### **Random Forest**:
- Uses multiple Decision Trees trained on different subsets of data.
- Predictions are averaged (regression) or majority-voted (classification).
- Handles missing values and large datasets well.

### **Advantages**:
- Reduces overfitting.
- Works well with high-dimensional data.
- Can be used for feature importance ranking.

### **Disadvantages**:
- Requires more computational power.
- Loses interpretability compared to individual Decision Trees.
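A minimal Random Forest sketch, assuming scikit-learn and its built-in Iris dataset; the hyperparameters are illustrative.

```python
# Random Forest: many trees on bootstrap samples; predictions are majority-voted.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

forest = RandomForestClassifier(n_estimators=100, random_state=1)
forest.fit(X_train, y_train)
print("Accuracy:", forest.score(X_test, y_test))
print("Feature importances:", forest.feature_importances_)  # importance ranking
```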

Boosting: AdaBoost

Boosting combines weak models sequentially, giving more weight to misclassified instances.

### **AdaBoost (Adaptive Boosting)**:


- Assigns weights to each sample and updates them iteratively.
- Focuses on misclassified samples to improve predictions.
- Uses weak classifiers like Decision Stumps.

### **Advantages**:
- Reduces bias, improving weak classifiers.
- Often more accurate than bagging on complex datasets.

### **Disadvantages**:
- Sensitive to noise in the dataset.
- Slower training due to sequential model building.
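A minimal AdaBoost sketch with decision stumps as weak learners, assuming scikit-learn and a synthetic dataset; note that the weak-learner argument is named `estimator` in recent scikit-learn releases (older releases use `base_estimator`).

```python
# AdaBoost: weak learners (depth-1 stumps) are trained sequentially; each round
# increases the weight of samples the previous learners misclassified.
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=7)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=7)

ada = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=1),  # decision stump
    n_estimators=100,
    random_state=7,
)
ada.fit(X_train, y_train)
print("AdaBoost accuracy:", ada.score(X_test, y_test))
```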

Unsupervised Learning

Unsupervised Learning finds patterns in **unlabeled data**. Unlike supervised learning, it does not rely on predefined outputs.

### **Key Features**:


1. Works with **unlabeled** data.
2. Groups similar data points or reduces dimensionality.
3. Used in anomaly detection, recommendation systems, and exploratory data analysis.

### **Main Types**:


- **Clustering**: Groups similar data points.
- **Dimensionality Reduction**: Reduces dataset complexity while preserving essential
information (e.g., PCA, LLE, Factor Analysis).

Clustering: Introduction

Clustering is an unsupervised learning technique that **groups similar data points** based on some similarity measure.

### **Types of Clustering**:


1. **Hierarchical Clustering**: Builds a hierarchy of clusters (e.g., AGNES, DIANA).
2. **Partitional Clustering**: Divides data into distinct clusters (e.g., K-Means, K-Mode).
3. **Density-Based Clustering**: Identifies clusters based on dense regions (e.g.,
DBSCAN, Mean-Shift).
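Density-based clustering is not covered in a later section, so a minimal DBSCAN sketch follows, assuming scikit-learn; the `eps` and `min_samples` values are illustrative.

```python
# DBSCAN: clusters are dense regions separated by sparse ones; no K is needed.
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

X, _ = make_moons(n_samples=300, noise=0.05, random_state=0)  # non-convex clusters

db = DBSCAN(eps=0.2, min_samples=5).fit(X)  # eps = neighbourhood radius
print("Labels found:", set(db.labels_))     # -1 marks noise points
```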

Hierarchical Clustering: AGNES & DIANA

Hierarchical Clustering builds a nested structure of clusters.

### **AGNES (Agglomerative Nesting)**:


- A **bottom-up** approach: Each data point starts as its own cluster and merges step by
step.
- Uses linkage methods (single, complete, average).

### **DIANA (Divisive Analysis)**:


- A **top-down** approach: All data points start in one cluster and are split iteratively.

### **Advantages**:
- No need to predefine the number of clusters.
- Dendrograms provide visual insights.

### **Disadvantages**:
- Computationally expensive for large datasets.
- Sensitive to noise and outliers.
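A minimal bottom-up (AGNES-style) sketch, assuming scikit-learn, SciPy, and Matplotlib are available; the dataset and linkage choice are illustrative.

```python
# Agglomerative (bottom-up) clustering with average linkage, plus a dendrogram.
import matplotlib.pyplot as plt
import numpy as np
from scipy.cluster.hierarchy import dendrogram, linkage
from sklearn.cluster import AgglomerativeClustering
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=50, centers=3, random_state=3)

agg = AgglomerativeClustering(n_clusters=3, linkage="average").fit(X)
print("Cluster sizes:", np.bincount(agg.labels_))

dendrogram(linkage(X, method="average"))  # visualises the merge hierarchy
plt.show()
```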

Partitional Clustering: K-Means & K-Mode

Partitional Clustering divides the data into a **fixed number (K) of clusters**.

### **K-Means Clustering**:


- Assigns data points to **K clusters** based on distance (usually Euclidean).
- Iteratively updates centroids to minimize variance.

### **K-Mode Clustering**:


- Used for categorical data instead of numerical values.
- Replaces means with **modes** (most frequent values).

### **Advantages**:
- Fast and scalable for large datasets.
- Works well when clusters are well-separated.

### **Disadvantages**:
- Sensitive to initial cluster centers.
- Does not handle outliers well.
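A minimal K-Means sketch, assuming scikit-learn; k-means++ initialisation mitigates the sensitivity to initial centroids noted above. (K-Mode for categorical data is typically available via the third-party `kmodes` package, not scikit-learn.)

```python
# K-Means: assign points to the nearest centroid, then recompute centroids,
# repeating until the within-cluster variance (inertia) stops improving.
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

X, _ = make_blobs(n_samples=300, centers=4, random_state=2)

km = KMeans(n_clusters=4, init="k-means++", n_init=10, random_state=2)
labels = km.fit_predict(X)
print("Centroids:\n", km.cluster_centers_)
print("Inertia (within-cluster variance):", km.inertia_)
```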

Dimensionality Reduction: PCA & LLE

Dimensionality reduction techniques help reduce the number of features while preserving
important information.

### **Principal Component Analysis (PCA)**:


- Finds new feature axes (principal components) that maximize variance.
- Used in image compression and face recognition.

### **Locally Linear Embedding (LLE)**:


- A nonlinear technique preserving local relationships in data.
- Suitable for highly nonlinear structures.

### **Advantages**:
- Reduces noise and redundancy.
- Speeds up model training.

### **Disadvantages**:
- Can lose interpretability.
- Assumes linearity (for PCA).
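A minimal sketch of both techniques on scikit-learn's built-in digits dataset; the number of components and neighbours are illustrative choices.

```python
# PCA: linear projection onto the directions of maximum variance.
# LLE: nonlinear embedding that preserves each point's local neighbourhood.
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.manifold import LocallyLinearEmbedding

X, _ = load_digits(return_X_y=True)  # 64-dimensional digit images

X_pca = PCA(n_components=2).fit_transform(X)
X_lle = LocallyLinearEmbedding(n_components=2, n_neighbors=10).fit_transform(X)
print("Reduced shapes:", X_pca.shape, X_lle.shape)
```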
