DSF Unit 3
DSF Unit 3
MACHINE
LEARNING
The modeling process - Types of machine learning -
Supervised learning – Unsupervised learning -Semi-
supervised learning- Classification, regression -
Clustering – Outliers and Outlier Analysis.
MODELING PROCESS
There are 10 steps are involved to make better
machine learning model.
1. Problem Definition
2. Data Collection
3. Data Exploration and Preprocessing
4. Feature Selection
5. Model Selection
6. Model Training
7. Model Evaluation
8. Model Tuning
9. Model Deployment
10. Model Maintenance
Machine learning
Machine learning is a subset of AI, which enables the
machine to automatically learn from data, improve
performance from past experiences, and make
predictions.
Machine learning uses algorithms and data sets to
teach computers to learn from data and improve with
experience.
In simple words, ML teaches the systems to think and
understand like humans by learning from the data.
Types of machine
learning
1. Supervised Machine Learning
2. Unsupervised Machine Learning
3. Semi-Supervised Machine Learning
4. Reinforcement Learning
Supervised Learning
Supervised machine learning is based on
supervision. It this, we train the machines using
the "labelled" dataset, and based on the
training, the machine predicts the output.
First, we train the machine with the input and
corresponding output, and then we ask the
machine to predict the output using the test
dataset.
The main goal of the supervised learning
technique is to map the input variable(x) with
the output variable(y). Some real-world
applications of supervised learning are Risk
Assessment, Fraud Detection, Spam filtering, etc
Example
TYPES OF SUPERVISED MACHINE
LEARNING
Supervised machine learning can be classified
into two types of problems, which are given
below:
1. Classification
2. Regression
CLASSIFICATION:
Classification deals with predicting categorical
1. Linear Regression
2. Regression Trees
3. Non-Linear Regression
4. Bayesian Linear Regression
5. Polynomial Regression
UNSUPERVISED
LEARNING
As its name suggests, there is no need for
supervision. It means, in unsupervised
machine learning, the machine is trained
using the unlabeled dataset, and the machine
predicts the output without any supervision.
Here, the models are trained with the data
classification
Drug discovery
OUTLIER
Outliers in machine learning refer to data
points that are significantly different from the
majority of the data. These data points can be
anomalous, noisy, or errors in measurement.
An outlier is a data point that significantly
2. Contextual outliers:
These are unusual in a specific context but