ML_MU_Unit_1_Introduction_to_MLpdf__2025_02_07_10_53_02 (2)

This document outlines a machine learning course that covers key concepts such as supervised, unsupervised, and reinforcement learning, as well as practical applications and the machine learning life cycle. It emphasizes the importance of data visualization and provides insights into various machine learning algorithms and their applications. The document also discusses the types of data used for visualization and the tools available for effective data analysis.

Uploaded by

iamkarmabhatt

Course Outcomes
After completing this course, students will be able to:
• Understand machine learning concepts.
• Understand and implement classification concepts.
• Understand and analyse the different regression algorithms.
• Apply the concepts of unsupervised learning.
• Apply the concepts of artificial neural networks.
Topics
Introduction to ML:
• Motivation and applications
• Importance of data visualization
• Basics of supervised, unsupervised, and reinforcement learning
• Current research trends in ML
Machine Learning Introduction
ML is an interdisciplinary field that draws on several roles:
• Data Analyst: visualizes and analyses data, and works on optimization.
• Data Engineer: builds and tests scalable, stable, and optimal ecosystems in which data scientists can run their algorithms.
• Database Administrator: is responsible for the proper functioning of all the databases.
• Data Scientist: performs predictive analysis and offers actionable insights.
• Statistician: extracts and offers valuable insights from the data using statistical theory and tools.
Machine Learning Introduction
• AI stands for Artificial Intelligence. It is the study of enabling machines to mimic human behavior through algorithms.
• ML stands for Machine Learning. It is the study of statistical methods that enable machines to improve with experience.
• DL stands for Deep Learning. It is the study of neural networks (loosely modelled on the neurons in the human brain) that imitate the way a human brain functions.
• Data science is the field of applying advanced analytics techniques and scientific principles to extract valuable information from data for business decision-making, strategic planning, and other uses.
Evaluation of Machine Learning
What is Human Learning?
• In cognitive science, learning is typically described as the process of gaining information through observation.
• A task can be as simple as walking down the street or doing homework, or as complex as deciding the angle at which a rocket should be launched so that it follows a particular trajectory.
• Why do we need to learn?
• With more knowledge, we can do homework with fewer mistakes.
• Thus, with more learning, tasks can be performed more efficiently.
Types of Human Learning
1. Learning under expert guidance
• Somebody who is an expert in the subject teaches us directly.
• We gain information from a person who has sufficient knowledge from past experience (e.g. the way a child learns).
2. Learning guided by knowledge gained from experts
• We build our own notions indirectly, based on what we have learnt from experts in the past.
• Learning also draws on knowledge that a teacher or mentor imparted at some earlier point, in some other form.
• E.g. a child can pick the odd word out of a set of words because it is a verb and all the others are nouns, using the English learned in school.
3. Learning by self
• We do it ourselves, perhaps after multiple attempts, some of them unsuccessful.
• We learn from our past mistakes.
• E.g. a child learning to walk around obstacles.
What is Machine Learning?
• “Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed.”
- Arthur Samuel, AI pioneer, 1959
• “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.”
- Tom Mitchell, ML Professor at CMU
• In short, machine learning algorithms:
• improve their performance (P)
• at some task (T)
• with experience (E)
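To make Mitchell's definition concrete, here is a minimal sketch with a toy task. The task, the data, and the nearest-neighbour learner are all assumptions chosen for illustration. T is "label a number 1 if it is at least 5, else 0", P is accuracy on ten test points, and E is a growing list of labelled examples.

```python
# Task T: classify a number as 1 if it is at least 5, otherwise 0.
# Performance P: accuracy on a fixed set of test points.
# Experience E: labelled examples, revealed one at a time.

def predict(examples, x):
    """1-nearest-neighbour: copy the label of the closest known example."""
    nearest = min(examples, key=lambda ex: abs(ex[0] - x))
    return nearest[1]

def accuracy(examples, test_points):
    """Fraction of test points whose predicted label matches the true rule."""
    correct = sum(predict(examples, x) == int(x >= 5) for x in test_points)
    return correct / len(test_points)

experience = [(4, 0), (9, 1), (5, 1)]   # hypothetical labelled examples
test_points = list(range(10))           # 0, 1, ..., 9

# Measure P after each new piece of experience E.
scores = [accuracy(experience[:n], test_points)
          for n in range(1, len(experience) + 1)]
print(scores)  # [0.5, 0.8, 1.0]
```

The program is never told the rule "at least 5". Its accuracy rises only because it sees more examples, which is the sense in which it learns.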
Traditional v/s Machine Learning
How do machines learn?
• Data input: Past data or information is used as the basis for future decision-making.
• Abstraction: The input data is represented in a broader way through the underlying algorithm.
• Generalization: The abstracted representation is generalized to form a framework for making decisions.
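The three stages can be sketched with a straight-line fit. This is only an illustration: the data is made up, and least-squares fitting is one possible form of abstraction, not the only one.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Data input: hypothetical past observations (hours studied, marks scored).
hours = [1, 2, 3, 4]
marks = [3, 5, 7, 9]

# Abstraction: the raw pairs are summarized by two numbers.
slope, intercept = fit_line(hours, marks)

# Generalization: the summary is applied to an input that was never observed.
predicted_marks = slope * 5 + intercept
print(slope, intercept, predicted_marks)
```

The four observations are replaced by a slope and an intercept, and that compact representation is what gets used for future decisions.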
Well-posed Learning Problem
• A simple framework can be used to define a new problem that can be solved using ML. The framework involves answering three questions:
• What is the problem?
• Describe the problem informally and formally, and list assumptions and similar problems.
• Why does the problem need to be solved?
• List the motivation for solving the problem, the benefits that the solution will provide, and how the solution will be used.
• How would I solve the problem?
• Describe how the problem would be solved manually, to flush out domain knowledge.
Machine learning Life cycle
The machine learning life cycle involves seven major steps:
• Gathering data
• Data preparation
• Data wrangling
• Analyse data
• Train the model
• Test the model
• Deployment
1. Gathering Data
• Data gathering is the first step of the machine learning life cycle. The goal of this step is to identify and obtain all the data relevant to the problem.
• In this step, we identify the different data sources, as data can be collected from sources such as files, databases, the internet, or mobile devices. It is one of the most important steps of the life cycle: the quantity and quality of the collected data determine the quality of the output. In general, the more good-quality data we have, the more accurate the predictions tend to be.
• This step includes the following tasks:
• Identify various data sources
• Collect data
• Integrate the data obtained from different sources
• Performing these tasks gives us a coherent set of data, called a dataset, which is used in the later steps.
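A minimal sketch of the integration task, assuming two hypothetical sources: a CSV export (an in-memory string stands in for a file) and records that are already in memory. The field names and values are made up.

```python
import csv
import io

# Source 1: a CSV export (an in-memory string stands in for a file on disk).
csv_text = "id,age,city\n1,34,Pune\n2,28,Surat\n"
csv_rows = [
    {"id": int(r["id"]), "age": int(r["age"]), "city": r["city"]}
    for r in csv.DictReader(io.StringIO(csv_text))
]

# Source 2: records already in memory, e.g. fetched from a database or an API.
api_rows = [{"id": 3, "age": 41, "city": "Rajkot"}]

# Integrate: one list of records with a common schema, plus a field that
# remembers where each record came from.
dataset = [dict(row, source="csv") for row in csv_rows]
dataset += [dict(row, source="api") for row in api_rows]

for row in dataset:
    print(row)
```

Converting every source to the same schema before combining is the key step. Keeping a `source` field is optional, but it helps when a quality problem later has to be traced back to its origin.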
2. Data preparation
• After collecting the data, we need to prepare it for the later steps. Data preparation is the step in which we put our data in a suitable place and prepare it for use in machine learning training.
• In this step, we first put all the data together and then randomize its ordering.
• Data exploration: This is used to understand the nature of the data we have to work with. We need to understand the characteristics, format, and quality of the data.
• A better understanding of the data leads to a more effective outcome. Here we look for correlations, general trends, and outliers.
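As a rough sketch of this step, the snippet below shuffles a small column of made-up values and computes a few summary statistics. The fixed seed is an assumption that keeps the run repeatable.

```python
import random
import statistics

heights_cm = [150, 152, 155, 158, 160, 162, 165, 210]  # hypothetical values

# Randomize the ordering with a fixed seed so the run is repeatable.
shuffled = heights_cm[:]
random.Random(42).shuffle(shuffled)

# Data exploration: basic characteristics of the column.
summary = {
    "count": len(shuffled),
    "min": min(shuffled),
    "max": max(shuffled),
    "mean": statistics.mean(shuffled),
    "median": statistics.median(shuffled),
}
print(summary)
```

Here the maximum (210) sits far from the median (159), which hints at a possible outlier worth checking before training.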
3. Data Wrangling / Data pre-processing
• Data wrangling is the process of cleaning raw data and converting it into a usable format. It involves cleaning the data, selecting the variables to use, and transforming the data into a proper format so that it is more suitable for analysis in the next step. It is one of the most important steps of the complete process. Cleaning is required to address quality issues.
• Not all the data we have collected is necessarily useful. In real-world applications, collected data may have various issues, including:
• Missing values
• Duplicate data
• Invalid data
• Noise
• So, we use various filtering techniques to clean the data.
• These issues must be detected and removed because they can negatively affect the quality of the outcome.
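A minimal cleaning sketch for three of the issues listed above. The records and the valid age range (0 to 120) are assumptions made for the example.

```python
raw_rows = [
    {"id": 1, "age": 34},
    {"id": 2, "age": None},   # missing value
    {"id": 1, "age": 34},     # duplicate of the first row
    {"id": 3, "age": -5},     # invalid: age cannot be negative
    {"id": 4, "age": 29},
]

def clean(rows):
    """Drop rows with missing, invalid, or duplicated values."""
    seen = set()
    cleaned = []
    for row in rows:
        if row["age"] is None:              # missing value
            continue
        if not 0 <= row["age"] <= 120:      # invalid value (assumed valid range)
            continue
        key = (row["id"], row["age"])
        if key in seen:                     # duplicate
            continue
        seen.add(key)
        cleaned.append(row)
    return cleaned

cleaned_rows = clean(raw_rows)
print(cleaned_rows)
```

Dropping rows is the simplest strategy. Filling a missing value with, say, the column median is a common alternative when data is scarce.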
4. Data Analysis
• The cleaned and prepared data is now passed on to the analysis step. This step involves:
• Selecting analytical techniques
• Building models
• Reviewing the result
• The aim of this step is to build a machine learning model that analyses the data using various analytical techniques, and to review the outcome. It starts with determining the type of problem, where we select a machine learning technique such as classification, regression, cluster analysis, or association. We then build the model using the prepared data and evaluate it.
• Hence, in this step, we take the data and use machine learning algorithms to build the model.
5. Train Model
• The next step is to train the model. In this step we train our model to improve its performance and obtain a better outcome for the problem.
• We use datasets to train the model with various machine learning algorithms. Training is required so that the model can learn the various patterns, rules, and features.
6. Test Model
• Once our machine learning model has been trained on a given dataset, we test it. In this step, we check the accuracy of our model by giving it a test dataset.
• Testing determines the percentage accuracy of the model, which is compared against the requirements of the project or problem.
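The train and test steps can be sketched together. The data, the 80/20 split, and the very simple threshold model are all assumptions for illustration. In practice the held-out rows would be chosen at random and the model would come from a library.

```python
# Hypothetical labelled data: (hours studied, passed?)
data = [(1, 0), (2, 0), (3, 0), (4, 0), (6, 1), (7, 1), (8, 1), (9, 1),
        (2.5, 0), (7.5, 1)]

train, test = data[:8], data[8:]   # hold out the last 20% for testing

def train_threshold(rows):
    """Train: place the decision threshold midway between the class means."""
    zeros = [x for x, y in rows if y == 0]
    ones = [x for x, y in rows if y == 1]
    return (sum(zeros) / len(zeros) + sum(ones) / len(ones)) / 2

def accuracy(threshold, rows):
    """Test: percentage of rows whose prediction matches the label."""
    correct = sum((x >= threshold) == bool(y) for x, y in rows)
    return 100 * correct / len(rows)

threshold = train_threshold(train)          # uses training rows only
test_accuracy = accuracy(threshold, test)   # uses unseen rows only
print(threshold, test_accuracy)
```

The important point is that the test rows play no part in training, so the reported accuracy reflects how the model handles data it has not seen.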
7. Deployment
• The last step of the machine learning life cycle is deployment, where we deploy the model in a real-world system.
• If the prepared model produces accurate results that meet our requirements, at an acceptable speed, we deploy it in the real system. Before deploying, we check whether the model is improving its performance on the available data. The deployment phase is similar to making the final report for a project.
Types of Machine Learning
Supervised Learning
• Supervised learning is the type of machine learning in which machines are trained using well "labelled" training data and, on the basis of that data, predict the output.
• Labelled data means that some input data is already tagged with the correct output.
Types of Supervised Learning
• Classification (predicts a discrete-valued output)
• Regression (predicts a real-valued output)
Unsupervised Learning
• Unsupervised learning is a machine learning technique in which models are not supervised using a labelled training dataset.
• Instead, the models themselves find the hidden patterns and insights in the given data. It can be compared to the learning that takes place in the human brain when we learn new things.
Types of Unsupervised Learning
• Clustering
• Association
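Clustering can be illustrated with a bare-bones k-means on one-dimensional data. The points and the starting centroids are made up. Note that no labels are supplied, so the two groups are found from the data alone.

```python
def kmeans_1d(points, centroids, iterations=10):
    """Plain k-means on 1-D data, starting from the given centroids."""
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        centroids = [
            sum(c) / len(c) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters

points = [1.0, 1.5, 2.0, 8.0, 8.5, 9.0]   # hypothetical unlabelled data
centroids, clusters = kmeans_1d(points, centroids=[0.0, 5.0])
print(centroids)  # [1.5, 8.5]
print(clusters)   # [[1.0, 1.5, 2.0], [8.0, 8.5, 9.0]]
```

A fixed iteration count keeps the sketch short. Library implementations usually stop when the centroids no longer move.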
Reinforcement Learning
• Reinforcement learning is a feedback-based (reward-based) machine learning technique in which an agent learns to behave in an environment by performing actions and seeing the results of those actions.
• For each good action, the agent gets positive feedback, and for each bad action, the agent gets negative feedback or a penalty.
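The feedback loop can be sketched with tabular Q-learning on a toy corridor. The environment, the reward values, and the hyperparameters are assumptions chosen for illustration. The agent starts in cell 0 and is rewarded for reaching cell 4.

```python
import random

N_STATES = 5                 # corridor cells 0..4; cell 4 is the goal
ACTIONS = [-1, +1]           # move left, move right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2   # assumed hyperparameters

def step(state, action):
    """Environment: returns (next_state, reward, done)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    if next_state == N_STATES - 1:
        return next_state, 1.0, True      # positive feedback at the goal
    return next_state, -0.1, False        # small penalty for every other move

rng = random.Random(0)
q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action index]

for _ in range(500):                       # episodes
    state, done = 0, False
    while not done:
        if rng.random() < EPSILON:         # explore: try a random action
            a = rng.randrange(2)
        else:                              # exploit: best known action
            a = max(range(2), key=lambda i: q[state][i])
        next_state, reward, done = step(state, ACTIONS[a])
        target = reward if done else reward + GAMMA * max(q[next_state])
        q[state][a] += ALPHA * (target - q[state][a])
        state = next_state

# Greedy policy for the non-goal cells after training.
policy = ["left" if qs[0] > qs[1] else "right" for qs in q[:-1]]
print(policy)
```

Nobody tells the agent that "right" is correct. It works this out from the rewards and penalties it receives, which is what separates this setting from supervised learning.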
Comparison: Supervised, Unsupervised and Reinforcement Learning
Criteria | Supervised ML | Unsupervised ML | Reinforcement ML
Definition | Learns by using labelled data | Trained using unlabelled data without any guidance | Works by interacting with the environment (reward based)
Type of data | Labelled data | Unlabelled data | No predefined data
Type of problems | Regression and classification | Association and clustering | Exploitation or exploration
Supervision | Extra supervision | No supervision | No supervision
Algorithms | Linear Regression, Logistic Regression, SVM, KNN, NB, DT | K-Means, PCA, DBSCAN, Apriori | Q-Learning, SARSA
Aim | Calculate outcomes | Discover underlying patterns | Learn a series of actions
Application | Risk Evaluation, Forecast Sales | Recommendation System, Anomaly Detection | Self Driving Cars, Gaming, Healthcare
Did you know?
• Many video games are based on an artificial intelligence technique called an expert system. This technique can imitate areas of human behavior, with the goal of mimicking the human abilities of sensing, perception, and reasoning.
When not to use ML?
• Machine learning should not be applied to tasks at which humans are very effective or in which frequent human intervention is needed.
• For example, air traffic control is a very complex task that needs intense human involvement.
• Also, there is no sense in using machine learning for very simple tasks that can be implemented using traditional programming paradigms.
• For example, simple rule-driven or formula-based applications such as a price calculator engine or a dispute tracking application do not need machine learning techniques.
Application of ML
Tools for Machine Learning
Data Visualization in Machine Learning
• Data visualization is a crucial aspect of machine learning that enables analysts to understand and make sense of data patterns, relationships, and trends.
• Through data visualization, insights and patterns in data can be easily interpreted and communicated to a wider audience, making it a critical component of machine learning.
• Data visualization is the graphical representation of information and data.
• By using visual elements like charts, graphs, and maps, data visualization tools provide an accessible way to see and understand trends, outliers, and patterns in data.
What is Data Visualization?
• Data visualization translates complex data sets into visual formats that are easier for the human brain to comprehend. This can include a variety of visual tools such as:
• Charts: Bar charts, line charts, pie charts, etc.
• Graphs: Scatter plots, histograms, etc.
• Maps: Geographic maps, heat maps, etc.
• Dashboards: Interactive platforms that combine multiple visualizations.
Types of Data for Visualization
• Accurate visualization of data is critical to market research, where both numerical and categorical data can be visualized. It helps increase the impact of insights and reduces the risk of analysis paralysis. The data to be visualized falls into the following categories:
• Numerical data
• Categorical data
Types of Data Visualization Approaches
Machine learning may make use of a wide variety of data visualization approaches. These include:
• Line charts
• Scatter plots
• Bar charts
• Heat maps
• Tree maps
• Box plots
1. Line Charts
• In a line chart, each data point is represented by a point on the graph, and these points are connected by a line. Line charts help us find patterns and trends in the data across time. Time-series data is frequently displayed using line charts.
2. Scatter Plots
• Scatter plots are a quick and efficient way of displaying the relationship between two variables. With one variable plotted on the x-axis and the other on the y-axis, each data point in a scatter plot is represented by a point on the graph. We can use scatter plots to find patterns, clusters, and outliers.
3. Bar Charts
• Bar charts are a common way of displaying categorical data. In a bar chart, each category is represented by a bar, with the height of the bar indicating the frequency or proportion of that category in the data. Bar charts are useful for comparing several categories and seeing patterns over time.
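The quantity a bar chart displays is the frequency of each category. The sketch below counts made-up categorical values and prints text bars. A plotting library would draw real bars from the same counts.

```python
from collections import Counter

colours = ["red", "blue", "red", "green", "blue", "red"]  # hypothetical categorical data

counts = Counter(colours)

# One text "bar" per category; the bar length is the category's frequency.
for category, count in counts.most_common():
    print(f"{category:<6} {'#' * count} ({count})")
```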
4. Heat Maps
• Heat maps are a type of graphical representation that displays data in a matrix format. The colour of each matrix cell is determined by the value of the data point it represents. Heat maps are often used to visualize the correlation between variables or to identify patterns in time-series data.
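A correlation heat map is drawn from a correlation matrix. The sketch below builds one for three made-up columns using the Pearson coefficient and prints it as text. In a real heat map each number would be shown as a colour.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

# Hypothetical numeric columns.
columns = {
    "hours": [1, 2, 3, 4, 5],
    "marks": [2, 4, 6, 8, 10],     # rises with hours
    "errors": [10, 8, 6, 4, 2],    # falls with hours
}

names = list(columns)
matrix = [[pearson(columns[a], columns[b]) for b in names] for a in names]

# Each cell of this matrix would become one coloured cell of the heat map.
for name, row in zip(names, matrix):
    print(f"{name:<7}", " ".join(f"{value:+.2f}" for value in row))
```

Values near +1 and -1 would get the strongest colours, so strongly related pairs of variables stand out at a glance.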
5. Tree Maps
• Tree maps are used to display hierarchical data in a compact format and are useful for showing the relationship between different levels of a hierarchy.
6. Box Plots
• Box plots are a graphical representation of the distribution of a set of data. In a box plot, the median is shown by a line inside the box, while the box itself spans the interquartile range, from the first quartile to the third quartile (the middle 50% of the data). The whiskers extend from the box to the highest and lowest values in the data, excluding outliers. Box plots help us identify the spread and skewness of the data.
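The numbers behind a box plot can be computed directly. The sketch below uses made-up values and one common convention, which is an assumption here: points more than 1.5 times the interquartile range beyond the box count as outliers.

```python
import statistics

values = [2, 4, 5, 6, 7, 8, 9, 11, 30]   # hypothetical sample

# Box: first quartile, median, third quartile.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Assumed convention: points beyond 1.5 * IQR from the box are outliers.
low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
inside = [v for v in values if low_fence <= v <= high_fence]
outliers = [v for v in values if v < low_fence or v > high_fence]

# Whiskers reach the most extreme values that are not outliers.
whisker_low, whisker_high = min(inside), max(inside)

print(q1, median, q3)             # 5.0 7.0 9.0
print(whisker_low, whisker_high)  # 2 11
print(outliers)                   # [30]
```

Other quartile methods and outlier rules exist, so a plotting library may place the box edges slightly differently for the same data.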
Uses of Data Visualization in Machine Learning
• Identify trends and patterns in data: Trends and patterns can be hard to spot with conventional approaches, and data visualization tools can help reveal them.
• Communicate insights to stakeholders: Data visualization can be used to communicate insights to stakeholders in a format that is easily understandable and helps support decision-making processes.
• Monitor machine learning models: Data visualization can be used to monitor machine learning models in real time and to identify any issues or anomalies in the data.
• Improve data quality: Data visualization can be used to identify outliers and inconsistencies in the data and to improve data quality by removing them.
Thank you
Any queries?
