ML_MU_Unit_1_Introduction_to_MLpdf__2025_02_07_10_53_02 (2)
ML_MU_Unit_1_Introduction_to_MLpdf__2025_02_07_10_53_02 (2)
Evaluation of
Machine
Learning
Continued
Evaluation of
Machine
Learning
In cognitive science, learning is typically referred to as
the process of gaining information through
observation.
A task can be as simple as walking down the street or
doing the homework; or as complex as deciding the
What is angle in which a rocket should be launched so that it
Human can have a particular trajectory.
Learning? Why do we need to learn?
With more knowledge, the ability to do homework
with less number of mistakes increases
Thus, With more learning, tasks can be performed
more efficiently.
1. Learning under expert guidance
Somebody who is an expert in the subject directly teaches us.
The process of gaining information from a person having
sufficient knowledge due to past experience. (e.g. learning of
child)
Types of 2. Learning guided by knowledge gained from experts
Human we build our own notion indirectly based on what we have
Learning learnt from the expert in the past
learning also happens with the knowledge which has been
imparted by teacher or mentor at some point of time in some
other form
E.g. a kid can select one odd word from a set of words because
it is a verb and other words being all nouns, due to English
learned in school
3. Learning by self
We do it ourselves, may be after multiple attempts,
Types of some being unsuccessful.
Human Learning from our mistakes in past.
Learning E.g. Child learning to walk through obstacles.
“Machine learning is the field of study that gives
computers the ability to learn without being
explicitly programmed”
- Arthur Samuel, AI pioneer, 1959
“A computer program is said to learn from experience E
What is with respect to some class of tasks T and performance
Machine measure P, if its performance at tasks in T, as measured by
P, improves with experience E”
Learning? - Tom Mitchell, ML Professor at CMU
Algorithms that
improve their performance (P)
at some task (T)
with experience (E)
Traditional v/s
Machine
Learning
Data Input: Past data or information is utilized as a
basis for future decision-making
Abstraction: The input data is represented in a broader
way through the underlying algorithm
Generalization: The abstracted representation is
generalized to form a framework for making decisions
How do
machine learn?
For defining a new problem, which can be solved using ML, a
simple framework can be used. The framework involves
answering three questions:
What is the problem?
Describe the problem informally and formally and list
Well-posed assumptions and similar problems.
Problem List the motivation for solving the problem, the benefits that the
solution will provide and how the solution will be used.
How would I solve the problem?
Describe how the problem would be solved manually to flush
domain knowledge.
Machine
learning Life
cycle
Machine learning life cycle involves seven major steps, which
are given below:
Gathering Data
Data preparation
Machine Data Wrangling
learning Life Analyse Data
cycle
Train the model
Deployment
Data Gathering is the first step of the machine learning life cycle. The goal of
this step is to identify and obtain all data-related problems.
In this step, we need to identify the different data sources, as data can be
collected from various sources such as files, database, internet, or mobile
devices. It is one of the most important steps of the life cycle. The quantity
and quality of the collected data will determine the efficiency of the output.
1. Gathering The more will be the data, the more accurate will be the prediction.
Collect data
By performing the above task, we get a coherent set of data, also called as
a dataset. It will be used in further steps.
After collecting the data, we need to prepare it for further steps.
Data preparation is a step where we put our data into a suitable
place and prepare it to use in our machine learning training.
In this step, first, we put all data together, and then randomize the
It is not necessary that data we have collected is always of our use as some of the data
3. Data may not be useful. In real-world applications, collected data may have various issues,
Wrangling / including:
Duplicate data
processing
Invalid data
Noise
It is mandatory to detect and remove the above issues because it can negatively affect
the quality of the outcome.
Now the cleaned and prepared data is passed on to the analysis
step. This step involves:
Selection of analytical techniques
Building models
Review the result
Hence, in this step, we take the data and use machine learning
algorithms to build the model.
Now the next step is to train the model, in this step we
train our model to improve its performance for better
outcome of the problem.
Types of
Supervised
Learning
Unsupervised learning is a machine learning
technique in which models are not supervised using
training dataset.
Instead, models itself find the hidden patterns and
insights from the given data. It can be compared to
learning which takes place in the human brain while
learning new things.
Unsupervised
Learning
Clustering Association
Types of
Unsupervised
Learning
Reinforcement Learning is a feedback-based (reward)
Machine learning technique in which an agent learns to
behave in an environment by performing the actions
and seeing the results of actions.
For each good action, the agent gets positive feedback,
and for each bad action, the agent gets negative
Reinforcement feedback or penalty.
Learning
Criteria Supervised ML Unsupervised ML Reinforcement ML
Trained using Works on
Learns by using unlabelled data interacting with the
Definition
labelled data without any environment
guidance. (reward based)
Comparison – Type of data Labelled data Unlabelled data
No – predefined
data
Supervised,
Type of Regression and Association and Exploitation or
Unsupervised problems classification Clustering Exploration
and Supervision Extra supervision No supervision No supervision
Reinforcement Linear Regression, K – Means,
Q – Learning,
Algorithms Logistic Regression, PCA, DBSCAN,
Learning SVM, KNN, NB, DT. Apriori
SARSA
Categorical Data
Types of Data
for
Visualization
Machine learning may make use of a wide variety of data
visualization approaches. That include:
Line Charts
Types of Data Scatter Plots
Visualization
Bar Charts
Approaches
Heat Maps
Tree Maps
Box Plots
In a line chart, each data point is represented by a point
on the graph, and these points are connected by a line.
We may find patterns and trends in the data across
time by using line charts. Time-series data is frequently
displayed using line charts.
1. Line Charts
A quick and efficient method of displaying the
relationship between two variables is to use scatter
plots. With one variable plotted on the x-axis and the
other variable drawn on the y-axis, each data point in a
scatter plot is represented by a point on the graph. We
2. Scatter Plots may use scatter plots to visualize data to find patterns,
clusters, and outliers.
Bar charts are a common way of displaying categorical
data. In a bar chart, each category is represented by a
bar, with the height of the bar indicating the frequency
or proportion of that category in the data. Bar graphs
are useful for comparing several categories and seeing
3. Bar Charts patterns over time.
Heat maps are a type of graphical representation that
displays data in a matrix format. The value of the data
point that each matrix cell represents determines its
hue. Heatmaps are often used to visualize the
correlation between variables or to identify patterns in
4. Heat Maps time-series data.
Tree maps are used to display
hierarchical data in a compact
format and are useful in