Unit 2 Advanced Concepts of Modeling in AI
Unit 2 Advanced Concepts of Modeling in AI
b) Regression:
o it is a type of supervised learning method in learning based
approach.
o uses continuous labelled datasets to predict output.
Example:
1. To predict your next salary, put in the data of your
previous salary, any increments, etc., train the model.
2. Weather Prediction using past data. Here, the data which has been fed to the
machine is continuous.
c) Clustering:
o it is a type of unsupervised learning method in
learning based approach.
o uses discrete unlabelled data that can be
grouped into specific categories.
Example:
1. To predict your next salary, put in the data of your previous salary, any
increments, etc., train the model.
2. Weather Prediction using past data. Here, the data which has been fed to the
machine is continuous.
d) Dimensionality Reduction:
o it is a type of unsupervised learning method in learning based approach.
o uses unlabelled data.
o makes complex data into simple even though it comes to the cost of losing some
information.
o while reducing 3-Dimension to 2-Dimension still be able to make sense of the data.
Example:
1. Document classification
2. Image compression
e) Association:
o It is a type of unsupervised learning method in
learning based approach.
o uses large unlabelled datasets.
o involves finding relationships or patterns in
data where the occurrence of one item is
associated with the occurrence of another.
Example:
3. Market basket analysis
4. Recommender systems
5. Web usage mining.
12. Write the differences between Supervised, Unsupervised and Reinforcement learning.
Supervised Learning Unsupervised Learning Reinforcement Learning
Algorithms Algorithms Algorithms
Data provided is labeled Data provided is unlabeled The machine learns from its
data, with specified output data, the output is not environment using rewards &
values. specified, machine makes errors.
its own prediction.
Used to solve Regression Used to solve Association Used to solve Reward based
and classification and Clustering problems. problems.
problems.
Labeled data is used. Unlabeled data is used. No predefined data is used.
External Supervision. No Supervision. No Supervision.
• Solve problems by Solves problems by Follows Trail and Error
mapping labeled input to understanding patterns problem solving Approach.
known output. and discovering output.
Forecast Sales, Recommendation system, Self-driving cars, Gaming,
Risk evaluation, Email Anomaly Detection Healthcare, Parking car,
spam detector, Humanoid walking
13. Identify the type of learning (supervised, unsupervised, reinforcement learning) are the
following case studies most likely based on?
a) Case Study 1:
A company wants to predict customer churn based on past purchasing behavior, demographics,
and customer interactions. They have a dataset with labelled examples of customers who churned
and those who did not.
Ans: Supervised Learning
b) Case Study 2:
A social media platform wants to group users based on their interests and behavior to recommend
relevant content. They have a large dataset of user interactions but no predefined categories.
Which type of learning is this case study most likely based on?
Ans: Unsupervised Learning
c) Case Study 3: An autonomous vehicle is learning to navigate through a city environment. It
receives feedback in the form of rewards for reaching its destination safely and penalties for traffic
violations. Which type of learning is this case study most likely based on?
Ans: Reinforcement Learning
d) Case Study 4:
A healthcare provider wants to identify patterns in patient data to personalize treatment plans.
They have a dataset with various patient attributes but no predefined labels indicating specific
treatment plans. Which type of learning is this case study most likely based on?
Ans: Unsupervised Learning
e) Case Study 5:
A manufacturing company wants to optimize its production process by detecting anomalies in
sensor data from machinery. They have a dataset with examples of normal and anomalous
behavior. Which type of learning is this case study most likely based on?
Ans: Supervised Learning
14. Identify the type of model (classification, regression, clustering, association model) are the
following case studies most likely based on?
a) A bank wants to predict whether a loan applicant will "default" or "non-default" on their loan
payments. They have a dataset containing information such as income, credit score, loan amount,
and employment status.
Ans: Classification
b) A real estate agency wants to predict the selling price of houses based on various features
such as size, location, number of bedrooms, and bathrooms. They have a dataset containing
historical sales data.
Ans: Regression
c) A marketing company wants to segment its customer base into distinct groups based on
purchasing behavior for targeted marketing campaigns. They have a dataset containing
information such as purchase history, frequency of purchases, and amount spent.
ANS: clustering
d) A grocery store wants to identify associations between different products purchased by
customers to understand which products are commonly bought together. They have a transaction
dataset containing records of items purchased together during each transaction.
Ans: Association
15. A healthcare provider wants to improve patient care by predicting the length of hospital stays
for different medical conditions. They have a dataset containing patient demographics,
medical history, and treatment details. The task involves:
a) To predict whether a patient will have a short or long hospital stay.
ANS: Classification
b) To predict the number of days a patient will stay in the hospital.
ANS: Regression
c) To segment patients into groups with similar characteristics for personalized treatment
plans.
Ans: Clustering
d) To identify patterns in patient treatments and outcomes.
Ans: Association
Identify the type of model (classification, regression, clustering, and association model) in the
above tasks.
16. Define the following:
a) Data: Data is information in any form
b) Features: Columns of the tables are called features
c) Label: Data Labelling is the process of attaching meaning to data
d) Labelled data: Data to which some tag/label is attached.
e) Unlabelled data: The raw form of data to which no tag is attached.