0% found this document useful (0 votes)

365 views

IML-IITKGP - Assignment 1 Solution

This document provides a 15 question multiple choice quiz on machine learning concepts. The questions cover topics like classification vs regression, precision and recall, overfitting, cross-validation, and feature spaces. For each question, the correct answer(s) and a brief explanation of the concept(s) tested are provided. The quiz is assessing foundational machine learning knowledge through practical application type questions.

Uploaded by

Netaji Gandi

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

365 views

IML-IITKGP - Assignment 1 Solution

Uploaded by

Netaji Gandi

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Introduction to Machine Learning -IITKGP

Assignment - 1
TYPE OF QUESTION: MCQ/MSQ
Number of questions: 15 Total mark: 2 * 15 = 30

1. Which of the following is/are classification tasks?

a. Find the gender of a person by analyzing his writing style

b. Predict the price of a house based on floor area, number of rooms, etc.
c. Predict whether there will be abnormally heavy rainfall next year
d. Predict the number of copies of a book that will be sold this month

Correct Answers: a, c

Explanation: In (c), the amount of rainfall is a continuous variable. But, we are predicting
whether there will be abnormally heavy rainfall next year or not. So it is a Classification
task. Similarly, the number of classes in gender identification (a) is discrete. So, it’s a
classification task. The output variable is a continuous class in other options, so these are
regression tasks.

2. A feature F1 can take certain values: A, B, C, D, E, F, and represents the grade of students
from a college. Which of the following statement is true in the following case?
a. Feature F1 is an example of a nominal variable.
b. Feature F1 is an example of an ordinal variable.
c. It doesn’t belong to any of the above categories.
d. Both of these

Correct Answer: b

Explanation: Ordinal variables are the variables that have some order in their categories.
For example, grade A should be considered a high grade than grade B.

_______________________________________________________________________
3. Suppose I have 10,000 emails in my mailbox out of which 200 are spams. The spam
detection system detects 150 emails as spams, out of which 50 are actually spam. What is
the precision and recall of my spam detection system?

a. Precision = 33.333%, Recall = 25%

b. Precision = 25%, Recall = 33.33%
c. Precision = 33.33%, Recall = 75%
d. Precision = 75%, Recall = 33.33%

Correct Answer: a

Explanation:

We know that,
𝑇𝑝
Precision =
𝑇𝑝+𝐹𝑝

50
=
150

= 33.333%
𝑇𝑝
Recall =
𝑇𝑝+𝐹𝑛

50
=
200

= 25%
___________________________________________________________________________

4. Which of the following statements describes what is most likely TRUE when the amount
of training data increases?

a. Training error usually decreases and generalization error usually increases.

b. Training error usually decreases and generalization error usually decreases.
c. Training error usually increases and generalization error usually decreases.
d. Training error usually increases and generalization error usually increases.

Correct Answer: a

Explanation: When the training data increases, the decision boundary becomes very complex
to fit the data. So, the generalization capability usually reduces with the increase in training
data.

___________________________________________________________________________
5. You trained a learning algorithm, and plot the learning curve. The following figure is
obtained.

The algorithm is suffering from

a. High bias
b. High variance
c. Neither

Correct Answer: a

Explanation: In the plot, the training error is increased with the training set size. The true error
is around 0.4 which is quite high. Thus, we can say that the bias is high.
___________________________________________________________________________
6. I am the marketing consultant of a leading e-commerce website. I have been given a task
of making a system that recommends products to users based on their activity on Facebook.
I realize that user interests could be highly variable. Hence, I decide to

T1) Cluster the users into communities of like-minded people and

T2) Train separate models for each community to predict which product category (e.g.,
electronic gadgets, cosmetics, etc.) would be the most relevant to that community.

The task T1 is a/an ______________ learning problem and T2 is a/an

________________ problem.
Choose from the options:
a. Supervised and unsupervised
b. Unsupervised and supervised
c. Supervised and supervised
d. Unsupervised and unsupervised

Correct Answer: b

Explanation: From the definition of supervised and unsupervised learning

___________________________________________________________________________

7. Select the correct equations.

TP - True Positive, TN - True Negative, FP - False Positive, FN - False Negative
𝑇𝑝
i. Precision =
𝑇𝑝+𝐹𝑝
𝐹𝑝
ii. Recall =
𝑇𝑝+𝐹𝑝
𝑇𝑝
iii. Recall =
𝑇𝑝+𝐹𝑛
𝑇𝑝+𝐹𝑛
iv. Accuracy=
𝑇𝑝+𝐹𝑝+𝑇𝑛+𝐹𝑛
a. i, iii, iv
b. i and iii
c. ii and iv
d. i, ii, iii, iv
Correct Answer: a
Explanation: From the definition of Precision, Recall, and Accuracy
_________________________________________________________________________

8. Which of the following tasks is NOT a suitable machine learning task(s)?

a. Finding the shortest path between a pair of nodes in a graph
b. Predicting if a stock price will rise or fall
c. Predicting the price of petroleum
d. Grouping mails as spams or non-spams
Correct Answer: a
Explanation: Finding the shortest path between a pair of nodes in a graph is not a suitable
machine-learning task because it falls under the category of graph algorithms and can be
efficiently solved using algorithms like Dijkstra's algorithm. Machine learning is typically used
for tasks that involve pattern recognition, prediction, or classification based on data. In this
case, the task of finding the shortest path in a graph is better suited for algorithmic or graph
theory-based approaches rather than machine learning.
__________________________________________________________________________

9. Which of the following is/are associated with overfitting in machine learning?

a. High bias
b. Low bias
c. Low variance
d. High variance
e. Good performance on training data
f. Poor performance on test data

Correct Answers: b, d, e, f
Explanation: Overfitting is characterized by good performance on the training data, as the
model has essentially memorized the data. However, it leads to poor performance on the
test data because the model fails to generalize well. Overfitting is associated with low bias
and high variance, meaning the model is sensitive to noise or fluctuations in the training
data.
________________________________________________________________________

10. Which of the following statements about cross-validation in machine learning is/are true?
a. Cross-validation is used to evaluate a model's performance on the training data.
b. Cross-validation guarantees that a model will generalize well to unseen data.
c. Cross-validation is only applicable to classification problems and not regression
problems.
d. Cross-validation helps in estimating the model's performance on unseen data by
simulating the test phase.
Correct Answer: d
Explanation: Cross-validation is a technique used in machine learning to assess the
performance and generalization ability of a model. It involves dividing the available labeled
data into multiple subsets or folds. The model is trained on a portion of the data (training set)
and evaluated on the remaining portion (validation or test set). By repeating this process with
different partitions of the data, cross-validation provides an estimate of the model's
performance on unseen data.
___________________________________________________________________________
11. What does k-fold cross-validation involve in machine learning?
a. Splitting the dataset into k equal-sized training and test sets.
b. Splitting the dataset into k unequal-sized training and test sets.
c. Partitioning the dataset into k subsets, and iteratively using each subset as a
validation set while the remaining k-1 subsets are used for training.
d. Dividing the dataset into k subsets, where each subset represents a unique class label
for classification tasks.
Correct Answer: c
Explanation: K-fold cross-validation involves dividing the dataset into k subsets or folds. The
process then iterates k times, where each time, one of the k subsets is used as the validation set,
while the remaining k-1 subsets are used for training the model. This ensures that each subset
is used as the validation set exactly once, and the model is trained and evaluated k times, with
each fold serving as the validation set once.
___________________________________________________________________________
12. What does the term "feature space" refer to in machine learning?
a. The space where the machine learning model is trained.
b. The space where the machine learning model is deployed.
c. The space which is formed by the input variables used in a machine learning model.
d. The space where the output predictions are made by a machine learning model.
Correct Answer: c
Explanation: The feature space in machine learning refers to the space formed by the input
variables or features used in a model. It represents the space where the data points reside.
___________________________________________________________________________

13. Which of the following statements is/are true regarding supervised and unsupervised
learning?
a. Supervised learning can handle both labeled and unlabeled data.
b. Unsupervised learning requires human experts to label the data.
c. Supervised learning can be used for regression and classification tasks.
d. Unsupervised learning aims to find hidden patterns in the data.
Correct Answers: c, d
Explanation:
Option “a” is incorrect. Supervised learning specifically requires labeled data, while
unsupervised learning deals with unlabeled data.
Option “b” is incorrect. Unsupervised learning does not require human experts to label the data;
it learns from the raw, unlabeled data.
Option “c” is correct. Supervised learning encompasses both regression, where the output
variable is continuous, and classification, where the output variable is categorical.
Option “d” is correct. Unsupervised learning aims to find hidden patterns, structures, or
relationships within the data without any prior knowledge of the output labels.
___________________________________________________________________________
14. One of the ways to mitigate overfitting is
a. By increasing the model complexity
b. By reducing the amount of training data
c. By adding more features to the model
d. By decreasing the model complexity

Correct Answer: d
Explanation: Overfitting occurs when a machine learning model performs well on the training
data but fails to generalize to new, unseen data. It usually happens when the model becomes
too complex and starts to memorize the training examples instead of learning the underlying
patterns. To mitigate overfitting, one of the effective approaches is to decrease the model
complexity.
__________________________________________________________________________

15. How many Boolean functions are possible with 𝑁 features?

𝑁
a. (22 )
b. (2𝑁 )
c. (𝑁 2 )
d. (4𝑁 )

Correct Answer: a
Explanation: Any variable ‘A’ can have 2 values (i.e., 0 or 1)
For ‘N’ variables there are 2N entries in the truth table.
And each output of any particular row in the truth table can be 0 or 1.
𝑁
Hence, we have (22 ) different Boolean functions with N variables.
___________________________________________________________________________

Introduction To Machine Learning Assignment-Week 4
No ratings yet
Introduction To Machine Learning Assignment-Week 4
5 pages
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
100% (1)
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
3 pages
Understanding Machine Learning Solution Manual: 2 Gentle Start
No ratings yet
Understanding Machine Learning Solution Manual: 2 Gentle Start
67 pages
Data Analytics With Python - Unit 13 - Week 11
No ratings yet
Data Analytics With Python - Unit 13 - Week 11
4 pages
2022 ML Assignments
No ratings yet
2022 ML Assignments
45 pages
2023 ML Assignment
No ratings yet
2023 ML Assignment
57 pages
Thank You For Taking The Week 3: Assignment 3. Week 3: Assignment 3
No ratings yet
Thank You For Taking The Week 3: Assignment 3. Week 3: Assignment 3
3 pages
Introduction To Machine Learning - Unit 3 - Week 1
No ratings yet
Introduction To Machine Learning - Unit 3 - Week 1
3 pages
Introduction To Machine Learning - Unit 4 - Week 2
100% (1)
Introduction To Machine Learning - Unit 4 - Week 2
3 pages
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 8 - Week 5
100% (1)
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 8 - Week 5
5 pages
ML Assignment 3
No ratings yet
ML Assignment 3
5 pages
Deep Learning
No ratings yet
Deep Learning
6 pages
Introduction To Machine Learning - Unit 3 - Week 1 - Non - Graded
No ratings yet
Introduction To Machine Learning - Unit 3 - Week 1 - Non - Graded
3 pages
Week3 Assignment
No ratings yet
Week3 Assignment
6 pages
Assignment 11
100% (1)
Assignment 11
4 pages
Assignment - Week 6 (Neural Networks) Type of Question: MCQ/MSQ
No ratings yet
Assignment - Week 6 (Neural Networks) Type of Question: MCQ/MSQ
4 pages
MCQ Question
No ratings yet
MCQ Question
5 pages
Assignment 6 (Sol.) : Introduction To Machine Learning Prof. B. Ravindran
No ratings yet
Assignment 6 (Sol.) : Introduction To Machine Learning Prof. B. Ravindran
10 pages
ML Ass 2
No ratings yet
ML Ass 2
6 pages
Assignment 6
No ratings yet
Assignment 6
2 pages
Deep Learning - IIT Ropar - Unit 4 - Week 1
No ratings yet
Deep Learning - IIT Ropar - Unit 4 - Week 1
8 pages
Assignment 4: Reinforcement Learning Prof. B. Ravindran
No ratings yet
Assignment 4: Reinforcement Learning Prof. B. Ravindran
4 pages
Deep Learning - IIT Ropar - Unit 6 - Week 4
No ratings yet
Deep Learning - IIT Ropar - Unit 6 - Week 4
5 pages
Deep Learning
No ratings yet
Deep Learning
2 pages
Assignment 4: Introduction To Machine Learning Prof. B. Ravindran
No ratings yet
Assignment 4: Introduction To Machine Learning Prof. B. Ravindran
2 pages
Assignment 5 (Sol.) : Reinforcement Learning
100% (1)
Assignment 5 (Sol.) : Reinforcement Learning
4 pages
Assignment 2
No ratings yet
Assignment 2
7 pages
DAA UNIT 4 - Final
No ratings yet
DAA UNIT 4 - Final
12 pages
Deep Learning - IIT Ropar - Unit 6 - Week 3
No ratings yet
Deep Learning - IIT Ropar - Unit 6 - Week 3
4 pages
Reinforcement Learning - Unit 6 - Week 4
No ratings yet
Reinforcement Learning - Unit 6 - Week 4
3 pages
Deep Learning - IIT Ropar - Unit 7 - Week 4
100% (1)
Deep Learning - IIT Ropar - Unit 7 - Week 4
5 pages
Deep Learning - IIT Ropar - Unit 4 - Week 1
No ratings yet
Deep Learning - IIT Ropar - Unit 4 - Week 1
5 pages
Introduction To Machine Learning - IITKGP - Unit 4 - Week 2
No ratings yet
Introduction To Machine Learning - IITKGP - Unit 4 - Week 2
5 pages
Assignment 1
No ratings yet
Assignment 1
7 pages
MCQ
No ratings yet
MCQ
4 pages
Assignment 7
No ratings yet
Assignment 7
3 pages
Introduction To Deep Learning - Assignment
No ratings yet
Introduction To Deep Learning - Assignment
4 pages
NPTEL Online Certification Courses Indian Institute of Technology Kharagpur
100% (1)
NPTEL Online Certification Courses Indian Institute of Technology Kharagpur
4 pages
PA12
100% (2)
PA12
3 pages
Machine Learning, ML Ass 5
No ratings yet
Machine Learning, ML Ass 5
6 pages
Natural Language Processing - Unit 10 - Week 8
No ratings yet
Natural Language Processing - Unit 10 - Week 8
6 pages
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
100% (2)
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
3 pages
Assignment 6 (COPY)
No ratings yet
Assignment 6 (COPY)
6 pages
Deep Learning - IIT Ropar - - Unit 8 - Week 5
No ratings yet
Deep Learning - IIT Ropar - - Unit 8 - Week 5
4 pages
Assignment Week 4-Deep-Learning PDF
100% (1)
Assignment Week 4-Deep-Learning PDF
7 pages
Introduction To Machine Learning - IITKGP - Unit 5 - Week 3
No ratings yet
Introduction To Machine Learning - IITKGP - Unit 5 - Week 3
4 pages
Machine Learning, ML Ass 6
No ratings yet
Machine Learning, ML Ass 6
11 pages
1000 Machine Learning MCQ (Multiple Choice Questions) - Sanfoundry
No ratings yet
1000 Machine Learning MCQ (Multiple Choice Questions) - Sanfoundry
16 pages
Week 7 Assignment 1
No ratings yet
Week 7 Assignment 1
6 pages
Optimal Binary Search Tree (OBST)
No ratings yet
Optimal Binary Search Tree (OBST)
104 pages
Deep Learning - IIT Ropar - Unit 3 - Week 1
100% (1)
Deep Learning - IIT Ropar - Unit 3 - Week 1
3 pages
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 6 - Week 3
No ratings yet
Artificial Intelligence - Knowledge Representation and Reasoning - Unit 6 - Week 3
5 pages
Machine Learning Unit 2 MCQ
No ratings yet
Machine Learning Unit 2 MCQ
17 pages
NLP Assignment-1 Solution
No ratings yet
NLP Assignment-1 Solution
4 pages
Deep Learning - IIT Ropar - Unit 12 - Week 9
No ratings yet
Deep Learning - IIT Ropar - Unit 12 - Week 9
4 pages
UNIT 1 Practice Quiz - MCQs - ML
100% (1)
UNIT 1 Practice Quiz - MCQs - ML
10 pages
Assignment 3: Introduction To Machine Learning Prof. B. Ravindran
No ratings yet
Assignment 3: Introduction To Machine Learning Prof. B. Ravindran
4 pages
Practice Assignment 4: Reinforcement Learning Prof. B. Ravindran
No ratings yet
Practice Assignment 4: Reinforcement Learning Prof. B. Ravindran
2 pages
CS230 Midterm Fall 2022
No ratings yet
CS230 Midterm Fall 2022
14 pages
Computational Machine Learning Mock Test
No ratings yet
Computational Machine Learning Mock Test
6 pages
Assignment 12-New
No ratings yet
Assignment 12-New
4 pages
Assignment Solutions 5
No ratings yet
Assignment Solutions 5
3 pages
Assignment 4
No ratings yet
Assignment 4
3 pages
SRP
No ratings yet
SRP
3 pages
Week 4 Solution PDS
No ratings yet
Week 4 Solution PDS
9 pages
Week 11
No ratings yet
Week 11
3 pages
Timetable Rprogramming
No ratings yet
Timetable Rprogramming
1 page
Week 9
No ratings yet
Week 9
3 pages
JP - IT-A 2023-Updated
No ratings yet
JP - IT-A 2023-Updated
11 pages
Statistics With R Programming
No ratings yet
Statistics With R Programming
2 pages
Programming For Problem Solving - 1
No ratings yet
Programming For Problem Solving - 1
10 pages
T-Sheet: I B.Tech I Semester (VR23) Regular Examinations, February 2024 Branch: IT 23NM1A1201 SGPA: 8.37 CGPA: 8.37
No ratings yet
T-Sheet: I B.Tech I Semester (VR23) Regular Examinations, February 2024 Branch: IT 23NM1A1201 SGPA: 8.37 CGPA: 8.37
21 pages
Internal Winners & Runners List, 25.1.24
No ratings yet
Internal Winners & Runners List, 25.1.24
10 pages
Outcome Based Pedagogic Principles For Effective Teaching NPTEL
100% (1)
Outcome Based Pedagogic Principles For Effective Teaching NPTEL
18 pages
eLMS Activity Dropbox ARG by Jep2
No ratings yet
eLMS Activity Dropbox ARG by Jep2
3 pages
Cambridge CELTA Language Analysis Sheet Example
No ratings yet
Cambridge CELTA Language Analysis Sheet Example
2 pages
F Chapter Iv
No ratings yet
F Chapter Iv
19 pages
What Is Language Acquisition
No ratings yet
What Is Language Acquisition
7 pages
Syllabus in Legal Techniques and Logic
No ratings yet
Syllabus in Legal Techniques and Logic
6 pages
ORT - GKA - L10 - The - Baby - Sitter - 20200403 - 200403220604
No ratings yet
ORT - GKA - L10 - The - Baby - Sitter - 20200403 - 200403220604
27 pages
Sources To Resistance of Change
100% (1)
Sources To Resistance of Change
3 pages
ZEAL EDUCATION SYSTEM USER GUIDE (3)
No ratings yet
ZEAL EDUCATION SYSTEM USER GUIDE (3)
21 pages
Study Guide RWS
No ratings yet
Study Guide RWS
2 pages
ภาพถ่ายหน้าจอ 2567-02-15 เวลา 14.44.53
No ratings yet
ภาพถ่ายหน้าจอ 2567-02-15 เวลา 14.44.53
13 pages
General Wriring Practice Test 2
No ratings yet
General Wriring Practice Test 2
4 pages
Synopsis: Hallowell's Childhood Roots of Adult Happiness
No ratings yet
Synopsis: Hallowell's Childhood Roots of Adult Happiness
13 pages
Maria Math Term III Comments
No ratings yet
Maria Math Term III Comments
10 pages
Reference Book For Special Education
100% (20)
Reference Book For Special Education
281 pages
Revealing Coranderrk
No ratings yet
Revealing Coranderrk
8 pages
PROJECT C Emotional Intelligence
No ratings yet
PROJECT C Emotional Intelligence
10 pages
English For Specific Purposes (ENGL 222) : Reporters: Janneth S. Estebes
No ratings yet
English For Specific Purposes (ENGL 222) : Reporters: Janneth S. Estebes
47 pages
Download Current Issues in Reading Writing and Visual Literacy Research and Practice 1st Edition Christina Gitsaki ebook All Chapters PDF
100% (2)
Download Current Issues in Reading Writing and Visual Literacy Research and Practice 1st Edition Christina Gitsaki ebook All Chapters PDF
71 pages
Curriculum Vitae New
No ratings yet
Curriculum Vitae New
2 pages
Terjemah Resume - Wahyudin.18574013.Analisi and Design - KPD.id - en
No ratings yet
Terjemah Resume - Wahyudin.18574013.Analisi and Design - KPD.id - en
4 pages
Year 6 PBL Food Preservation
No ratings yet
Year 6 PBL Food Preservation
4 pages
Schafer Group Lesson Plan
No ratings yet
Schafer Group Lesson Plan
4 pages
The Effects of Brand Name Suggestiveness On Advertising Recall.
No ratings yet
The Effects of Brand Name Suggestiveness On Advertising Recall.
11 pages
Relationships Between Critical Thinking & Creative Thinking
100% (19)
Relationships Between Critical Thinking & Creative Thinking
13 pages
Doing Philosophy 1 2
No ratings yet
Doing Philosophy 1 2
15 pages
The Application of Interactive Games in Enhancing Kindergarten’s Level of Letter Recognition
No ratings yet
The Application of Interactive Games in Enhancing Kindergarten’s Level of Letter Recognition
14 pages
Twinkle: Teacher'S Book A1
No ratings yet
Twinkle: Teacher'S Book A1
7 pages
Knowledge of Language in Action
No ratings yet
Knowledge of Language in Action
23 pages
Lesson Plan For 5/C
No ratings yet
Lesson Plan For 5/C
20 pages
Focus On The Learner
100% (1)
Focus On The Learner
7 pages

IML-IITKGP - Assignment 1 Solution

Uploaded by

IML-IITKGP - Assignment 1 Solution

Uploaded by

Introduction to Machine Learning -IITKGP

1. Which of the following is/are classification tasks?

a. Find the gender of a person by analyzing his writing style

a. Precision = 33.333%, Recall = 25%

a. Training error usually decreases and generalization error usually increases.

The algorithm is suffering from

T1) Cluster the users into communities of like-minded people and

The task T1 is a/an ______________ learning problem and T2 is a/an

Explanation: From the definition of supervised and unsupervised learning

7. Select the correct equations.

8. Which of the following tasks is NOT a suitable machine learning task(s)?

9. Which of the following is/are associated with overfitting in machine learning?

15. How many Boolean functions are possible with 𝑁 features?

You might also like