Mock Exam Paper

CM2604: Machine Learning

School: School of Computing
Course: BSc (Hons) Artificial Intelligence and Data Science
Stage: Year 2
Academic Year: 2023/2024 (Mock Exam)
Semester: Semester 2
Date: TBD
Start Time: TBD
End Time: TBD
Duration: 2 Hours
No. of Pages (including cover sheet): 4

Instructions to Candidates

You are advised (but not required) to spend the first ten minutes of the examination
reading the questions and planning how you will answer those you have selected.

This examination paper comprises 4 questions. Answer all questions.

THIS PAPER MUST NOT BE TAKEN OUT OF THE EXAMINATION ROOM


DO NOT TURN OVER THIS PAGE UNTIL THE INVIGILATOR INSTRUCTS YOU TO DO SO

Special Stationery (if applicable)


You may make use of a non-programmable calculator in this exam.
Module Code: CM2604 Exam Period: TBD
Module Title: Machine Learning

QUESTION 1 (Total 25 marks)

1. Briefly explain the importance of dimensionality reduction in machine learning based
tasks.
(5 marks)

2. You’re training an ensemble model and notice that the validation error is significantly
lower than the training error. Name two possible reasons for this to happen.

(5 marks)
3. Give one similarity and one difference between feature selection and principal
component analysis (PCA).
(6 marks)
4. Which of the following tends to work best on small datasets? Explain briefly.
a. Logistic Regression
b. K-nearest neighbor
(9 marks)
QUESTION 2 (Total 25 marks)

1. Regression can be performed with categorical and continuous variables. Briefly explain.

(6 marks)
2. Both PCA and linear regression can be thought of as algorithms for minimizing a sum of
squared errors. Explain which error is being minimized in each algorithm.
(4 marks)
3. Explain what effect the following operations have on the bias and variance of your
model.
a. Regularizing the weights in a linear regression model
b. Regularizing the weights in a logistic regression model
c. Pruning a decision tree
(9 marks)
4. List three potential real-world scenarios where logistic regression has been applied.
(6 marks)

QUESTION 3 (Total 25 marks)

1. Suppose we are using a linear SVM with some large C value and are given the following
dataset. Draw the decision boundary of linear SVM. Provide a brief explanation.

(6 marks)

2. The following dataset is used to learn a decision tree which predicts if people pass
machine learning (Yes or No), based on their previous GPA (High, Medium, or Low) and
whether or not they studied. Draw the full decision tree that would be learned for this
dataset.

GPA   Studied   Passed
L     F         F
L     T         T
M     F         F
M     T         T
H     F         T
H     T         T

(12 marks)
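
The split choice for the table above can be checked numerically. The following is a minimal sketch that assumes an ID3-style learner using entropy-based information gain; the paper does not state which splitting criterion is intended, and the helper names (entropy, information_gain) are illustrative only.

```python
from collections import Counter
from math import log2

# Rows copied from the table above: (GPA, Studied, Passed).
rows = [
    ("L", "F", "F"),
    ("L", "T", "T"),
    ("M", "F", "F"),
    ("M", "T", "T"),
    ("H", "F", "T"),
    ("H", "T", "T"),
]

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def information_gain(rows, col):
    """Entropy reduction obtained by splitting on column index `col`.

    Assumption: the last element of each row is the target (Passed).
    """
    base = entropy([r[-1] for r in rows])
    groups = {}
    for r in rows:
        groups.setdefault(r[col], []).append(r[-1])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder

gains = {name: information_gain(rows, i) for i, name in enumerate(["GPA", "Studied"])}
for name, gain in gains.items():
    print(f"{name}: information gain = {gain:.4f}")
```

Under this assumed criterion, the attribute with the larger gain would be placed at the root, and the same calculation would be repeated on each impure branch.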

3. What is the purpose of the Apriori property?


(3 marks)

4. Briefly explain the use of the Apriori function in association mining using an example.
(4 marks)

QUESTION 4 (Total 25 marks)

1. What is data imbalance in machine learning? What practical strategies can be proposed
for mitigating the data imbalance scenario in machine learning? Briefly discuss.
(6 marks)

2. Suppose you are going to release a new dataset by applying resampling strategies to an
existing dataset that suffers from class imbalance. Critically discuss the
ethical impact of this process.
(4 marks)

3. Compare and contrast boosting and stacking in machine learning.


(6 marks)

4. Consider the following confusion table.

                          Predicted Class
Corpus                 Tree    Bush    Grass
Actual Class   Tree      8       2       6
               Bush      6       7       5
               Grass    10       8       4

Determine the Precision, Recall and F1 Score.


(9 marks)
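
The arithmetic behind these metrics can be sketched as follows. This assumes a one-vs-rest (per-class) reading of the question, with rows as actual classes and columns as predicted classes as the table is labelled; the paper does not say whether per-class or averaged scores are expected, and the function name per_class_metrics is illustrative only.

```python
# Confusion table from the question: rows = actual class, columns = predicted class.
labels = ["Tree", "Bush", "Grass"]
confusion = [
    [8, 2, 6],   # actual Tree
    [6, 7, 5],   # actual Bush
    [10, 8, 4],  # actual Grass
]

def per_class_metrics(matrix, labels):
    """Return {label: (precision, recall, f1)} using one-vs-rest counts."""
    metrics = {}
    for k, label in enumerate(labels):
        true_positive = matrix[k][k]
        predicted_k = sum(row[k] for row in matrix)  # column total: TP + FP
        actual_k = sum(matrix[k])                    # row total: TP + FN
        precision = true_positive / predicted_k if predicted_k else 0.0
        recall = true_positive / actual_k if actual_k else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[label] = (precision, recall, f1)
    return metrics

metrics = per_class_metrics(confusion, labels)
for label, (p, r, f1) in metrics.items():
    print(f"{label}: precision={p:.3f} recall={r:.3f} f1={f1:.3f}")
```

Precision for a class divides the diagonal entry by its column total, recall divides it by its row total, and F1 is the harmonic mean of the two.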

END OF QUESTION PAPER
