Cos4852 2023 Assignment 1

This document provides the questions for Assignment 1 for the 2023 Machine Learning course COS4852. It includes 6 questions assessing students' understanding of key machine learning concepts from assigned reading materials. Students are asked to summarize chapters from two textbooks, explain the k-nearest neighbor algorithm, analyze version spaces and hypotheses for a given dataset, represent boolean functions as binary decision trees, and use the ID3 algorithm to construct a decision tree for a given truth table. Formulas, examples and step-by-step working must be shown.

Uploaded by

Thabang Thema

COS4852/A1/0/2023

Tutorial Letter A1/0/2023

Machine Learning
COS4852

Year module

Department of Computer Science

School of Computing

CONTENTS

This document contains the questions for Assignment 1 for COS4852 for 2023.

University of South Africa
Define tomorrow.
CONTENTS

1 INTRODUCTION

2 Assignment 1


LIST OF FIGURES

1 Instance space with positive and negative instances.

2 Instance space with a donut hypothesis $h \leftarrow \langle 2, 5 \rangle$.

LIST OF TABLES

1 Truth table for f5.


1 INTRODUCTION

This document discusses the questions in Assignment 1 for COS4852 for 2023.

Each question (except Q1 = 10 marks) will be assigned a mark out of 100 and the total mark for the
assignment is then calculated out of (10 + (5 × 100)) = 510.

When we mark the question we want to see that YOU understand the work. Simply copying or
regurgitating other people's work (from the web, previous solutions, other students' work) does not
show that YOU understand the work. Show ALL your assumptions, definitions, variables, and full
calculations.

2 Assignment 1

Question 1

Find and download the following online textbooks on Machine Learning:

• Introduction to Machine Learning, Nils J. Nilsson, 1998.

• A first encounter with Machine Learning, Max Welling, 2011.

Give the complete URL where you found these textbooks, as well as the file size of the PDF you’ve
downloaded.

10 marks for complete and correct URL and size

Question 2

Read Nilsson’s book, Chapter 2. Summarise the chapter in 2-4 pages in such a way that you can
show that you thoroughly understand the concepts described there. Use different example functions
from the ones in the book to show that you understand the concepts.

Mark out of 100.

40 or less for clear indication that student does not understand the topic or evidence of plagiarism
50 for a fair understanding
60-70 for understanding and clear well defined examples
80+ for exceptional detail


Question 3

Read Chapter 5 of Welling’s book. Do some research on the k-nearest neighbour classification
algorithm and write a 2-page report on how the algorithm works. Your report should include a
detailed example, with all calculations shown.
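
As a starting point for the kind of calculation the report should show, here is a minimal sketch of k-nearest neighbour classification. It assumes Euclidean distance and a simple majority vote; the function name `knn_classify` and the six training points are hypothetical and are not taken from Welling's book or from this assignment.

```python
from collections import Counter
from math import dist  # Euclidean distance, available from Python 3.8


def knn_classify(train, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Sort the labelled points by distance to the query and keep the k closest.
    neighbours = sorted(train, key=lambda item: dist(item[0], query))[:k]
    # Count the labels of those neighbours and return the most frequent one.
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Hypothetical training set: (point, label) pairs forming two clusters.
train = [
    ((1, 1), "red"), ((2, 1), "red"), ((1, 2), "red"),
    ((6, 6), "blue"), ((7, 5), "blue"), ((6, 7), "blue"),
]

prediction = knn_classify(train, (2, 2), k=3)
print(prediction)
```

A written example should still list each distance by hand; the sketch only mirrors the steps (compute distances, pick the k closest, vote). Ties between labels are not handled here and would need an explicit rule.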

Mark out of 100.

Question 4

Let X be an instance space consisting of points in the Euclidean plane with integer coordinates (x, y),
with positive and negative instances as shown in Figure 1.

Positive instances: (5, 5), (−6, 4), (−3, −4), (2, −4)
Negative instances: (−1, 2), (−2, 0), (6, 7), (8, −8)

[Plot: x and y axes, each ranging from −10 to 10]

Figure 1: Instance space with positive and negative instances.

Let H be the set of hypotheses consisting of origin-centered donuts. Formally, the donut hypothesis
has the form $h \leftarrow \langle a < \sqrt{x^2 + y^2} < b \rangle$, where $a < b$ and $a, b \in \mathbb{Z}$ ($\mathbb{Z}$ is the set of non-negative
integers, {0, 1, 2, 3, ...}). This can be shortened to $h \leftarrow \langle a, b \rangle$.

An example of a donut hypothesis is $h \leftarrow \langle 2, 5 \rangle$, shown in Figure 2. Notice that this hypothesis
does not explain the data correctly: there are both positive and negative instances inside the
donut, and the donut contains neither all the positive nor all the negative instances exclusively.
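
The claim about the example hypothesis can be checked mechanically. The sketch below assumes the strict inequalities of the donut definition and compares squared radii so that only integer arithmetic is needed (valid because a and b are non-negative). The helper names `covers` and `is_consistent` are illustrative only; the instance lists are those of Figure 1.

```python
# Instances from Figure 1.
POSITIVE = [(5, 5), (-6, 4), (-3, -4), (2, -4)]
NEGATIVE = [(-1, 2), (-2, 0), (6, 7), (8, -8)]


def covers(a, b, point):
    """True if the point lies strictly inside the donut: a < sqrt(x^2 + y^2) < b."""
    x, y = point
    r_squared = x * x + y * y
    # Comparing squares avoids floating-point square roots.
    return a * a < r_squared < b * b


def is_consistent(a, b):
    """True if the donut covers every positive and no negative instance."""
    return (all(covers(a, b, p) for p in POSITIVE)
            and not any(covers(a, b, n) for n in NEGATIVE))


# Which instances fall inside the example donut with a = 2, b = 5?
inside_pos = [p for p in POSITIVE if covers(2, 5, p)]
inside_neg = [n for n in NEGATIVE if covers(2, 5, n)]
print(inside_pos, inside_neg, is_consistent(2, 5))
```

Points that lie exactly on a boundary circle, such as (−3, −4) at radius 5, are treated as outside the donut because the definition uses strict inequalities.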

(a) What is the S-boundary set of the given version space? Write out the hypotheses in the form
given above and draw them.
(b) What is the G-boundary set of the given version space? Write out the hypotheses in the form
given above and draw them.
(c) Suppose that the learner now suggests a new (x, y) instance and asks the trainer for its
classification. Suggest a query guaranteed to reduce the size of the version space, regardless
of how the trainer classifies it. Suggest one that will not reduce the size of the version space,
regardless of how the trainer classifies it. Explain why in each case.


[Plot: x and y axes, each ranging from −10 to 10]

Figure 2: Instance space with a donut hypothesis $h \leftarrow \langle 2, 5 \rangle$.

(d) The donuts are one of many possible hypothesis spaces that could explain this data set.
Propose one alternative hypothesis space and explicitly define its parameters as was done
using a and b for the donuts. Choose an instance from your hypothesis space that separates
the given data. Write out this hypothesis and sketch it.

Here are some resources you could consult on this topic:

• http://cse-wiki.unl.edu/wiki/index.php/Concept_Learning_and_the_General-to-Specific_Ordering

• http://www.cs.northwestern.edu/~pardo/courses/mmml/lectures/NU%20EECS%20349%20Fall%2009%20topic%201%20-%20version%20spaces.pdf

• http://www.ccs.neu.edu/home/rjw/csg220/lectures/version-spaces.pdf

Mark out of 100.

40 or less for clear indication that student does not understand the topic or evidence of plagiarism,
or answers are correct, but have not shown complete workings
50 correct and sufficient workings
60-70 correct and complete workings
80+ indicating thorough understanding of the work


Question 5

Give binary decision trees to represent the following Boolean functions:

(a) f1 (A, B) = ¬A ∧ B

(b) f2 (A, B, C) = [A ∧ B] ∨ C

(d) f4 (A, B, C, D) = [A ∨ B] ∧ [C ∨ D]

Remember that there is a difference between a graph and a tree.

Read: https://www.geeksforgeeks.org/difference-between-graph-and-tree/

The symbol ⊻ represents the Boolean operator for XOR (exclusive-or). For this exercise you do not
need to do the Gain or Entropy calculations. There is a direct mapping between a Boolean function
and its corresponding binary decision tree. The binary decision tree can usually be simplified as
well to produce a simpler, more compact tree. Do not just write down the final, simplified tree. Show
how you do the simplification.
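
One way to picture the mapping and the simplification step is sketched below. It assumes a fixed variable order, represents a node as a tuple `(variable, false_branch, true_branch)`, and merges any node whose two branches are identical. The function `g(A, B) = A ∨ B` is a hypothetical example chosen so that it is not one of the functions asked for above; `full_tree` and `simplify` are illustrative names.

```python
def full_tree(f, variables, assignment=None):
    """Build the complete binary decision tree of f, testing variables in order."""
    assignment = assignment or {}
    if not variables:
        # All variables are fixed: the leaf is the function value.
        return f(**assignment)
    var, rest = variables[0], variables[1:]
    return (
        var,
        full_tree(f, rest, {**assignment, var: False}),  # branch for var = False
        full_tree(f, rest, {**assignment, var: True}),   # branch for var = True
    )


def simplify(tree):
    """Replace any node whose two branches are identical by that branch."""
    if isinstance(tree, bool):
        return tree
    var, low, high = tree
    low, high = simplify(low), simplify(high)
    return low if low == high else (var, low, high)


def g(A, B):
    """Hypothetical example function: A OR B."""
    return A or B


full = full_tree(g, ["A", "B"])
small = simplify(full)
print(full)
print(small)
```

The full tree tests B on both sides of A; after simplification the A = True side collapses to a single leaf because both of its B-branches agree. A different variable order can give a different simplified tree.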

Here are resources you could consult on this topic - they are also a good introduction to material for
the next question:

• https://www.cs.cmu.edu/~fp/courses/15122-f10/lectures/19-bdds.pdf

• http://cs.nyu.edu/~dsontag/courses/ml12/slides/lecture11.pdf

Mark out of 100.

Question 6

Use the ID3 algorithm to construct a decision tree for the data in Table 1. Show all your calculations,
including all the steps of the Gain and Entropy calculations. Show the formulas that you use. Clearly
explain your choices.

A B C D f5
F F F F no
F F F T no
F F T F no
F F T T no
F T F F no
F T F T yes
F T T F yes
F T T T yes
T F F F no
T F F T yes
T F T F yes
T F T T yes
T T F F no
T T F T yes
T T T F yes
T T T T yes

Table 1: Truth table for f5.
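
The Entropy and Gain calculations that ID3 relies on can be sketched as follows, using the standard definitions Entropy(S) = −Σ p_i log2 p_i and Gain(S, X) = Entropy(S) − Σ_v (|S_v| / |S|) Entropy(S_v). The code transcribes Table 1 and evaluates the gain of each attribute at the root only; the names `entropy` and `gain` are illustrative, and this is a check on hand calculations, not a full ID3 implementation.

```python
from collections import Counter
from math import log2

ATTRIBUTES = ["A", "B", "C", "D"]

# Table 1, one row per line: A B C D f5.
TABLE = """\
F F F F no
F F F T no
F F T F no
F F T T no
F T F F no
F T F T yes
F T T F yes
F T T T yes
T F F F no
T F F T yes
T F T F yes
T F T T yes
T T F F no
T T F T yes
T T T F yes
T T T T yes"""

rows = [line.split() for line in TABLE.splitlines()]


def entropy(labels):
    """Shannon entropy (base 2) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def gain(rows, index):
    """Information gain from splitting `rows` on the attribute in column `index`."""
    remainder = 0.0
    for value in {row[index] for row in rows}:
        subset = [row[-1] for row in rows if row[index] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy([row[-1] for row in rows]) - remainder


root_entropy = entropy([row[-1] for row in rows])
gains = {name: gain(rows, i) for i, name in enumerate(ATTRIBUTES)}
print(round(root_entropy, 4), {k: round(v, 4) for k, v in gains.items()})
```

To continue the tree, the same `gain` function can be applied to the subset of rows on each branch, leaving out the attribute that was already used.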

Here are some resources you could consult on this topic (focus on the ID3 algorithm):

• https://medium.com/deep-math-machine-learning-ai/chapter-4-decision-trees-algorithms-

• https://www.cise.ufl.edu/~ddd/cap6635/Fall-97/Short-papers/2.htm

• http://www.ke.tu-darmstadt.de/lehre/archiv/ws0809/mldm/dt.pdf

• https://cis.temple.edu/~ingargio/cis587/readings/id3-c45.html

Mark out of 100.


Total marks out of 510.
