
ISYE 6740, Fall 2024, Homework 4

100 points

Prof. Yao Xie

1. Optimization (35 points).


Consider a simplified logistic regression problem. Given m training samples (x^i, y^i), i = 1, …, m, where the data x^i ∈ R and y^i ∈ {0, 1}, we fit a logistic regression model for classification by solving the following optimization problem, where θ ∈ R is the parameter we aim to find:

max_θ ℓ(θ),    (1)

where the log-likelihood function is

ℓ(θ) = Σ_{i=1}^{m} [ −log(1 + exp{−θ^T x^i}) + (y^i − 1) θ^T x^i ].
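(For reference, the following is a minimal Python sketch, with hypothetical toy data, that evaluates ℓ(θ) numerically and estimates its gradient by finite differences; graders may find such a check useful for verifying a submitted gradient derivation. It is not part of the assignment.)

import numpy as np

def log_likelihood(theta, x, y):
    # ℓ(θ) = Σ_i [ -log(1 + exp(-θ x^i)) + (y^i - 1) θ x^i ], with scalar θ
    z = theta * x
    return np.sum(-np.log1p(np.exp(-z)) + (y - 1) * z)

def numerical_gradient(theta, x, y, eps=1e-6):
    # central finite difference; can be compared against a hand-derived gradient
    return (log_likelihood(theta + eps, x, y) -
            log_likelihood(theta - eps, x, y)) / (2 * eps)

# hypothetical toy data for illustration only
x = np.array([0.5, -1.2, 2.0, 0.3])
y = np.array([1, 0, 1, 0])
print(numerical_gradient(0.1, x, y))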

1. (10 points) Show a step-by-step mathematical derivation of the gradient of the objective
function ℓ(θ) in (1).
Rubric:
Any reasonable attempt 2pt
Correct gradient 8pts
Partial credit may be awarded as appropriate.

2. (5 points) Write pseudo-code for performing gradient descent to find the optimizer
θ∗. This is essentially what the training procedure does.
Rubric:
Any reasonable attempt 1pt
Reasonable pseudo-code of gradient descent 4pts
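(For graders' reference, a minimal sketch of the kind of loop being asked for; grad_l is a hypothetical helper returning the gradient derived in part 1, and the step size and stopping rule are placeholder choices.)

def gradient_ascent(grad_l, theta0=0.0, eta=0.01, tol=1e-6, max_iter=10000):
    # we maximize ℓ(θ), so we step along the gradient (equivalently,
    # gradient descent on -ℓ); eta and tol are placeholder choices
    theta = theta0
    for _ in range(max_iter):
        theta_new = theta + eta * grad_l(theta)
        if abs(theta_new - theta) < tol:  # stop when the update is tiny
            return theta_new
        theta = theta_new
    return theta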

3. (5 points) Write pseudo-code for the stochastic gradient descent algorithm for training
the logistic regression problem (1). Please explain the difference between gradient
descent and stochastic gradient descent for training logistic regression.
Rubric:
Reasonable pseudo-code of stochastic gradient descent 3pts
Reasonable explanation of the difference 2pts
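(Again for reference, a minimal sketch of a stochastic variant; grad_l_i is a hypothetical helper returning the i-th sample's contribution to the gradient. The key difference from gradient descent is that each update touches a single sample rather than the full sum over all m samples.)

import random

def stochastic_gradient_ascent(grad_l_i, m, theta0=0.0, eta=0.01, n_epochs=100):
    # grad_l_i(theta, i): gradient of the i-th sample's term of ℓ (hypothetical helper)
    theta = theta0
    for _ in range(n_epochs):
        for i in random.sample(range(m), m):   # visit samples in a random order
            theta += eta * grad_l_i(theta, i)  # update using one sample at a time
    return theta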

4. (15 points) We will show that the training problem in the basic logistic regression
problem is concave. Derive the Hessian matrix of ℓ(θ) and, based on this, show that the
training problem (1) is concave. Explain why the problem can be solved efficiently and
gradient descent will achieve a unique global optimizer, as we discussed in class.

Rubric:
Correct Hessian expression 10pts
Reasonable attempt 2pt
Reasonable explanation 3pts
The explanation should contain some comments on why the problem can be solved efficiently
and why gradient descent will achieve a unique global optimizer, as stated in the question.
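(A quick numerical sanity check, not a proof: reusing the log_likelihood helper and toy data from the sketch in the problem statement above, the finite-difference estimate of d²ℓ/dθ² should be non-positive at any θ if ℓ is concave. Here θ is scalar, so the Hessian is 1×1.)

def second_derivative(theta, x, y, eps=1e-4):
    # central finite-difference estimate of d²ℓ/dθ²
    return (log_likelihood(theta + eps, x, y)
            - 2 * log_likelihood(theta, x, y)
            + log_likelihood(theta - eps, x, y)) / eps**2

for t in [-2.0, 0.0, 1.5]:
    assert second_derivative(t, x, y) <= 0.0  # concavity implies a non-positive value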

2. Bayes Classifier for spam filtering (35 points)


In this problem, we will use the Bayes classifier algorithm to fit a spam filter by hand. This
will enhance your understanding of the Bayes classifier and build intuition. This question
does not involve any programming, only derivation and hand calculation. Tools can be
used (Python, Excel, etc.), but all calculations and derivations must still be provided in
your report.
Spam filters are used in all email services to classify received emails as “Spam” or “Not
Spam”. A simple approach involves maintaining a vocabulary of words that commonly
occur in “Spam” emails and classifying an email as “Spam” if the number of words from
the dictionary that are present in the email is over a certain threshold. We are given a
vocabulary consisting of 15 words

V = {free, money, no, cap, crypto, real, cash, prize, for, you, big, chance, pizza, is, vibe}.

We will use V_i to represent the i-th word in V. As our training dataset, we are also given 3
example spam messages,

• free money no cap

• crypto real money

• real cash prize money for you

and 4 example non-spam messages

• big free chance

• no cash no pizza

• money money money

• pizza is big vibe for real

Recall that the Naive Bayes classifier assumes the probability of an input depends on its input features. The feature vector for each sample is defined as x^(i) = [x_1^(i), x_2^(i), …, x_d^(i)]^T, i = 1, …, m, and the class of the i-th sample is y^(i). In our case the length of the input vector is d = 15, which is equal to the number of words in the vocabulary V. Each entry x_j^(i) is equal to the number of times word V_j occurs in the i-th message.

1. (5 points) Calculate the class priors P(y = 0) and P(y = 1) from the training data, where
y = 0 corresponds to spam messages and y = 1 corresponds to non-spam messages.
Note that these class priors essentially correspond to the frequency of each class in the
training sample. Write down the feature vector for each spam and non-spam message.

Rubric:
Each correct prior, 1pt, total 2pts
Each correct feature vector, 0.5pt, total 3pts
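(A small Python sketch, for checking purposes only, that tabulates the feature vectors and class frequencies from the seven training messages; the written answer should still show the counts by hand.)

V = ["free", "money", "no", "cap", "crypto", "real", "cash", "prize",
     "for", "you", "big", "chance", "pizza", "is", "vibe"]

spam = ["free money no cap",
        "crypto real money",
        "real cash prize money for you"]
non_spam = ["big free chance",
            "no cash no pizza",
            "money money money",
            "pizza is big vibe for real"]

def feature_vector(msg):
    # x_j = number of times word V_j occurs in the message
    words = msg.split()
    return [words.count(w) for w in V]

m = len(spam) + len(non_spam)  # m = 7 training messages
print("P(y=0) =", len(spam) / m, "  P(y=1) =", len(non_spam) / m)
for msg in spam + non_spam:
    print(feature_vector(msg))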

2. (15 points) Assuming the keywords follow a multinomial distribution, the likelihood of
a sentence with its feature vector x given a class c is given by

P(x | y = c) = (n! / (x_1! ⋯ x_d!)) ∏_{k=1}^{d} θ_{c,k}^{x_k},    c ∈ {0, 1},

where n = x_1 + ⋯ + x_d and 0 ≤ θ_{c,k} ≤ 1 is the probability of word k appearing in class c, which satisfies

Σ_{k=1}^{d} θ_{c,k} = 1,    c ∈ {0, 1}.

Given this, the complete log-likelihood function for our training data is given by

ℓ(θ_{0,1}, …, θ_{0,d}, θ_{1,1}, …, θ_{1,d}) = Σ_{i=1}^{m} Σ_{k=1}^{d} x_k^(i) log θ_{y^(i),k}.

(In this example, m = 7.) Calculate the maximum likelihood estimates of θ_{0,1}, θ_{0,6},
θ_{1,2}, θ_{1,15} by maximizing the log-likelihood function above.
(Hint: We are solving a constrained maximization problem, so you will need to introduce
Lagrange multipliers and consider the Lagrangian function.)

Rubric:
11 points for a proper Lagrangian definition. Partial credit may be given, but both
Lagrange multipliers must be present, similar to the solution, for full credit to be
received. Each correct MLE requested, 1pt each, 4 pts total.
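(Assuming the Lagrangian derivation yields the usual relative-frequency estimates for a multinomial likelihood, i.e. θ_{c,k} = (count of word V_k in class c) / (total word count in class c), the requested values can be cross-checked with a short script reusing V, spam, and non_spam from the sketch above. The written Lagrangian derivation is still required for credit.)

from collections import Counter

def mle_thetas(messages):
    # θ_{c,k} = (count of word V_k in class c) / (total word count in class c)
    counts = Counter(w for msg in messages for w in msg.split())
    total = sum(counts[w] for w in V)
    return [counts[w] / total for w in V]

theta_spam = mle_thetas(spam)          # θ_{0,k}, k = 1, ..., 15
theta_nonspam = mle_thetas(non_spam)   # θ_{1,k}
# the four MLEs requested: θ_{0,1}, θ_{0,6}, θ_{1,2}, θ_{1,15}
print(theta_spam[0], theta_spam[5], theta_nonspam[1], theta_nonspam[14])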

3. (15 points) Given a test message "money for real", using the Naive Bayes classifier that
you have trained in Parts 1-2, calculate the posterior and decide whether it is
spam or not spam. Derivations must be shown in your report.

Rubric:
Reasonable posterior results 9pts
Correct classification result 6pts
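(A sketch a grader might use to double-check the comparison, reusing feature_vector, the priors, and the θ estimates from the sketches above; the multinomial coefficient is identical for both classes and is dropped, and the written posterior calculation is still required.)

import math

def class_score(msg, prior, thetas):
    # log P(y=c) + Σ_k x_k log θ_{c,k}, dropping class-independent terms
    x = feature_vector(msg)
    return math.log(prior) + sum(xk * math.log(t)
                                 for xk, t in zip(x, thetas) if xk > 0)

test = "money for real"
s0 = class_score(test, len(spam) / m, theta_spam)         # spam, y = 0
s1 = class_score(test, len(non_spam) / m, theta_nonspam)  # non-spam, y = 1
print("spam" if s0 > s1 else "not spam")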

3. Comparing classifiers: Divorce classification/prediction (30 points)


In lectures, we learned about different classifiers. This question compares them on real data.
Python users, please feel free to use scikit-learn, which is a commonly used and powerful
Python library with various machine learning tools. But you can also use other similar
libraries in other languages of your choice to perform the tasks.
This dataset is about participants who completed a personal information form and a divorce predictors scale. The data is a modified version of the dataset publicly available at https://archive.ics.uci.edu/ml/datasets/Divorce+Predictors+data+set (noise has been injected, so you will not get exactly the same results as on the UCI website). The dataset marriage.csv is contained in the homework folder. There are 170 participants and 54 attributes (or predictor variables), all real-valued. The last column of the CSV file is the label y (1 means "divorce", 0 means "no divorce"). Each column is one feature (predictor variable), and each row is a sample (participant). A detailed explanation of each feature (predictor variable) can be found at the website link above. Our goal is to build a classifier using training data, such that given a test sample, we can classify (or essentially predict) whether its label is 0 ("no divorce") or 1 ("divorce").
We are going to compare the following classifiers: Naive Bayes, Logistic Regression,
and KNN. Use the first 80% of the data for training and the remaining 20% for testing. If you
use scikit-learn, you can use train_test_split to split the dataset.
Remark: Please note that, here, for Naive Bayes, this means that we have to estimate the
variance of each individual feature from the training data. When estimating the variance, if the
variance is zero or close to zero (meaning that there is very little variability in the feature),
you can set the variance to be a small number, e.g., ϵ = 10^{-3}. We do not want to
include zero or near-zero variances in Naive Bayes. This tip holds for both Part One and Part
Two of this question.
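(A minimal scikit-learn sketch of the intended workflow; it assumes marriage.csv has no header row and that the 54 feature columns are followed by the label in the last column. GaussianNB's var_smoothing is one way to guard against near-zero variances, in the spirit of the remark above; hyperparameters such as the number of neighbors are placeholder choices.)

import pandas as pd
from sklearn.naive_bayes import GaussianNB
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import KNeighborsClassifier

# assumed layout: no header row, 54 feature columns, label in the last column
data = pd.read_csv("marriage.csv", header=None).values
X, y = data[:, :-1], data[:, -1]

# first 80% of the rows for training, remaining 20% for testing
split = int(0.8 * len(X))
X_train, X_test = X[:split], X[split:]
y_train, y_test = y[:split], y[split:]

classifiers = {
    # var_smoothing adds a fraction of the largest feature variance to every
    # variance, preventing zero or near-zero variances
    "Naive Bayes": GaussianNB(var_smoothing=1e-3),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "KNN": KNeighborsClassifier(n_neighbors=5),
}
for name, clf in classifiers.items():
    clf.fit(X_train, y_train)
    print(name, "test accuracy:", clf.score(X_test, y_test))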

1. (15 points) Report the testing accuracy for each of the three classifiers. Comment on their
performance: which performs the best, and make a guess as to why it performs the best in
this setting.

Rubric:
Classification accuracy above 90%: 4pts per classifier, 12pts total
Reasonable explanations 3pts

2. (15 points) Now perform PCA to project the data into two-dimensional space. Build
the classifiers (Naive Bayes, Logistic Regression, and KNN) using the two-dimensional
PCA results. Plot the data points and the decision boundary of each classifier
in the two-dimensional space. Comment on the differences between the decision
boundaries of the three classifiers. Please clearly represent the data points with different
labels using different colors.

Rubric:
Each reasonable decision boundary 4pts, 12pts total
Reasonable explanations 3pts
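(A plotting sketch for Part 2, reusing X_train, y_train, and classifiers from the sketch in Part 1; the grid resolution and figure layout are arbitrary choices, and whether to fit PCA on the training split only is left to the student. Here it is fit on the training split.)

import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

# project the data into two dimensions
pca = PCA(n_components=2)
X2_train = pca.fit_transform(X_train)

# grid over the 2-D plane for drawing each decision boundary
x_min, x_max = X2_train[:, 0].min() - 1, X2_train[:, 0].max() + 1
y_min, y_max = X2_train[:, 1].min() - 1, X2_train[:, 1].max() + 1
xx, yy = np.meshgrid(np.linspace(x_min, x_max, 300),
                     np.linspace(y_min, y_max, 300))

fig, axes = plt.subplots(1, 3, figsize=(15, 4))
for ax, (name, clf) in zip(axes, classifiers.items()):
    clf.fit(X2_train, y_train)  # refit each classifier on the 2-D features
    zz = clf.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)
    ax.contourf(xx, yy, zz, alpha=0.3)  # shaded decision regions
    ax.scatter(X2_train[y_train == 0, 0], X2_train[y_train == 0, 1],
               c="blue", label="no divorce (y = 0)")
    ax.scatter(X2_train[y_train == 1, 0], X2_train[y_train == 1, 1],
               c="red", label="divorce (y = 1)")
    ax.set_title(name)
    ax.legend()
plt.show()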
