Riphah International University
Introduction to Machine Learning    Spring 2021
Assignment 1 Marks: 20
Due on March 29, 2021
Topic Included: Multiple Linear Regression in Python
A dataset [Link] is uploaded along with this assignment document. The dataset file contains 3
features, namely exam1, exam2 and exam3, representing a student's scores in quizzes, assignments
and the midterm exam respectively. The last column is the target, representing the score in the final exam. Do the following:
1. Print the minimum and maximum scores in each exam.
2. Show the average marks in each exam.
3. Plot the scores in each exam against the final exam score.
4. Perform simple linear regression, i.e. with one feature, using each of the 3 exam scores in turn to
predict the final exam score. Report the R2 value for each.
5. Perform multiple linear regression on all possible pairs of features to predict the final exam
score and report the R2 value for each pair.
6. Perform multiple linear regression using all 3 features to predict the final exam score and report the R2
value.
Note: select rows randomly in the train/test split.
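Tasks 1, 2 and 4, together with the random split mentioned in the note, can be sketched as follows. This is a minimal sketch using NumPy only, not a reference solution. The data are synthetic stand-ins (the real dataset file is not reproduced here), and the helper names `random_split`, `fit_ols`, `predict` and `r_squared`, the 80/20 split ratio and the seed are all assumptions made for illustration. scikit-learn's `train_test_split` and `LinearRegression` cover the same steps.

```python
import numpy as np

# Synthetic stand-in for the assignment data (hypothetical values):
# columns are exam1, exam2, exam3, final.
rng = np.random.default_rng(0)
exams = rng.uniform(40, 100, size=(50, 3))
final = exams @ np.array([0.3, 0.3, 0.4]) + rng.normal(0, 3, size=50)
data = np.column_stack([exams, final])
names = ["exam1", "exam2", "exam3", "final"]

# Tasks 1 and 2: minimum, maximum and mean of every column.
for name, col in zip(names, data.T):
    print(f"{name}: min={col.min():.1f} max={col.max():.1f} mean={col.mean():.1f}")


def random_split(X, y, test_fraction=0.2, seed=0):
    """Shuffle row indices and hold out a random fraction for testing."""
    idx = np.random.default_rng(seed).permutation(len(y))
    n_test = int(round(test_fraction * len(y)))
    test, train = idx[:n_test], idx[n_test:]
    return X[train], X[test], y[train], y[test]


def fit_ols(X, y):
    """Least-squares coefficients, with the intercept as the first entry."""
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict(coef, X):
    """Apply intercept plus weighted features to each row of X."""
    return coef[0] + X @ coef[1:]


def r_squared(y_true, y_pred):
    """R2 = 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


# Task 4 for one feature: predict the final score from exam1 alone.
X = data[:, [0]]          # keep X two-dimensional: shape (n_rows, 1)
y = data[:, 3]
X_train, X_test, y_train, y_test = random_split(X, y)
coef = fit_ols(X_train, y_train)
r2_exam1 = r_squared(y_test, predict(coef, X_test))
print(f"exam1 -> final: R2 on held-out rows = {r2_exam1:.3f}")
```

Here R2 is computed on the held-out rows only, so it can differ noticeably from one random split to another. For task 3, one scatter plot per feature (for example with matplotlib's `plt.scatter`) against the final score would be enough.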
You will submit the following table along with your code:
Features                    R2 value
Exam 1
Exam 2
Exam 3
Exam 1, Exam 2
Exam 1, Exam 3
Exam 2, Exam 3
Exam 1, Exam 2, Exam 3
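The seven rows of this table are exactly the non-empty subsets of the three features, which `itertools.combinations` can enumerate. The following is a hedged sketch rather than a reference solution: the data are synthetic, the helper `holdout_r2` and the 80/20 split are assumptions, and the same random split is reused for every row so that the R2 values are comparable with each other.

```python
from itertools import combinations

import numpy as np

# Synthetic stand-in data (hypothetical values): three exam columns and a target.
rng = np.random.default_rng(1)
X_all = rng.uniform(40, 100, size=(60, 3))
y = X_all @ np.array([0.2, 0.3, 0.5]) + rng.normal(0, 3, size=60)
feature_names = ["Exam 1", "Exam 2", "Exam 3"]

# One random split (12 test rows, 48 training rows), reused for every subset.
idx = rng.permutation(len(y))
test_idx, train_idx = idx[:12], idx[12:]


def holdout_r2(cols):
    """Fit least squares on the training rows using the given feature
    columns and return R2 on the held-out rows."""
    X_train = X_all[train_idx][:, cols]
    X_test = X_all[test_idx][:, cols]
    A_train = np.column_stack([np.ones(len(train_idx)), X_train])
    A_test = np.column_stack([np.ones(len(test_idx)), X_test])
    coef, *_ = np.linalg.lstsq(A_train, y[train_idx], rcond=None)
    residual = y[test_idx] - A_test @ coef
    total = y[test_idx] - y[test_idx].mean()
    return 1.0 - (residual @ residual) / (total @ total)


# Subsets of size 1, 2 and 3, in the same order as the table rows.
results = {}
for k in (1, 2, 3):
    for cols in combinations(range(3), k):
        label = ", ".join(feature_names[c] for c in cols)
        results[label] = holdout_r2(list(cols))

print(f"{'Features':<26}R2 value")
for label, r2 in results.items():
    print(f"{label:<26}{r2:.3f}")
```

Fixing the split before the loop is a design choice: if each subset were evaluated on a different random split, differences in R2 would mix the effect of the features with the effect of the split.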
Evaluation Rubric
Criteria                    0-30%                                  31-70%                         71-100%
Syntactic Correctness (4)   More than 4 independent syntax errors  2-3 independent syntax errors  Less than 2 independent syntax errors
Logical Correctness (6)     3 or more logical errors               1-2 logical errors             1 or no logical errors
Results Accuracy (4)        More than 4 wrong values reported      2-3 wrong values               1 or no wrong values
Completion (6)              More than 3 tasks undone               1 or few tasks undone          All tasks completed