
Green University of Bangladesh

Department of Computer Science and Engineering (CSE)


Faculty of Sciences and Engineering
Semester: (Fall, Year:2024), B.Sc. in CSE (Day)

LAB REPORT NO #01


Course Title: Machine Learning Lab
Course Code: CSE-412 Section: 213-D4

Lab Experiment Name: Write a lab report on Linear Regression and Logistic Regression.
Include the cost function differentiation and the code in the report.

Student Details

Name ID
1. Shahedul Islam 213002178

Lab Date : 06-11-2024

Submission Date : 15-11-2024
Course Teacher's Name : Farhan Mahmud

[For Teacher's use only: Don't write anything inside this box]

Lab Report Status

Marks: ………………………………… Signature: .....................

Comments: .............................................. Date: ..............................


Introduction

Logistic regression is a fundamental statistical and machine learning algorithm used for binary classification
problems. Unlike linear regression, which predicts continuous numerical values, logistic regression predicts the
probability of an outcome belonging to one of two classes. For example, it can be used to determine whether a
patient has a specific disease (1) or not (0) based on their medical features.

In this lab, we implemented both algorithms: Linear Regression to predict car prices from attributes such as mileage, engine volume, and vehicle age, and Logistic Regression to classify individuals as diabetic or non-diabetic based on a medical dataset. The diabetes dataset includes features such as glucose level, blood pressure, BMI, and insulin level, along with the target variable (Outcome), which indicates whether the patient has diabetes.

Objective

The objective of this lab is to:

1. Implement Linear Regression and Logistic Regression models.


2. Understand the mathematical foundations of these algorithms, including the cost function and its
differentiation.
3. Analyze datasets using Python.

1. Linear Regression

Mathematical Foundation

Linear Regression aims to model the relationship between a dependent variable y and one or more independent
variables X. The model is expressed as:

h_θ(x) = θ_0 + θ_1 x_1 + θ_2 x_2 + ... + θ_n x_n

where θ_0 is the intercept and θ_1, ..., θ_n are the coefficients (weights) of the n features.

Cost Function

The cost function for Linear Regression is the Mean Squared Error (MSE):

J(θ) = (1 / (2m)) · Σ_{i=1}^{m} ( h_θ(x^(i)) − y^(i) )²

Where:

• h_θ(x^(i)) is the hypothesis function evaluated on the i-th sample.
• y^(i) is the actual value of the i-th sample.
• m is the number of samples.

Gradient Descent

To minimize the cost function, we differentiate J(θ) with respect to each parameter θ_j:

∂J(θ) / ∂θ_j = (1/m) · Σ_{i=1}^{m} ( h_θ(x^(i)) − y^(i) ) · x_j^(i)

Update rule (repeated until convergence, with learning rate α):

θ_j := θ_j − α · (1/m) · Σ_{i=1}^{m} ( h_θ(x^(i)) − y^(i) ) · x_j^(i)
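
To make the update rule concrete, the following is a minimal NumPy sketch of batch gradient descent (separate from the scikit-learn implementation used below; the learning rate, iteration count, and toy data are illustrative choices, not values from the experiment):

import numpy as np

def gradient_descent(X, y, alpha=0.05, iterations=5000):
    """Batch gradient descent for linear regression with an intercept term."""
    m = len(y)
    Xb = np.c_[np.ones(m), X]            # prepend a column of 1s for theta_0
    theta = np.zeros(Xb.shape[1])        # initialize all parameters to zero
    for _ in range(iterations):
        error = Xb @ theta - y           # h_theta(x) - y for every sample
        gradient = (Xb.T @ error) / m    # dJ/d(theta) from the formula above
        theta -= alpha * gradient        # simultaneous update of all parameters
    return theta

# Toy example: data generated from y = 1 + 2x, so theta should approach [1, 2]
X_toy = np.array([[1.0], [2.0], [3.0], [4.0]])
y_toy = np.array([3.0, 5.0, 7.0, 9.0])
print(gradient_descent(X_toy, y_toy))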

Implementation

Dataset

We use the dataset provided, focusing on the Price (dependent variable) and other attributes as independent
variables.

# Importing necessary libraries


import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt

# Dataset preparation (pd.read_csv already returns a DataFrame)
df = pd.read_csv('Car_Raw_Data.csv')

# Checking for missing values


print("Missing values in each column before handling:")
print(df.isnull().sum())

# Handling missing values by imputing them with suitable values
# (column assignment is preferred over the deprecated fillna(..., inplace=True) on a column)
df['Mileage'] = df['Mileage'].fillna(df['Mileage'].mean())
df['EngineV'] = df['EngineV'].fillna(df['EngineV'].median())
df['Price'] = df['Price'].fillna(df['Price'].mean())
df['Year'] = df['Year'].fillna(df['Year'].median())

# Verifying that there are no missing values


print("\nMissing values in each column after handling:")
print(df.isnull().sum())

# Preprocessing
df['Age'] = 2024 - df['Year'] # Calculate the age of the car
X = df[['Mileage', 'EngineV', 'Age']] # Independent variables
y = df['Price'] # Dependent variable

# Splitting dataset into training and testing sets


X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Linear Regression Model


model = LinearRegression()
model.fit(X_train, y_train)

# Predictions
y_pred = model.predict(X_test)

# Results
print("\nModel Coefficients:")
print(model.coef_)
print(f"Intercept: {model.intercept_}")

# Plotting actual vs predicted prices


plt.scatter(y_test, y_pred, alpha=0.7)
plt.xlabel('Actual Prices')
plt.ylabel('Predicted Prices')
plt.title('Actual vs Predicted Prices')
plt.show()

OUTPUT: (the script prints the model coefficients and intercept, then displays a scatter plot of actual vs. predicted prices)
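
As a supplementary check (not part of the original script), the MSE cost discussed above can also be reported on the test set; this sketch assumes the y_test and y_pred variables from the code above are still in scope:

from sklearn.metrics import mean_squared_error, r2_score

# Test-set error: MSE corresponds to the cost function J(theta) up to the 1/2 factor
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
print(f"Test MSE: {mse:.2f}")
print(f"R^2 score: {r2:.2f}")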

2. Logistic Regression

Mathematical Foundation

Logistic Regression is used for classification tasks. It uses the sigmoid function to map the linear combination of the inputs to a probability between 0 and 1:

h_θ(x) = 1 / (1 + e^(−θᵀx)), where θᵀx = θ_0 + θ_1 x_1 + ... + θ_n x_n

Cost Function

The cost function for logistic regression is the binary cross-entropy (log loss):

J(θ) = −(1/m) · Σ_{i=1}^{m} [ y^(i) · log( h_θ(x^(i)) ) + (1 − y^(i)) · log( 1 − h_θ(x^(i)) ) ]

Gradient Descent

Differentiating J(θ) with respect to θ_j gives the same form as in Linear Regression, but with the sigmoid hypothesis:

∂J(θ) / ∂θ_j = (1/m) · Σ_{i=1}^{m} ( h_θ(x^(i)) − y^(i) ) · x_j^(i)

and each parameter is updated as θ_j := θ_j − α · ∂J(θ) / ∂θ_j.
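
For illustration, a minimal from-scratch NumPy sketch of these formulas is shown below (separate from the scikit-learn model used in the implementation; the function names and default learning rate are illustrative choices):

import numpy as np

def sigmoid(z):
    """Sigmoid function mapping any real value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))

def logistic_cost(theta, X, y):
    """Binary cross-entropy cost J(theta)."""
    m = len(y)
    h = sigmoid(X @ theta)
    return -(y @ np.log(h) + (1 - y) @ np.log(1 - h)) / m

def gradient_step(theta, X, y, alpha=0.1):
    """One gradient-descent update using dJ/d(theta_j) = (1/m) * sum((h - y) * x_j)."""
    m = len(y)
    h = sigmoid(X @ theta)
    return theta - alpha * (X.T @ (h - y)) / m

Repeating gradient_step until the cost stops decreasing yields a basic trained classifier; scikit-learn's LogisticRegression relies on more advanced solvers but follows the same objective.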

Implementation
Dataset
We use the dataset for predicting Outcome (dependent variable) based on independent variables.
# Importing necessary libraries
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report
import matplotlib.pyplot as plt

# Dataset preparation (pd.read_csv already returns a DataFrame)
df = pd.read_csv('diabetes.csv')

# Checking for missing values


print("Missing values in each column before handling:")
print(df.isnull().sum())

# Handling missing values by imputing suitable values
# (column assignment is preferred over the deprecated fillna(..., inplace=True) on a column)
df['Glucose'] = df['Glucose'].fillna(df['Glucose'].median())
df['BloodPressure'] = df['BloodPressure'].fillna(df['BloodPressure'].median())
df['SkinThickness'] = df['SkinThickness'].fillna(df['SkinThickness'].median())
df['Insulin'] = df['Insulin'].fillna(df['Insulin'].median())
df['BMI'] = df['BMI'].fillna(df['BMI'].median())

# Verifying that there are no missing values


print("\nMissing values in each column after handling:")
print(df.isnull().sum())

# Separating features and target variable


X = df.drop('Outcome', axis=1) # Independent variables
y = df['Outcome'] # Target variable

# Splitting the dataset into training and testing sets


X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Logistic Regression Model


model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)

# Making predictions
y_pred = model.predict(X_test)

# Evaluation Metrics
accuracy = accuracy_score(y_test, y_pred)
conf_matrix = confusion_matrix(y_test, y_pred)
class_report = classification_report(y_test, y_pred)

# Displaying results
print(f"\nAccuracy: {accuracy:.2f}")
print("\nConfusion Matrix:")
print(conf_matrix)
print("\nClassification Report:")
print(class_report)

# Visualization of the Confusion Matrix


plt.matshow(conf_matrix, cmap='Blues')
plt.title('Confusion Matrix')
plt.colorbar()
plt.xlabel('Predicted')
plt.ylabel('Actual')
plt.show()

OUTPUT: (the script prints the accuracy, confusion matrix, and classification report, then displays the confusion matrix as a heatmap)
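
Since the report emphasizes that logistic regression outputs probabilities, a small follow-up sketch (assuming the fitted model and X_test from the script above) shows how those probabilities can be inspected directly:

# Probability that each test sample belongs to class 1 (diabetic)
probs = model.predict_proba(X_test)[:, 1]
print(probs[:5])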

Discussion

The linear regression experiment showed how a continuous target (car price) can be modeled from numeric attributes such as mileage, engine volume, and age, while the logistic regression implementation provided a practical introduction to classification modeling. In both cases the pipeline, from data preprocessing to evaluation, reflects standard practice, including missing-value imputation and performance measurement. Further steps such as hyperparameter tuning, feature scaling, and handling class imbalance could enhance the classifier's predictive capability, as sketched below.
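
A minimal sketch of those improvements, assuming the X_train and y_train variables from the logistic regression script are available; the regularization grid and scoring metric are illustrative choices, not values prescribed by the lab:

from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

# Scale the features, weight classes to counter imbalance, and tune the
# regularization strength C with 5-fold cross-validation on the F1 score.
pipeline = make_pipeline(StandardScaler(),
                         LogisticRegression(max_iter=1000, class_weight='balanced'))
param_grid = {'logisticregression__C': [0.01, 0.1, 1, 10]}
search = GridSearchCV(pipeline, param_grid, cv=5, scoring='f1')
search.fit(X_train, y_train)

print("Best parameters:", search.best_params_)
print("Best cross-validated F1:", search.best_score_)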
