0% found this document useful (0 votes)

8 views

Regression Analysis

Uploaded by

datasciencetrainingnucot

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Regression Analysis

Uploaded by

datasciencetrainingnucot

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Regression analysis in data mining or machine learning

What is regression ?
This is a data mining technique or approach the data in a different way .
It is a technique that predicts the value of Y variable based on the values of X
variables.
Y is dependent on X1,X2,X3,…,Xn variables.

Use cases of regression

For example, As temperature decreases sale of Jackets increases. So there is direct
relationship between sale of Jacket with the weather. Here there is a direct
relationship between these two variables.
As temperature increases sales of ice cream increases, here again there is a direct
relationship between these two variables.
Regression helps you to confirm that direct relationship between any two or more
variables
Types of regression
Simple linear regression
We all know the straight line formula , that is y=mx+c
Linear regression is based on this formula.
Salary, Uber cab fare , weight, height are continuous variables.
You cannot do linear regression on discreate data.
Multiple linear regression
Understand simple linear regression with
observations
Understand simple linear regression with
multiple observations
Implement regression in python
So here we do not need to write a program to implement the formula y=mx+c , sci-
kit learn package will do the work for us.
Sci-kit learn library is the machine learning algorithm . where all the algorithm is
already implemented previously.
So here you just need to call that particular function and complete the work.

So 1st we will train our model and then use it on the data set for which we need to
run the simple linear regression .
So here in the statement model.fit(x,y) you are asking Python to please train for the
dataset that I am having in x and y
Steps to be followed :
Linear Regression:
Linear regression is used for predicting continuous values based on input features.

1. Data Preparation
- Data Cleaning: Handle missing values, outliers, etc.
- Feature Selection/Engineering: Choose relevant features for prediction.

2. Model Definition:
- Model Hypothesis: Assume a linear relationship between input features X and
output y.
- Model Representation:
3. Cost Function:

4. Optimization:

5. Training:
- Iterate until convergence or predefined number of iterations.

6. Prediction:
- Once trained, predict y for new input x using learned theta θ.

Logistic Regression:
Logistic regression is used for binary classification problems.

1. Data Preparation:
- Same as for linear regression: clean data, select/transform features.

2. Model Definition:
3. Cost Function:

4. Optimization:

5. Training:
- Iterate until convergence or predefined number of iterations.

6. Prediction:
- Classify new instances based on the learned parameters θ.

Overview of Linear Regression

Linear regression is a fundamental supervised learning algorithm used for predictive analysis. It
models the relationship between a dependent variable (target) and one or more independent
variables (features) by fitting a linear equation to observed data. The goal is to find the best-
fitting line (or hyperplane in higher dimensions) that minimizes the sum of squared residuals
between the observed responses in the dataset and the responses predicted by the linear
approximation.

Key Concepts in Linear Regression:

 Simple Linear Regression: Involves one independent variable.

 Multiple Linear Regression: Involves multiple independent variables.
 Assumptions: Assumes a linear relationship between the predictors and the target,
independence of errors, homoscedasticity (constant variance of errors), and normality of
errors.

Implementation of Linear Regression in Python

Python provides several libraries for implementing linear regression, with scikit-learn being
one of the most popular for machine learning tasks. Here’s a step-by-step guide to implementing
linear regression in Python using scikit-learn:

1. Import Libraries:

python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score

2. Load and Prepare Data:

python
# Example: Loading data from a CSV file
data = pd.read_csv('data.csv')
X = data[['feature1', 'feature2', ...]] # Features
y = data['target'] # Target variable

3. Split Data into Training and Testing Sets:

python
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

4. Initialize and Fit the Model:

python
model = LinearRegression()
model.fit(X_train, y_train)

5. Predictions and Evaluation:

python
y_pred = model.predict(X_test)

# Evaluate the model

mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
print(f'Mean Squared Error: {mse}')
print(f'R^2 Score: {r2}')

Notes on Implementation:
 Feature Scaling: Linear regression assumes features are on the same scale. Consider
scaling features (e.g., using StandardScaler) if they have different ranges.
 Interpretation: Coefficients (model.coef_) indicate the impact of each feature on the
target variable.
 Regularization: Use regularization techniques like Ridge (Ridge) or Lasso (Lasso)
regression to prevent overfitting.

Example Application:

Linear regression is used in various domains, such as:

 Finance: Predicting stock prices based on historical data.

 Marketing: Estimating sales based on advertising spend.
 Healthcare: Predicting patient outcomes based on medical data.

By understanding these concepts and implementing linear regression in Python, you can leverage
this powerful algorithm for predictive modeling and data analysis tasks effectively

Overview of Logistic Regression

Logistic regression is a supervised learning algorithm used for binary classification tasks, where
the target variable (dependent variable) is categorical and represents two possible outcomes (e.g.,
0 or 1, yes or no). Despite its name, logistic regression is a linear model for classification rather
than regression. It estimates the probability of the target variable belonging to a particular class
based on the linear combination of predictor variables.

Key Concepts in Logistic Regression:

 Sigmoid Function: Transforms the linear output into a probability score between 0 and
1.
 Logistic Loss (Log-Loss): Measures the performance of the model by penalizing false
classifications.
 Regularization: Helps prevent overfitting by penalizing large coefficients (e.g., L2
regularization in Ridge Logistic Regression).

Implementation of Logistic Regression in Python

Python offers robust libraries like scikit-learn for implementing logistic regression. Below is
a practical guide to implementing logistic regression in Python:

1. Import Libraries:

python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, confusion_matrix,
classification_report

2. Load and Prepare Data:

python
# Example: Loading data from a CSV file
data = pd.read_csv('data.csv')
X = data[['feature1', 'feature2', ...]] # Features
y = data['target'] # Target variable (binary)

3. Split Data into Training and Testing Sets:

python
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

4. Initialize and Fit the Model:

python
model = LogisticRegression()
model.fit(X_train, y_train)

5. Predictions and Evaluation:

python
y_pred = model.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)
confusion_mat = confusion_matrix(y_test, y_pred)
classification_rep = classification_report(y_test, y_pred)

print(f'Accuracy: {accuracy}')
print(f'Confusion Matrix:\n {confusion_mat}')
print(f'Classification Report:\n {classification_rep}')

Notes on Implementation:

 Binary Classification: Logistic regression is suitable for binary outcomes (e.g., yes/no,
spam/not spam).
 Probability Interpretation: Predicted probabilities (from model.predict_proba)
indicate the likelihood of each class.
 Feature Importance: Coefficients (model.coef_) provide insights into the impact of
each feature on the target variable.
 Regularization: Use hyperparameters like C (inverse of regularization strength) to
control overfitting.

Example Application:
Logistic regression finds application in various domains, including:

 Healthcare: Predicting the likelihood of a disease based on patient characteristics.

 Finance: Predicting the likelihood of default based on financial indicators.
 Marketing: Predicting customer churn based on behavioral data.

By understanding these concepts and implementing logistic regression in Python, you can
effectively build and evaluate classification models for binary decision-making tasks in data
science projects.Top of Form

Morse 4400 Manual
100% (1)
Morse 4400 Manual
18 pages
Machine Learning Lab Manual 06
100% (1)
Machine Learning Lab Manual 06
8 pages
FPD-7024 Fire Alarm Control Panel: Smoke Detector Compatibility List
No ratings yet
FPD-7024 Fire Alarm Control Panel: Smoke Detector Compatibility List
2 pages
MIS Lab Practice Exercises
No ratings yet
MIS Lab Practice Exercises
5 pages
Linear Regression Code
No ratings yet
Linear Regression Code
5 pages
ML Lab Manual
100% (1)
ML Lab Manual
37 pages
Intro to Linear and Logistic Reg
No ratings yet
Intro to Linear and Logistic Reg
5 pages
B-56 Sanket Jambhulkar MLA-2
No ratings yet
B-56 Sanket Jambhulkar MLA-2
8 pages
Logistic Regression
No ratings yet
Logistic Regression
13 pages
Experiment1 Explanation
No ratings yet
Experiment1 Explanation
6 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
Machine Learning Lab Notes
No ratings yet
Machine Learning Lab Notes
3 pages
Implementation of Linear Regression With Python
No ratings yet
Implementation of Linear Regression With Python
5 pages
AIand MLlab 5
No ratings yet
AIand MLlab 5
10 pages
Machine Learning With Python Algorithms
No ratings yet
Machine Learning With Python Algorithms
28 pages
Linear Regression
No ratings yet
Linear Regression
8 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
ML LN 3
No ratings yet
ML LN 3
44 pages
Linear Regression
No ratings yet
Linear Regression
5 pages
Week-7 DS Practical (1)
No ratings yet
Week-7 DS Practical (1)
8 pages
22UCS303 DS-Unit IV-LINEAR REGRESSION
No ratings yet
22UCS303 DS-Unit IV-LINEAR REGRESSION
19 pages
AI algorithm
No ratings yet
AI algorithm
40 pages
LR-LogReg
No ratings yet
LR-LogReg
53 pages
Module-2_Logistic Regression in Machine Learning
No ratings yet
Module-2_Logistic Regression in Machine Learning
28 pages
Assignment No.4 - (20-Ele-68)
No ratings yet
Assignment No.4 - (20-Ele-68)
17 pages
Linear and Logistic Regression
No ratings yet
Linear and Logistic Regression
21 pages
LinearRegression
No ratings yet
LinearRegression
4 pages
Wa0004.
No ratings yet
Wa0004.
9 pages
Unit 5
No ratings yet
Unit 5
171 pages
Unit 2 Regression Analysis
No ratings yet
Unit 2 Regression Analysis
16 pages
AI14 - MachineLearning
No ratings yet
AI14 - MachineLearning
49 pages
4. Logistic Regression
No ratings yet
4. Logistic Regression
21 pages
Linear Regression
No ratings yet
Linear Regression
5 pages
DSBDL - Write - Ups - 4 To 7
No ratings yet
DSBDL - Write - Ups - 4 To 7
11 pages
AI14 - MachineLearning
No ratings yet
AI14 - MachineLearning
49 pages
Lecture Material 11
No ratings yet
Lecture Material 11
14 pages
Lab#10 Ai
No ratings yet
Lab#10 Ai
3 pages
DAV-EXP
No ratings yet
DAV-EXP
11 pages
6 ML Updated
No ratings yet
6 ML Updated
23 pages
Logistic Regression Algorithm
No ratings yet
Logistic Regression Algorithm
8 pages
ML Algorithm
No ratings yet
ML Algorithm
4 pages
Supervised Machine Learning - Regression
No ratings yet
Supervised Machine Learning - Regression
34 pages
Module 4
No ratings yet
Module 4
41 pages
Linear Regression - Jupyter Notebook
100% (3)
Linear Regression - Jupyter Notebook
56 pages
Experiment No 3
No ratings yet
Experiment No 3
7 pages
30 GM ASAP Linear Regression
No ratings yet
30 GM ASAP Linear Regression
10 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
B24 ML Exp-1
No ratings yet
B24 ML Exp-1
10 pages
Logistic Regression
No ratings yet
Logistic Regression
14 pages
ML-Unit 4
No ratings yet
ML-Unit 4
29 pages
lab mannual of ML
No ratings yet
lab mannual of ML
43 pages
228w1f0065 ML
No ratings yet
228w1f0065 ML
15 pages
Vishal AIML 2.2
No ratings yet
Vishal AIML 2.2
4 pages
Write a lab report on Linear Regression and Logistic Regression. Include the cost function differentiation and the code in the report.
No ratings yet
Write a lab report on Linear Regression and Logistic Regression. Include the cost function differentiation and the code in the report.
7 pages
ML Unit
No ratings yet
ML Unit
23 pages
Regression Modelling
No ratings yet
Regression Modelling
25 pages
DMML Unit4
No ratings yet
DMML Unit4
77 pages
DSUP_Exp4[1]
No ratings yet
DSUP_Exp4[1]
6 pages
ML_7th_Sem_AIML_ITE_Notes_Complete_LONG[1]-34-62
No ratings yet
ML_7th_Sem_AIML_ITE_Notes_Complete_LONG[1]-34-62
29 pages
Aychew Chernet
No ratings yet
Aychew Chernet
8 pages
Essentials of Linear Regression in Python
No ratings yet
Essentials of Linear Regression in Python
23 pages
AI lab8
No ratings yet
AI lab8
8 pages
Process Performance Models: Statistical, Probabilistic & Simulation
From Everand
Process Performance Models: Statistical, Probabilistic & Simulation
Vishnuvarthanan Moorthy
No ratings yet
Exploring The Perceptions and Experiences of Some Freshmen Using Online Registration System in Niger
No ratings yet
Exploring The Perceptions and Experiences of Some Freshmen Using Online Registration System in Niger
9 pages
Vocal Function Exercises For Presbylaryn PDF
No ratings yet
Vocal Function Exercises For Presbylaryn PDF
9 pages
SSC CHSL 2023 August 4 Shift 1
No ratings yet
SSC CHSL 2023 August 4 Shift 1
29 pages
Resources, Solar Resources
No ratings yet
Resources, Solar Resources
9 pages
New EASE Focus Project 2023-09-09 19-22
No ratings yet
New EASE Focus Project 2023-09-09 19-22
7 pages
CM Eg0414p
No ratings yet
CM Eg0414p
1 page
Baba God Assn
No ratings yet
Baba God Assn
43 pages
PH1 January 2003
No ratings yet
PH1 January 2003
24 pages
ACN Micro Project
No ratings yet
ACN Micro Project
16 pages
LNT80 - 80,000m LNG Carrier: Main Dimensions Machinery & Propulsion
No ratings yet
LNT80 - 80,000m LNG Carrier: Main Dimensions Machinery & Propulsion
2 pages
Real Time Data Get From Stock Exchange Using PHP
No ratings yet
Real Time Data Get From Stock Exchange Using PHP
6 pages
Portfolio Part 3 - Individual Assessment: - Integrity - Checklist PDF
No ratings yet
Portfolio Part 3 - Individual Assessment: - Integrity - Checklist PDF
2 pages
Defined by Excellence: Annual Report 2020/21
No ratings yet
Defined by Excellence: Annual Report 2020/21
200 pages
CG
No ratings yet
CG
38 pages
Eoi Imported Sand English 2
No ratings yet
Eoi Imported Sand English 2
7 pages
Catalogo SECO
No ratings yet
Catalogo SECO
340 pages
BIMESTRAL 11° 1ER CORTE 2023
No ratings yet
BIMESTRAL 11° 1ER CORTE 2023
2 pages
Reliance Industries Limited by Chirag
100% (1)
Reliance Industries Limited by Chirag
31 pages
Formative Assessment Survival Kit
No ratings yet
Formative Assessment Survival Kit
3 pages
2024上半年四级翻译课讲义 PDF打印版
No ratings yet
2024上半年四级翻译课讲义 PDF打印版
28 pages
1MRK504086-UEN C en Technical Reference Manual Transformer Protection IED RET 670 1.1
No ratings yet
1MRK504086-UEN C en Technical Reference Manual Transformer Protection IED RET 670 1.1
980 pages
(Process Safety Progress 2009-Sep Vol. 28 Iss. 3) Angela E. Summers - Safety Management Is A Virtue (2009) (10.1002 - prs.10337) - Libgen - Li
No ratings yet
(Process Safety Progress 2009-Sep Vol. 28 Iss. 3) Angela E. Summers - Safety Management Is A Virtue (2009) (10.1002 - prs.10337) - Libgen - Li
4 pages
THE GRANGE CHRISTIAN SCHOOL MATHS GRADE 6 PAPER1
No ratings yet
THE GRANGE CHRISTIAN SCHOOL MATHS GRADE 6 PAPER1
2 pages
Classxis
No ratings yet
Classxis
2 pages
Introduction and History of Pharmacovigilance
33% (3)
Introduction and History of Pharmacovigilance
37 pages
Speeding Up The Transition To Collective Awareness: Luce Jacovella Pietro Li o
No ratings yet
Speeding Up The Transition To Collective Awareness: Luce Jacovella Pietro Li o
5 pages
Buckling Analysis of Cold-Formed Steel Members Using CUFSM
No ratings yet
Buckling Analysis of Cold-Formed Steel Members Using CUFSM
17 pages

Regression Analysis

Uploaded by

Regression Analysis

Uploaded by

Regression analysis in data mining or machine learning

Use cases of regression

Overview of Linear Regression

Key Concepts in Linear Regression:

 Simple Linear Regression: Involves one independent variable.

Implementation of Linear Regression in Python

2. Load and Prepare Data:

3. Split Data into Training and Testing Sets:

4. Initialize and Fit the Model:

5. Predictions and Evaluation:

# Evaluate the model

Linear regression is used in various domains, such as:

 Finance: Predicting stock prices based on historical data.

Overview of Logistic Regression

Key Concepts in Logistic Regression:

Implementation of Logistic Regression in Python

2. Load and Prepare Data:

3. Split Data into Training and Testing Sets:

4. Initialize and Fit the Model:

5. Predictions and Evaluation:

# Evaluate the model

 Healthcare: Predicting the likelihood of a disease based on patient characteristics.

You might also like