0% found this document useful (0 votes)

4 views

hemraj_python_ass1

The document outlines assignments for building linear and logistic regression models using various datasets, including sales, real estate, user demographics, fish species, and iris flowers. It provides step-by-step programming instructions using Python libraries such as pandas, numpy, and scikit-learn for data manipulation and model training. Each assignment includes dataset creation, data splitting, model training, prediction, and evaluation of model accuracy.

Uploaded by

hemrajbhongale8

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

hemraj_python_ass1

Uploaded by

hemrajbhongale8

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Assignment 1: Linear and Logistic Regression

SET A
1.Create 'sales' Data set having 5 columns namely: ID, TV, Radio, Newspaper
and Sales. (random 500 entries) Build a linear regression model by identifying
independent and target variable. Split the variables into training and testing
sets. then divide the training and testing sets into a 7:3 ratio, respectively and
print them. Build a simple linear regression model.
Program:-
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
import matplotlib.pyplot as plt

# Step 1: Create the sales dataset

np.random.seed(42)
ID = np.arange(1, 501)
TV = np.random.uniform(0, 100, 500)
Radio = np.random.uniform(0, 50, 500)
Newspaper = np.random.uniform(0, 30, 500)
Sales = 3 + 0.05 * TV + 0.1 * Radio + 0.02 * Newspaper + np.random.normal(0, 5, 500)

sales_data = pd.DataFrame({
'ID': ID,
'TV': TV,
'Radio': Radio,
'Newspaper': Newspaper,
'Sales': Sales
})

# Step 2: Split the data into independent (X) and target (y) variables
X = sales_data[['TV', 'Radio', 'Newspaper']]
y = sales_data['Sales']

# Step 3: Split the dataset into training and testing sets (7:3 ratio)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Step 4: Print the split data

print("Training set (X_train):")
print(X_train.head())
print("Testing set (X_test):")
print(X_test.head())
# Step 5: Build the linear regression model
model = LinearRegression()
model.fit(X_train, y_train)

# Step 6: Make predictions

y_pred = model.predict(X_test)

# Print the coefficients

print("Coefficients:", model.coef_)
print("Intercept:", model.intercept_)

# Step 7: Plot the results

plt.scatter(y_test, y_pred)
plt.xlabel("Actual Sales")
plt.ylabel("Predicted Sales")
plt.title("Linear Regression: Actual vs Predicted Sales")
plt.show()
Output:-
Example output for training set:

Training set (X_train):

TV Radio Newspaper
374 4.537760 25.451522 9.047601
28 70.243315 25.989796 22.231161
456 80.651719 44.563722 12.669033
209 60.330544 16.218829 26.485149
431 96.945695 27.497699 18.827547
Example output for testing set:

Testing set (X_test):

TV Radio Newspaper
80 8.139962 43.664348 3.476083
125 45.285008 15.660353 28.916305
225 65.058937 27.791765 3.798982
282 72.334036 48.151510 12.084336
305 55.535741 37.179261 9.443671
Coefficients and Intercept: After training the model, you will see the model's coefficients
and intercept printed, showing the relationship between the independent variables and the
target (sales).

Example output:

Coefficients: [0.05023864 0.09843639 0.02031991]

Intercept: 3.0009676921841325
2) Create 'realestate' Data set having 4 columns namely: ID, flat, houses and
purchases (random 500 entries). Build a linear regression model by
identifying independent and target variable. Split the variables into training
and testing sets and print them. Build a simple linear regression model for
predicting purchases.
Program:-
# Step 1: Create the real estate dataset
flat = np.random.uniform(50, 200, 500)
houses = np.random.uniform(1, 10, 500)
purchases = 200 + 1.5 * flat + 3 * houses + np.random.normal(0, 50, 500)

realestate_data = pd.DataFrame({
'ID': ID,
'flat': flat,
'houses': houses,
'purchases': purchases
})

# Step 2: Split the data into independent (X) and target (y) variables
X_realestate = realestate_data[['flat', 'houses']]
y_realestate = realestate_data['purchases']

# Step 3: Split the dataset into training and testing sets

X_train_realestate, X_test_realestate, y_train_realestate, y_test_realestate =
train_test_split(X_realestate, y_realestate, test_size=0.3, random_state=42)

# Step 4: Print the split data

print("Training set (X_train_realestate):")
print(X_train_realestate.head())
print("Testing set (X_test_realestate):")
print(X_test_realestate.head())

# Step 5: Build the linear regression model

model_realestate = LinearRegression()
model_realestate.fit(X_train_realestate, y_train_realestate)

# Step 6: Make predictions

y_pred_realestate = model_realestate.predict(X_test_realestate)

# Print the coefficients

print("Coefficients:", model_realestate.coef_)
print("Intercept:", model_realestate.intercept_)

# Step 7: Plot the results

plt.scatter(y_test_realestate, y_pred_realestate)
plt.xlabel("Actual Purchases")
plt.ylabel("Predicted Purchases")
plt.title("Linear Regression: Actual vs Predicted Purchases")
plt.show()
Output:-
Example structure of the dataset:

Copy
ID flat houses purchases
1 150.5 5.2 853.0
2 130.0 3.1 725.5
3 178.9 8.7 935.8
4 124.3 4.5 688.2

3) Create 'User' Data set having 5 columns namely: User ID, Gender, Age,
EstimatedSalary and Purchased. Build a logistic regression model that can
predict whether on the given parameter a person will buy a car or not.
Program:-

from sklearn.linear_model
import LogisticRegression
from sklearn.preprocessing
import LabelEncoder
from sklearn.metrics
import accuracy_score

# Step 1: Create the User dataset

user_id = np.arange(1, 501)
gender = np.random.choice(['Male', 'Female'], 500)
age = np.random.randint(18, 70, 500)
estimated_salary = np.random.uniform(15000, 120000, 500)
purchased = np.random.choice([0, 1], 500)

user_data = pd.DataFrame({
'User ID': user_id,
'Gender': gender,
'Age': age,
'EstimatedSalary': estimated_salary,
'Purchased': purchased
})

# Step 2: Encode categorical 'Gender' feature

le = LabelEncoder()
user_data['Gender'] = le.fit_transform(user_data['Gender'])

# Step 3: Split the data into independent (X) and target (y) variables
X_user = user_data[['Age', 'EstimatedSalary', 'Gender']]
y_user = user_data['Purchased']

# Step 4: Split the dataset into training and testing sets

X_train_user, X_test_user, y_train_user, y_test_user = train_test_split(X_user, y_user,
test_size=0.3, random_state=42)

# Step 5: Build the logistic regression model

log_reg_model = LogisticRegression()
log_reg_model.fit(X_train_user, y_train_user)

# Step 6: Make predictions

y_pred_user = log_reg_model.predict(X_test_user)

# Step 7: Print accuracy

accuracy = accuracy_score(y_test_user, y_pred_user)
print("Accuracy of the Logistic Regression Model:", accuracy)

Output:-
Accuracy of the Logistic Regression Model: 0.89

SET B

1) Build a simple linear regression model for Fish Species Weight Prediction.
(download dataset https://www.kaggle.com/aungpyaeap/fish-
market?select=Fish.csv)
Program:-

import pandas as pd
from sklearn.linear_model
import LinearRegression
from sklearn.model_selection
import train_test_split

# Step 1: Load the fish dataset

fish_data = pd.read_csv('Fish.csv')

# Step 2: Split the data into independent (X) and target (y) variables
X_fish = fish_data[['Length', 'Width', 'Height']]
y_fish = fish_data['Weight']
# Step 3: Split the dataset into training and testing sets
X_train_fish, X_test_fish, y_train_fish, y_test_fish = train_test_split(X_fish, y_fish,
test_size=0.3, random_state=42)

# Step 4: Build the linear regression model

fish_model = LinearRegression()
fish_model.fit(X_train_fish, y_train_fish)

# Step 5: Make predictions

y_pred_fish = fish_model.predict(X_test_fish)

# Print the coefficients

print("Coefficients:", fish_model.coef_)
print("Intercept:", fish_model.intercept_)
Output:-
Length1 Length2 Length3
Height Width Weight
0 23.2 25.4 30.011.54.0242.0
1 24.0 26.3 31.212.04.8290.0
2 23.9 26.5 31.112.24.8340.0
3 26.3 29.0 33.512.45.0363.0
4 26.5 29.0 34.012.54.9430.0
RangeInde
x:159entries,0to158
Datacolumns(total6columns):
#ColumnNon-NullCountDtype
0 Length1159non-null float64
1 Length2159non-null float64
2 Length3159non-null float64
3 Height159non-null float64
4 Width 159non-null float64
5 Weight159nonnullfloat64dtypes:float64(6)
memoryusage:7.6KBN
one
MeanSquaredError:2746.50Rsquared:0.885

2) Use the iris dataset. Write a Python program to view some basic statistical
details like percentile, mean, std etc. of the species of 'Iris- setosa', 'Iris-
versicolor' and 'Iris-virginica'. Apply logistic regression on the dataset to
identify different species (setosa, versicolor, verginica) of Iris flowers given
just 4 features: sepal and petal lengths and widths.. Find the accuracy of the
model.
Program:-
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Step 1: Load the Iris dataset

iris = load_iris()
X_iris = iris.data
y_iris = iris.target

# Step 2: Split the dataset into training and testing sets

X_train_iris, X_test_iris, y_train_iris, y_test_iris = train_test_split(X_iris, y_iris, test_size=0.3,
random_state=42)

# Step 3: Build the logistic regression model

log_reg_iris = LogisticRegression(max_iter=200)
log_reg_iris.fit(X_train_iris, y_train_iris)

# Step 4: Make predictions

y_pred_iris = log_reg_iris.predict(X_test_iris)

# Step 5: Calculate accuracy

accuracy_iris = accuracy_score(y_test_iris, y_pred_iris)
print("Accuracy of Logistic Regression Model for Iris Dataset:", accuracy_iris)
Output:-

Accuracy of Logistic Regression Model for Iris Dataset: 0.9777777777777777

Machine Learning Lab Manual 06
100% (1)
Machine Learning Lab Manual 06
8 pages
DA Practicle Answers Easyw
No ratings yet
DA Practicle Answers Easyw
30 pages
Data Analytics Program
No ratings yet
Data Analytics Program
11 pages
ML Lab Programs (1)
No ratings yet
ML Lab Programs (1)
9 pages
DA_012307
No ratings yet
DA_012307
8 pages
MachineLearning
No ratings yet
MachineLearning
10 pages
Logistic Regression
No ratings yet
Logistic Regression
13 pages
Machine Learning 2
No ratings yet
Machine Learning 2
45 pages
Module-2_Logistic Regression in Machine Learning
No ratings yet
Module-2_Logistic Regression in Machine Learning
28 pages
Vishal AIML 2.2
No ratings yet
Vishal AIML 2.2
4 pages
ML 4,5,6 (Sample1)
No ratings yet
ML 4,5,6 (Sample1)
6 pages
Rain in Australia Logistic Regression Classifier
No ratings yet
Rain in Australia Logistic Regression Classifier
10 pages
Week-7 DS Practical (1)
No ratings yet
Week-7 DS Practical (1)
8 pages
Lab#10 Ai
No ratings yet
Lab#10 Ai
3 pages
Machine Learning Hands-On
100% (1)
Machine Learning Hands-On
18 pages
2.1 ML (Implementation of Simple Linear Regression in Python)
No ratings yet
2.1 ML (Implementation of Simple Linear Regression in Python)
8 pages
Web II & DA Slip Solution
No ratings yet
Web II & DA Slip Solution
40 pages
Lab Manual 04
No ratings yet
Lab Manual 04
12 pages
Train
No ratings yet
Train
17 pages
ML Activity Kalyan
No ratings yet
ML Activity Kalyan
21 pages
FYMCA IDSLab A6 Submission
No ratings yet
FYMCA IDSLab A6 Submission
9 pages
Machine Learning Strategies
No ratings yet
Machine Learning Strategies
59 pages
DL Lab 5
No ratings yet
DL Lab 5
3 pages
Simple Linear Regression in Machine Learning
No ratings yet
Simple Linear Regression in Machine Learning
7 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
43 pages
Ritesh Mangla ML PracticalFile
No ratings yet
Ritesh Mangla ML PracticalFile
55 pages
ML File
No ratings yet
ML File
10 pages
Data analytics
No ratings yet
Data analytics
10 pages
Wa0004.
No ratings yet
Wa0004.
9 pages
Data analytics assignment solutions
No ratings yet
Data analytics assignment solutions
20 pages
lab mannual of ML
No ratings yet
lab mannual of ML
43 pages
Regression Dataset Example
No ratings yet
Regression Dataset Example
14 pages
4. Logistic Regression
No ratings yet
4. Logistic Regression
21 pages
Home Ai Machine Learning Dbms Java Blockchain Control System Selenium HTML Css Javascript Ds
No ratings yet
Home Ai Machine Learning Dbms Java Blockchain Control System Selenium HTML Css Javascript Ds
11 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
ML LN 3
No ratings yet
ML LN 3
44 pages
Logistic Regression
No ratings yet
Logistic Regression
16 pages
ML manoj
No ratings yet
ML manoj
51 pages
Write a lab report on Linear Regression and Logistic Regression. Include the cost function differentiation and the code in the report.
No ratings yet
Write a lab report on Linear Regression and Logistic Regression. Include the cost function differentiation and the code in the report.
7 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
Experiment1 Explanation
No ratings yet
Experiment1 Explanation
6 pages
Lab Exam ... Roll No 24cs4103
No ratings yet
Lab Exam ... Roll No 24cs4103
4 pages
SHASHANK ML.docx
No ratings yet
SHASHANK ML.docx
23 pages
ML EXTERNAL XEROX
No ratings yet
ML EXTERNAL XEROX
1 page
Practical # 10
No ratings yet
Practical # 10
5 pages
Experiment Number: 3: Aim:-Study of The Linear Regression in The Machine Learning Using The Boston Housing Dataset. 1)
No ratings yet
Experiment Number: 3: Aim:-Study of The Linear Regression in The Machine Learning Using The Boston Housing Dataset. 1)
14 pages
Ml Lab Manual
No ratings yet
Ml Lab Manual
36 pages
ml_6_7_8 (1)
No ratings yet
ml_6_7_8 (1)
10 pages
ML Practical File
No ratings yet
ML Practical File
30 pages
Logistic Regression
100% (1)
Logistic Regression
10 pages
Broadly, There Are 3 Types of Machine Learning Algorithms.
No ratings yet
Broadly, There Are 3 Types of Machine Learning Algorithms.
33 pages
ML LAB FILE (2)
No ratings yet
ML LAB FILE (2)
48 pages
Exp 1
No ratings yet
Exp 1
6 pages
Nibedita Dehury, 123CE0079
No ratings yet
Nibedita Dehury, 123CE0079
13 pages
Regression Analysis
No ratings yet
Regression Analysis
16 pages
Praveen Ai
No ratings yet
Praveen Ai
6 pages
C1 W1 Lab03 Model Representation Soln-Copy1
No ratings yet
C1 W1 Lab03 Model Representation Soln-Copy1
7 pages
Ml Record
No ratings yet
Ml Record
23 pages
Btech1007022_lab5.1
No ratings yet
Btech1007022_lab5.1
9 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Analysis of Trade Before and After The WTO: A Case Study of India
No ratings yet
Analysis of Trade Before and After The WTO: A Case Study of India
8 pages
xtdcce2 help
No ratings yet
xtdcce2 help
25 pages
Luperi 2014 PDF
No ratings yet
Luperi 2014 PDF
20 pages
Econometrics moduleII
100% (2)
Econometrics moduleII
114 pages
Topics in Time Series Econometrics PDF
No ratings yet
Topics in Time Series Econometrics PDF
157 pages
08 212020082 Nalitalia Ramjani
No ratings yet
08 212020082 Nalitalia Ramjani
4 pages
Optimización Fmincon Tutorial 2011
No ratings yet
Optimización Fmincon Tutorial 2011
10 pages
Generalized AutoRegressive Conditional Heteroskedasticity
No ratings yet
Generalized AutoRegressive Conditional Heteroskedasticity
3 pages
Linear Regression Quiz
No ratings yet
Linear Regression Quiz
6 pages
Industrial Market Segmentation
100% (1)
Industrial Market Segmentation
23 pages
Methods For Estimating Regression Discontinuity Design With Multiple Assignment Variables A Comparative Study of Three Estimation Methods
No ratings yet
Methods For Estimating Regression Discontinuity Design With Multiple Assignment Variables A Comparative Study of Three Estimation Methods
73 pages
Week 2 Statprob Q4
No ratings yet
Week 2 Statprob Q4
15 pages
Practice Midterm Questions 1 and 2
No ratings yet
Practice Midterm Questions 1 and 2
4 pages
Dav Exp3 66
No ratings yet
Dav Exp3 66
4 pages
Linear Regression: Rustom D. Sutaria - Avia Intelligence 2016, Dubai
No ratings yet
Linear Regression: Rustom D. Sutaria - Avia Intelligence 2016, Dubai
3 pages
B Dar 2017 Factors Affecting The Growth of Micro - and Small Enterprises
No ratings yet
B Dar 2017 Factors Affecting The Growth of Micro - and Small Enterprises
90 pages
Lecture 5 6 Forecasting
100% (1)
Lecture 5 6 Forecasting
45 pages
Selecting Appropriate Forecast Method On The Basis of Forecast Accuracy
No ratings yet
Selecting Appropriate Forecast Method On The Basis of Forecast Accuracy
10 pages
SEM205 Econometrics Lecture 3
No ratings yet
SEM205 Econometrics Lecture 3
21 pages
The Growth of Firms by Alex Coad
No ratings yet
The Growth of Firms by Alex Coad
208 pages
SPSSTutorial Math Cracker
No ratings yet
SPSSTutorial Math Cracker
43 pages
Improved Comparison of Ili Data and Field Excavations
No ratings yet
Improved Comparison of Ili Data and Field Excavations
4 pages
Rules For Working On AMOS: Rule No.1:: Analysis of Moment Structure (Amos)
100% (1)
Rules For Working On AMOS: Rule No.1:: Analysis of Moment Structure (Amos)
18 pages
QTMS Final Assessment (Spring 2020) PDF
No ratings yet
QTMS Final Assessment (Spring 2020) PDF
6 pages
ECO311 Practice Questions 1
No ratings yet
ECO311 Practice Questions 1
5 pages
Polynomial Regression: Y X X X X XX
No ratings yet
Polynomial Regression: Y X X X X XX
15 pages
Applied Business Forecasting and Planning: Moving Averages and Exponential Smoothing
No ratings yet
Applied Business Forecasting and Planning: Moving Averages and Exponential Smoothing
48 pages
Assignment 2
No ratings yet
Assignment 2
2 pages
Rossmann Sales Prediction Presentation
No ratings yet
Rossmann Sales Prediction Presentation
35 pages
Econometrics 1st Edition K. Nirmal Ravi Kumar instant download
100% (5)
Econometrics 1st Edition K. Nirmal Ravi Kumar instant download
65 pages

hemraj_python_ass1

Uploaded by

hemraj_python_ass1

Uploaded by

Assignment 1: Linear and Logistic Regression

# Step 1: Create the sales dataset

# Step 4: Print the split data

# Step 6: Make predictions

# Print the coefficients

# Step 7: Plot the results

Training set (X_train):

Testing set (X_test):

Coefficients: [0.05023864 0.09843639 0.02031991]

# Step 3: Split the dataset into training and testing sets

# Step 4: Print the split data

# Step 5: Build the linear regression model

# Step 6: Make predictions

# Print the coefficients

# Step 7: Plot the results

# Step 1: Create the User dataset

# Step 2: Encode categorical 'Gender' feature

# Step 4: Split the dataset into training and testing sets

# Step 5: Build the logistic regression model

# Step 6: Make predictions

# Step 7: Print accuracy

# Step 1: Load the fish dataset

# Step 4: Build the linear regression model

# Step 5: Make predictions

# Print the coefficients

# Step 1: Load the Iris dataset

# Step 2: Split the dataset into training and testing sets

# Step 3: Build the logistic regression model

# Step 4: Make predictions

# Step 5: Calculate accuracy

Accuracy of Logistic Regression Model for Iris Dataset: 0.9777777777777777

You might also like