0% found this document useful (0 votes)

2 views

ML Regression

The document outlines a series of tasks involving linear and multiple regression using Python's pandas and sklearn libraries. It demonstrates how to read datasets, preprocess data, fit regression models, make predictions, and evaluate model performance. Key examples include predicting home prices based on area and other features, as well as predicting salaries based on interview scores and experience.

Uploaded by

imbilalbaig

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

ML Regression

Uploaded by

imbilalbaig

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Linear Regression

Task 1
In [1]:
import pandas as pd
import numpy as np
from matplotlib import pyplot as plt

In [2]:

df = pd.read_csv("homeprices.csv")
df.head()
Out[2]:

area price

0 2600 550000

1 3000 565000

2 3200 610000

3 3600 680000

4 4000 725000

In [3]:
inputs = df.drop("price",axis='columns')
inputs
Out[3]:

area

0 2600

1 3000

2 3200

3 3600

4 4000

In [4]:

target = df['price']

In [5]:
from sklearn.linear_model import LinearRegression
model = LinearRegression()

In [6]:
model.fit(inputs.values, target.values)

Out[6]:

▾ LinearRegression
LinearRegression()
In [7]:
model.predict([[3300]])
Out[7]:
array([628715.75342466])

In [8]:
model.predict([[5000]])
Out[8]:
array([859554.79452055])

Task 2
In [9]:
df1 = pd.read_csv("areas.csv")
df1.head()
Out[9]:

area

0 1000

1 1500

2 2300

3 3540

4 4120

In [10]:
p = model.predict(df1.values)
p
Out[10]:
array([ 316404.10958904, 384297.94520548, 492928.08219178,
661304.79452055, 740061.64383562, 799808.21917808,
926090.75342466, 650441.78082192, 825607.87671233,
492928.08219178, 1402705.47945205, 1348390.4109589 ,
1144708.90410959])

In [11]:
df1['prices'] = p
df.head()
Out[11]:

area price

0 2600 550000

1 3000 565000

2 3200 610000

3 3600 680000

4 4000 725000

In [12]:
df1.to_csv("Prices.csv",index=False)
In [13]:
df2 = pd.read_csv("Prices.csv")
df2.head()

Out[13]:

area prices

0 1000 316404.109589

1 1500 384297.945205

2 2300 492928.082192

3 3540 661304.794521

4 4120 740061.643836

Task 3
In [14]:
df3 = pd.read_csv("canada_pci.csv")
df3.head()
Out[14]:

year per capita income (US$)

0 1970 3399.299037

1 1971 3768.297935

2 1972 4251.175484

3 1973 4804.463248

4 1974 5576.514583

In [15]:
inputs1 = df3.drop("per capita income (US$)", axis='columns')
inputs1.head()
Out[15]:

year

0 1970

1 1971

2 1972

3 1973

4 1974

In [16]:
target1 = df3['per capita income (US$)']
target1.head()
Out[16]:
0 3399.299037
1 3768.297935
2 4251.175484
3 4804.463248
4 5576.514583
Name: per capita income (US$), dtype: float64

In [17]:
In [17]:
from sklearn.linear_model import LinearRegression
model1 = LinearRegression()

In [18]:
model1.fit(inputs1.values,target1.values)
Out[18]:

▾ LinearRegression
LinearRegression()

In [19]:
model1.predict([[2020]])
Out[19]:
array([41288.69409442])

In [20]:
model1.score(inputs1.values, target1.values)
Out[20]:
0.890916917957032

Day 2 Multiple Regression

In [21]:
import pandas as pd
import numpy as np
import seaborn as sns

In [22]:
df = pd.read_csv("homerates.csv")
df.head()
Out[22]:

area bedrooms age price

0 2600 3.0 20 550000

1 3000 4.0 15 565000

2 3200 NaN 18 610000

3 3600 3.0 30 595000

4 4000 5.0 8 760000

In [23]:
mn = df.bedrooms.mean()
mn
Out[23]:
4.2

In [24]:
df.bedrooms = df.bedrooms.fillna(mn)
df.head()
Out[24]:
area bedrooms age price

0 2600 3.0 20 550000

1 3000 4.0 15 565000

2 3200 4.2 18 610000

3 3600 3.0 30 595000

4 4000 5.0 8 760000

In [25]:
inputs = df.drop('price',axis='columns')
inputs

Out[25]:

area bedrooms age

0 2600 3.0 20

1 3000 4.0 15

2 3200 4.2 18

3 3600 3.0 30

4 4000 5.0 8

5 4100 6.0 8

In [26]:
target = df['price']
target
Out[26]:
0 550000
1 565000
2 610000
3 595000
4 760000
5 810000
Name: price, dtype: int64

In [27]:
sns.pairplot(df)
Out[27]:
<seaborn.axisgrid.PairGrid at 0x22c1e3b25f0>
In [28]:
x = df.corr(numeric_only=True)
x
Out[28]:

area bedrooms age price

area 1.000000 0.740910 -0.445300 0.901476

bedrooms 0.740910 1.000000 -0.873162 0.910005

age -0.445300 -0.873162 1.000000 -0.734167

price 0.901476 0.910005 -0.734167 1.000000

In [29]:
sns.heatmap(x, annot=True, cmap='coolwarm')
Out[29]:
<Axes: >
In [30]:
from sklearn.linear_model import LinearRegression
model = LinearRegression()

In [31]:
model.fit(inputs.values,target.values)
Out[31]:

▾ LinearRegression
LinearRegression()

In [32]:
model.score(inputs.values, target.values)

Out[32]:
0.9540926625396438

In [33]:
model.predict([[300,3,40]])
Out[33]:
array([175825.67757211])

In [34]:
model.predict([[2500,4,5]])
Out[34]:
array([579906.16685223])

Task Multiple Regression

In [35]:
from word2number import w2n

In [36]:
df1 = pd.read_csv("hiring.csv")
df1.head()
Out[36]:

test_score(out of interview_score(out of
experience salary($)
10) 10)

0 NaN 8.0 9 50000

1 NaN 8.0 6 45000

2 five 6.0 7 60000

3 two 10.0 10 65000

4 seven 9.0 6 70000

In [37]:
df1.experience = df1.experience.fillna("zero")

In [38]:
df1['test_score(out of 10)'] = df1['test_score(out of 10)'].fillna(df1['test_score(out of
10)'].mean())

In [39]:
df1.head()
Out[39]:

test_score(out of interview_score(out of
experience salary($)
10) 10)

0 zero 8.0 9 50000

1 zero 8.0 6 45000

2 five 6.0 7 60000

3 two 10.0 10 65000

4 seven 9.0 6 70000

In [40]:
df1.experience = df1.experience.apply(w2n.word_to_num)

In [41]:
df1.head()
Out[41]:

test_score(out of interview_score(out of
experience salary($)
10) 10)

0 0 8.0 9 50000

1 0 8.0 6 45000

2 5 6.0 7 60000

3 2 10.0 10 65000

4 7 9.0 6 70000

In [42]:
inputs1 = df1.drop("salary($)", axis = 'columns')
inputs1
Out[42]:

test_score(out of interview_score(out of
experience
10) 10)

0 0 8.000000 9

1 0 8.000000 6

2 5 6.000000 7

3 2 10.000000 10

4 7 9.000000 6

5 3 7.000000 10

6 10 7.857143 7

7 11 7.000000 8
In [43]:
target1 = df1['salary($)']
target1
Out[43]:
0 50000
1 45000
2 60000
3 65000
4 70000
5 62000
6 72000
7 80000
Name: salary($), dtype: int64

In [44]:
from sklearn.linear_model import LinearRegression
model1 = LinearRegression()

In [45]:
model1.fit(inputs1.values, target1.values)
Out[45]:

▾ LinearRegression
LinearRegression()

In [46]:
model1.predict([[2,9,6]])
Out[46]:
array([53290.89255945])

In [47]:
model1.predict([[12,10,10]])
Out[47]:
array([92268.07227784])

In [ ]:

MUNAR - Linear Regression - Ipynb - Colaboratory
No ratings yet
MUNAR - Linear Regression - Ipynb - Colaboratory
30 pages
ABAP Test Cockpit - Overview
No ratings yet
ABAP Test Cockpit - Overview
18 pages
Expt 7
No ratings yet
Expt 7
3 pages
Regression Algorithm
No ratings yet
Regression Algorithm
9 pages
Exp4(Linear Regression)
No ratings yet
Exp4(Linear Regression)
2 pages
1 - Linear - Regression - Ipynb - Colaboratory
No ratings yet
1 - Linear - Regression - Ipynb - Colaboratory
7 pages
PythonFile[1]
No ratings yet
PythonFile[1]
5 pages
Exercise4 Solution
No ratings yet
Exercise4 Solution
20 pages
Ash Regression
No ratings yet
Ash Regression
11 pages
IoT Task4 21BEC0384
No ratings yet
IoT Task4 21BEC0384
9 pages
Ex No.: Date: Problem Statement
No ratings yet
Ex No.: Date: Problem Statement
3 pages
vertopal.com_2_linear_regression_multivariate
No ratings yet
vertopal.com_2_linear_regression_multivariate
2 pages
2 - Linear - Regression - Multivariate - Ipynb - Colaboratory
No ratings yet
2 - Linear - Regression - Multivariate - Ipynb - Colaboratory
4 pages
01.multiple Linear Regression - Ipynb - Colaboratory
No ratings yet
01.multiple Linear Regression - Ipynb - Colaboratory
10 pages
ML LinearRegression
No ratings yet
ML LinearRegression
10 pages
Prac - 8 (1) - Jupyter Notebook
No ratings yet
Prac - 8 (1) - Jupyter Notebook
6 pages
Copy of Project 4 _ House Price Prediction.ipynb - Colab
No ratings yet
Copy of Project 4 _ House Price Prediction.ipynb - Colab
5 pages
Data Science Record_05
No ratings yet
Data Science Record_05
20 pages
AIML
No ratings yet
AIML
5 pages
Project Linear Regression
No ratings yet
Project Linear Regression
7 pages
T2_summary_VHA
No ratings yet
T2_summary_VHA
14 pages
Mlext
No ratings yet
Mlext
1 page
One Hot Encoding
No ratings yet
One Hot Encoding
12 pages
Deepak Data Analysis 1
No ratings yet
Deepak Data Analysis 1
31 pages
Price Prediction
No ratings yet
Price Prediction
4 pages
Lab Sheet 1
No ratings yet
Lab Sheet 1
6 pages
New Opendocument Text
No ratings yet
New Opendocument Text
7 pages
a
No ratings yet
a
2 pages
f3683849-7ca6-4854-8f96-af11b6e837ec
No ratings yet
f3683849-7ca6-4854-8f96-af11b6e837ec
20 pages
Search Algorithms Python Implementation
No ratings yet
Search Algorithms Python Implementation
6 pages
Train
No ratings yet
Train
17 pages
USA Real Estate Price Prediction Using Decision Tree Regressor, and AdaBoost Regressor
No ratings yet
USA Real Estate Price Prediction Using Decision Tree Regressor, and AdaBoost Regressor
14 pages
ml2020 Pythonlab02
No ratings yet
ml2020 Pythonlab02
3 pages
Docu 4
No ratings yet
Docu 4
3 pages
1_Lab Manual (ML)
No ratings yet
1_Lab Manual (ML)
42 pages
Report
No ratings yet
Report
40 pages
unit 3 5
No ratings yet
unit 3 5
4 pages
Predicting House Prices
No ratings yet
Predicting House Prices
9 pages
houses prices prediction model
No ratings yet
houses prices prediction model
11 pages
Experiment Number: 3: Aim:-Study of The Linear Regression in The Machine Learning Using The Boston Housing Dataset. 1)
No ratings yet
Experiment Number: 3: Aim:-Study of The Linear Regression in The Machine Learning Using The Boston Housing Dataset. 1)
14 pages
MachineLearning
No ratings yet
MachineLearning
10 pages
5 - One - Hot - Encoding - Ipynb - Colaboratory
No ratings yet
5 - One - Hot - Encoding - Ipynb - Colaboratory
8 pages
Week 6 LAB
No ratings yet
Week 6 LAB
13 pages
Code 1
No ratings yet
Code 1
3 pages
Boston Housing Kaggle Challenge With Linear Regression
No ratings yet
Boston Housing Kaggle Challenge With Linear Regression
3 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
4 pages
Document From Jahnavi
No ratings yet
Document From Jahnavi
20 pages
Lab 3 - Linear Regression
No ratings yet
Lab 3 - Linear Regression
15 pages
178 - Regulinear - Ipynb - Colab
No ratings yet
178 - Regulinear - Ipynb - Colab
3 pages
DSBDAL_Assignment no 4
No ratings yet
DSBDAL_Assignment no 4
15 pages
regression analysis on the Boston house price dataset for house price prediction
No ratings yet
regression analysis on the Boston house price dataset for house price prediction
2 pages
C1 W1 Lab03 Model Representation Soln-Copy1
No ratings yet
C1 W1 Lab03 Model Representation Soln-Copy1
7 pages
Sesi 4-2B Linear Regression With Python - Jupyter Notebook
No ratings yet
Sesi 4-2B Linear Regression With Python - Jupyter Notebook
12 pages
Phase 5
No ratings yet
Phase 5
5 pages
Emllab
No ratings yet
Emllab
6 pages
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
No ratings yet
SiddharthShah 1032221195 DivC 50 DL LabAssignment2
7 pages
ml exp-5,6 (1)[1] (1)
No ratings yet
ml exp-5,6 (1)[1] (1)
6 pages
Linear Regression Using Python
No ratings yet
Linear Regression Using Python
18 pages
5 Multiple Linear Regression
No ratings yet
5 Multiple Linear Regression
2 pages
C1 W1 Lab02 Model Representation Soln
No ratings yet
C1 W1 Lab02 Model Representation Soln
5 pages
Develop Snakes & Ladders Game Complete Guide with Code & Design
From Everand
Develop Snakes & Ladders Game Complete Guide with Code & Design
Anurag Pandey
No ratings yet
CS210 Solutions For Quiz 1-3 Quiz 1: 1. Programming Model
No ratings yet
CS210 Solutions For Quiz 1-3 Quiz 1: 1. Programming Model
16 pages
ICT SA3 Practical
No ratings yet
ICT SA3 Practical
10 pages
North Eastern Mindanao State University: College of Information Technology Education Department of Computer Studies
No ratings yet
North Eastern Mindanao State University: College of Information Technology Education Department of Computer Studies
12 pages
Rogers-Automation Platforms: (PROJECT ID#30763) Knowledge Transfer Session#1
No ratings yet
Rogers-Automation Platforms: (PROJECT ID#30763) Knowledge Transfer Session#1
22 pages
CORBA: Overview: ICS 199 Michael Le
No ratings yet
CORBA: Overview: ICS 199 Michael Le
19 pages
Msi Assignment 2 (Fa22-Bee-014)
No ratings yet
Msi Assignment 2 (Fa22-Bee-014)
5 pages
Oraganize Chapter 1 To 2
No ratings yet
Oraganize Chapter 1 To 2
29 pages
Transferring Data With Variable Message Lengths Via The TCP Protocol
No ratings yet
Transferring Data With Variable Message Lengths Via The TCP Protocol
22 pages
Android Online Test 2
No ratings yet
Android Online Test 2
19 pages
C-Api in Python
No ratings yet
C-Api in Python
162 pages
Python Week 4 All GrPA's Solutions
100% (1)
Python Week 4 All GrPA's Solutions
8 pages
9 Javascript Exercises
No ratings yet
9 Javascript Exercises
4 pages
Closable Tab Control in WPF
No ratings yet
Closable Tab Control in WPF
9 pages
Chapter 2 - Introduction To Python
No ratings yet
Chapter 2 - Introduction To Python
44 pages
Minishell: As Beautiful As A Shell
0% (1)
Minishell: As Beautiful As A Shell
7 pages
C# Keywords and Definitions With Examples
50% (2)
C# Keywords and Definitions With Examples
130 pages
Chapter 1: An Introduction To Professionalism: Objectives
No ratings yet
Chapter 1: An Introduction To Professionalism: Objectives
12 pages
Fybsc-It Sem2 Oop Apr19
No ratings yet
Fybsc-It Sem2 Oop Apr19
2 pages
Syllabus
No ratings yet
Syllabus
83 pages
Report of Socket Programming Assignment
50% (2)
Report of Socket Programming Assignment
9 pages
Ganesh Khalkar
No ratings yet
Ganesh Khalkar
8 pages
Constraint Satisfaction Problems: Basic Algorithms
No ratings yet
Constraint Satisfaction Problems: Basic Algorithms
57 pages
SRS Part 1
No ratings yet
SRS Part 1
29 pages
Cse V Operating Systems Notes - Part2
No ratings yet
Cse V Operating Systems Notes - Part2
72 pages
OOP With Java
No ratings yet
OOP With Java
35 pages
Entity Relationship Diagram - New
No ratings yet
Entity Relationship Diagram - New
10 pages
Tkinter Menubutton
No ratings yet
Tkinter Menubutton
5 pages
Oracle SQL Notes
No ratings yet
Oracle SQL Notes
3 pages
Es Module 4
No ratings yet
Es Module 4
29 pages