Uber Data Analysis
Uber Data Analysis
Analysis
PRESENTED BY :
UNNATI GOYAL
(181500768)
SAUMYA GUPTA
(181500632)
NANDINEE GUPTA
(181500414)
ROSHNI RAWAT
(181500594)
Aim
Complete Data Analysis and
Exploration of Uber Dataset
4
Data Set:
Kaggle
CSV Format
Shape:
322844,56
5
Libraries
6
Exploratory Data Analysis
Exploratory Data Analysis refers to the critical
process of performing initial investigations on data so
as to discover patterns to spot anomalies to test hypothesis
and to check assumptions with the help of summary statistics
and graphical representations.
8
Strip-plot between Name and Price
9
Label Encoding
Label Encoding refers to converting the labels into
numeric form so as to convert it into the machine-
readable form. Machine learning algorithms can then
decide in a better way on how those labels must be
operated. It is an important pre-processing step for the
structured dataset in supervised learning.
NANs(missing values)
11
Feature Selection
Feature Selection is the process of selecting a subset of
relevant feature (variables, predictors) for use in model
construction.
2 40 0.8050662132
3 25 0.80553551515
4 15 0.8050457819
Final Dataset
14
Modeling
15
Linear Regression: Linear Regression is a supervised machine
learning algorithm where the predicted output is continuous and has a
constant slope. It’s used to predict values within a continuous range.
17
After applying different models on final dataset, we found
different accuracy as given below :-
18
Testing
19
With the help of linear regression and random
forest models, we predict the price, plot a graph
between actual and predicted values and find the
following errors:-
21
Free templates for all your presentation needs
For PowerPoint and 100% free for personal or Ready to use, professional Blow your audience away
Google Slides commercial use and customizable with attractive visuals