0% found this document useful (0 votes)

32 views3 pages

Data Analysis with Pandas & Matplotlib

sdadg

Uploaded by

dharshinipugalenthi56

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views3 pages

Data Analysis with Pandas & Matplotlib

sdadg

Uploaded by

dharshinipugalenthi56

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

To load a dataset from a CSV file using Pandas, you'll need to ensure that the file exists

in the specified directory. Here's a complete example that demonstrates how to load the
dataset, perform some basic operations, and visualize the data using Matplotlib.

Let's assume that `[Link]` contains columns `YearsExperience` and `Salary`.

Step-by-Step Example

#### 1. Import Libraries

import numpy as np
import pandas as pd
import [Link] as plt

#### 2. Load the Dataset

Make sure `[Link]` is in the same directory as your script, or provide the full path to
the file.

# Load the dataset

dataset = pd.read_csv('[Link]')

# Display the first few rows of the dataset

print([Link]())
or
[Link]() ( also tail , info , shape , size , describe)

#### 3. Explore the Dataset

# Display basic information about the dataset

print([Link]())

# Display summary statistics

print([Link]())

#### 4. Visualize the Data

Create a scatter plot to visualize the relationship between `YearsExperience` and

`Salary`.

# Scatter plot of YearsExperience vs Salary

[Link](dataset['YearsExperience'], dataset['Salary'], color='blue')

# Adding title and labels
[Link]('Years of Experience vs Salary')
[Link]('Years of Experience')
[Link]('Salary')

# Display the plot

[Link]()

#### 5. Perform Regression Analysis

Let's perform a simple linear regression to predict Salary based on Years of Experience.

from sklearn.model_selection import train_test_split

// sklearn.model_selection is used to split your dataset into training and testing sets//

from sklearn.linear_model import LinearRegression

// LinearRegression to perform a linear regression analysis on a dataset, split the data into
training and testing sets, train the model, make predictions, and evaluate the model.
from [Link] import mean_squared_error, r2_score

//The mean_squared_error and r2_score functions from [Link] are used to

evaluate the performance of a regression model.

 Mean Squared Error (MSE): Measures the average squared difference between
the actual and predicted values. Lower values are better.
 R-squared (R²) score: Represents the proportion of variance in the dependent
variable that is predictable from the independent variable(s). Higher values
(closer to 1) are better.

# Define the features (X) and target (y)

X = dataset[['YearsExperience']]
y = dataset['Salary']

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

//  X: The feature(s) of the dataset. In this case, it is YearsExperience.

 y: The target variable. In this case, it is Salary.
 test_size=0.2: 20% of the data will be used as the test set.
 random_state=42: Ensures reproducibility of the split. Using the same random state
will always produce the same split.
# Create a Linear Regression model to Train a Linear Regression Mode
model = LinearRegression()

# Train the model

[Link](X_train, y_train)

# Make predictions on the test set

y_pred = [Link](X_test)

# Evaluate the print('Mean Squared Error:', mse)

print('R-squared:', r2)
model
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)

# Plot the regression line

[Link](X, y, color='blue')
[Link](X, [Link](X), color='red', linewidth=2)

# Adding title and labels

[Link]('Years of Experience vs Salary (with Regression Line)')
[Link]('Years of Experience')
[Link]('Salary')

# Display the plot

[Link]()
```

Python Simple Linear Regression Guide
No ratings yet
Python Simple Linear Regression Guide
14 pages
Experiment No.8
No ratings yet
Experiment No.8
5 pages
ML Updated File
No ratings yet
ML Updated File
36 pages
Salary Prediction with Linear Regression
No ratings yet
Salary Prediction with Linear Regression
5 pages
2.1 ML (Implementation of Simple Linear Regression in Python)
No ratings yet
2.1 ML (Implementation of Simple Linear Regression in Python)
8 pages
MLP Regressor with Sklearn on Wine Data
No ratings yet
MLP Regressor with Sklearn on Wine Data
10 pages
Salary Prediction with Linear Regression
No ratings yet
Salary Prediction with Linear Regression
7 pages
Python Linear Regression Lab Guide
No ratings yet
Python Linear Regression Lab Guide
5 pages
Simple Linear Regression with Python
No ratings yet
Simple Linear Regression with Python
7 pages
Predicting Salary with Experience
100% (1)
Predicting Salary with Experience
7 pages
Simple Linear Regression in Python
No ratings yet
Simple Linear Regression in Python
7 pages
Simple Linear Regression in Python
No ratings yet
Simple Linear Regression in Python
45 pages
Salary Prediction Using Regression
No ratings yet
Salary Prediction Using Regression
8 pages
Simple Linear Regression Explained
No ratings yet
Simple Linear Regression Explained
9 pages
Lecture-2 Unit 2
No ratings yet
Lecture-2 Unit 2
56 pages
ML 1-11
No ratings yet
ML 1-11
27 pages
Linear Regression
No ratings yet
Linear Regression
11 pages
Linear - Regression - Ipynb - Colaboratory
No ratings yet
Linear - Regression - Ipynb - Colaboratory
4 pages
Python 1
No ratings yet
Python 1
3 pages
Linear Regression Salary Prediction
No ratings yet
Linear Regression Salary Prediction
2 pages
Machine Learning Hands-On
100% (1)
Machine Learning Hands-On
18 pages
9.2. Data Science - Machine Learning - Simple Linear Regression - Example
No ratings yet
9.2. Data Science - Machine Learning - Simple Linear Regression - Example
10 pages
Simple Linear Regression with Python
No ratings yet
Simple Linear Regression with Python
30 pages
Salary Prediction with Linear Regression
No ratings yet
Salary Prediction with Linear Regression
4 pages
Salary Prediction Using Experience Data
No ratings yet
Salary Prediction Using Experience Data
4 pages
Ai 28-01-25
No ratings yet
Ai 28-01-25
18 pages
Handle Missing Data in Real-Time
No ratings yet
Handle Missing Data in Real-Time
5 pages
Understanding Linear Regression Basics
No ratings yet
Understanding Linear Regression Basics
20 pages
Data Science for Beginners
No ratings yet
Data Science for Beginners
98 pages
Linear and Logistic Regression Guide
No ratings yet
Linear and Logistic Regression Guide
10 pages
Machine Learning Model Evaluations
No ratings yet
Machine Learning Model Evaluations
11 pages
Exp 2 ML
No ratings yet
Exp 2 ML
4 pages
Machine Learning Lab: Python Libraries
No ratings yet
Machine Learning Lab: Python Libraries
12 pages
Unit 2 Notes
No ratings yet
Unit 2 Notes
16 pages
Da Rec
No ratings yet
Da Rec
29 pages
Linear Regression Salary Prediction Guide
No ratings yet
Linear Regression Salary Prediction Guide
1 page
Salaries For San Francisco Employee
No ratings yet
Salaries For San Francisco Employee
30 pages
Linear Regression Model in Python
No ratings yet
Linear Regression Model in Python
4 pages
Lab Experiment 4 - AI
No ratings yet
Lab Experiment 4 - AI
7 pages
Regression
No ratings yet
Regression
16 pages
21BEI052 2EI503 ML SpecialAssignmentReport
No ratings yet
21BEI052 2EI503 ML SpecialAssignmentReport
12 pages
Generative AI For Models Development
No ratings yet
Generative AI For Models Development
8 pages
Simple - Linear - Regression - Ipynb - Colaboratory
No ratings yet
Simple - Linear - Regression - Ipynb - Colaboratory
2 pages
Data Exploration and Preprocessing in Python
No ratings yet
Data Exploration and Preprocessing in Python
20 pages
SF Employee Salary Analysis Project
No ratings yet
SF Employee Salary Analysis Project
33 pages
Python Machine Learning Techniques Guide
No ratings yet
Python Machine Learning Techniques Guide
13 pages
LDA and Linear Regression Implementation
No ratings yet
LDA and Linear Regression Implementation
21 pages
Linear Regression 2
No ratings yet
Linear Regression 2
3 pages
BCA Machine Learning Practical Guide
No ratings yet
BCA Machine Learning Practical Guide
59 pages
Da Lab Mannual
No ratings yet
Da Lab Mannual
25 pages
Data Importing and Analysis Cheat Sheet
No ratings yet
Data Importing and Analysis Cheat Sheet
4 pages
NumPy Statistical Functions in Python
No ratings yet
NumPy Statistical Functions in Python
7 pages
ML Prac 1
No ratings yet
ML Prac 1
4 pages
Employee Retention Analysis with Python
No ratings yet
Employee Retention Analysis with Python
9 pages
Salary Prediction Analysis with Python
No ratings yet
Salary Prediction Analysis with Python
2 pages
Data Preprocesing JavaPoint
No ratings yet
Data Preprocesing JavaPoint
19 pages
Etl and Stats Code
No ratings yet
Etl and Stats Code
2 pages
SML Practicals
No ratings yet
SML Practicals
4 pages
Machine Learning Laboratory (BTCS619-18) B.Tech Cse 6Th 2024 EVEN
No ratings yet
Machine Learning Laboratory (BTCS619-18) B.Tech Cse 6Th 2024 EVEN
29 pages
Probability Analysis of FIFA 2010 Goals
No ratings yet
Probability Analysis of FIFA 2010 Goals
10 pages
2.1972 Generalized Linear Models Nelder Wedderburn
No ratings yet
2.1972 Generalized Linear Models Nelder Wedderburn
16 pages
Linear Programming Basics and Examples
No ratings yet
Linear Programming Basics and Examples
17 pages
Nash Equilibrium in Horse Racing
No ratings yet
Nash Equilibrium in Horse Racing
2 pages
Experemental Research 2
No ratings yet
Experemental Research 2
14 pages
Monte Carlo Simulation Explained
No ratings yet
Monte Carlo Simulation Explained
3 pages
MPhil Statistics Course Outline 2018-19
No ratings yet
MPhil Statistics Course Outline 2018-19
4 pages
ch08 Portfolio Selection
No ratings yet
ch08 Portfolio Selection
24 pages
Exercise CH 14 - Adella Rosita P - 2106631223
No ratings yet
Exercise CH 14 - Adella Rosita P - 2106631223
4 pages
Empirical Methods for Finance Students
No ratings yet
Empirical Methods for Finance Students
80 pages
Comparing Exponential Smoothing Methods
No ratings yet
Comparing Exponential Smoothing Methods
6 pages
Econometrics MOOC: Logit Model Analysis
0% (1)
Econometrics MOOC: Logit Model Analysis
3 pages
US25 Stringham
No ratings yet
US25 Stringham
40 pages
Linear vs. Nonlinear Objective Functions
No ratings yet
Linear vs. Nonlinear Objective Functions
6 pages
Quantitative Techniques Assignment-2: One Sample T-Test
No ratings yet
Quantitative Techniques Assignment-2: One Sample T-Test
8 pages
Chi-Square and ANOVA Statistical Tests
No ratings yet
Chi-Square and ANOVA Statistical Tests
2 pages
Understanding ANOVA for Population Means
No ratings yet
Understanding ANOVA for Population Means
8 pages
Pengaruh Modal Dan Tenaga Kerja Terhadap Tenun Ikatkl
No ratings yet
Pengaruh Modal Dan Tenaga Kerja Terhadap Tenun Ikatkl
98 pages
Econometrics Assignment Overview
100% (3)
Econometrics Assignment Overview
8 pages
Black-Scholes Made Easy
No ratings yet
Black-Scholes Made Easy
96 pages
CH06 - Wooldridge - 7e PPT - 2pp
100% (1)
CH06 - Wooldridge - 7e PPT - 2pp
17 pages
Demand Forecasting Method
No ratings yet
Demand Forecasting Method
19 pages
Approaches To Probability
No ratings yet
Approaches To Probability
4 pages
Module 4 Excel Utility II: Hypothesis Tests For Two Populations
No ratings yet
Module 4 Excel Utility II: Hypothesis Tests For Two Populations
5 pages
Greeks and Volatility Smile
100% (1)
Greeks and Volatility Smile
51 pages
Subgame Analysis and Cournot Duopoly
No ratings yet
Subgame Analysis and Cournot Duopoly
2 pages
Week4 - 1
No ratings yet
Week4 - 1
18 pages
Decision-Making Strategies for Engineers
No ratings yet
Decision-Making Strategies for Engineers
25 pages
R.C. Coleman Project Scheduling Analysis
100% (1)
R.C. Coleman Project Scheduling Analysis
5 pages
Data Analysis and Processing Overview
100% (1)
Data Analysis and Processing Overview
30 pages