Unit2 - Pandas - Jupyter Notebook

Uploaded by

neerajboggavarapu098

04/05/2023, 10:44 pandas - Jupyter Notebook

pandas
pandas (the name is derived from "panel data") is a core Python library for data manipulation and data analysis.

It provides one-dimensional and multidimensional data structures for data manipulation.

pandas is a Python library used for working with data sets. It is:

a high-performance data analysis tool
suited to working with large data sets
able to represent data in tabular form
able to handle missing data

Three data structures in pandas

1. Series: one-dimensional

2. DataFrame: two-dimensional
3. Panel: multidimensional (data, major axis, minor axis). Panel was removed in pandas 0.25; in current versions use a DataFrame with a MultiIndex instead.
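
A minimal sketch of the first two structures, using made-up values and column names purely for illustration. The `ndim` attribute reports how many dimensions each object has:

```python
import pandas as pd

# A Series is one-dimensional: a single labelled column of values.
s = pd.Series([10, 20, 30])

# A DataFrame is two-dimensional: rows and named columns.
# The column names "a" and "b" are illustrative only.
df = pd.DataFrame({"a": [1, 2, 3], "b": [4.0, 5.0, 6.0]})

print(s.ndim, s.shape)    # 1 (3,)
print(df.ndim, df.shape)  # 2 (3, 2)
```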

Create a pandas Series

In [1]:

import pandas as pd
import numpy as np

In [2]:

arr = np.array([1,2,3,4])
print(arr)

[1 2 3 4]

In [3]:

s = pd.Series(arr)
print(s)
print(type(s))

0 1
1 2
2 3
3 4
dtype: int64
<class 'pandas.core.series.Series'>

localhost:8888/notebooks/anaconda3/Python/pandas.ipynb 1/10

In [4]:

print(s[0:5])

0 1
1 2
2 3
3 4
dtype: int64
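
The slice above asks for five items from a four-item Series and still succeeds. A small sketch of why, assuming the same four values: slices are positional and clip at the end of the data.

```python
import numpy as np
import pandas as pd

s = pd.Series(np.array([1, 2, 3, 4]))

# An integer slice is positional and stops at the end of the data,
# so requesting 5 items from a 4-item Series raises no error.
clipped = s[0:5]
print(len(clipped))  # 4

# .iloc states the positional intent explicitly.
middle = s.iloc[1:3].tolist()
print(middle)  # [2, 3]
```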

In [ ]:

a = pd.Series(['a','b','c'])  # define a first; running a[2] before this cell raised NameError

In [ ]:

a[2]

In [ ]:

a = pd.date_range(start = '2023-03-01', end = '2023-03-28')

In [ ]:

type(a)
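
A short sketch of what `date_range` returns and one way it might be used. The `readings` Series and its values are hypothetical, added only to show a date-labelled index:

```python
import pandas as pd

dates = pd.date_range(start="2023-03-01", end="2023-03-28")
print(type(dates).__name__)  # DatetimeIndex
print(len(dates))            # 28 (both endpoints are included)

# A DatetimeIndex can label the rows of a Series (hypothetical values 0..27).
readings = pd.Series(range(len(dates)), index=dates)
fifth = readings.loc[pd.Timestamp("2023-03-05")]
print(fifth)  # 4
```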

pandas DataFrame

In [ ]:

arr = np.array([[1,2,3],[4,5,6]])
print(arr)


In [ ]:

df = pd.DataFrame(arr)
print(df)

In [ ]:

temp = np.random.randint(low = 20, high =100, size = [20,])

name = np.random.choice(['Abhay','Teclov','Geekshub','Ankit'],20)
random = np.random.choice([10,11,13,12,14],20)

In [ ]:

df = pd.DataFrame({"Temp":temp,"Name":name,"Random":random})
df

In [ ]:

a = list(zip(temp, name, random))

print(a)

In [ ]:

df = pd.DataFrame(data = a, columns=['Temp','Name','Random'])

In [ ]:

type(df)
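
The two construction routes used above (a dict of columns, and a list of row tuples with explicit column names) should produce the same table. A minimal sketch with a seeded generator so the result is repeatable; the seed and the reduced size of 5 are assumptions for illustration:

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)  # seeded so the sketch is repeatable
temp = rng.integers(low=20, high=100, size=5)
name = rng.choice(["Abhay", "Teclov", "Geekshub", "Ankit"], 5)

# Route 1: dict mapping column name -> array of values
df_dict = pd.DataFrame({"Temp": temp, "Name": name})

# Route 2: list of row tuples plus explicit column names
rows = list(zip(temp, name))
df_rows = pd.DataFrame(data=rows, columns=["Temp", "Name"])

same_shape = df_dict.shape == df_rows.shape
same_temp = df_dict["Temp"].tolist() == df_rows["Temp"].tolist()
same_name = list(df_dict["Name"]) == list(df_rows["Name"])
print(same_shape, same_temp, same_name)  # True True True
```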

In [ ]:

temp = np.random.randint(low = 20, high =100, size = [20,])

name = np.random.choice(['Abhay','Teclov','Geekshub','Ankit'],20)
random = np.random.choice([10,11,13,12,14],20)

In [ ]:

df = pd.DataFrame({'temp':temp, 'name':name, 'random':random})

In [ ]:

type(df)

In [ ]:

df.head()

In [ ]:

df.tail()


In [ ]:

df.shape

In [ ]:

df.columns

In [ ]:

df.name

In [ ]:

df['name']

In [ ]:

df['temp'].describe()

In [ ]:

df.info()

In [ ]:

df.values

In [ ]:

df.set_index('temp', inplace = True)

In [ ]:

df.sort_index(axis =0, ascending=False)

In [ ]:

df.sort_values(by ='random', ascending = False)

In [ ]:

df.drop(['random'], axis =1)
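
Note that only `set_index(..., inplace=True)` changes `df` itself in the cells above; `sort_index`, `sort_values` and `drop` return new objects. A small sketch with three hand-picked rows (the values are illustrative, not from the random data above):

```python
import pandas as pd

df = pd.DataFrame(
    {"temp": [39, 84, 34], "name": ["Abhay", "Ankit", "Teclov"], "random": [10, 14, 12]}
)
df.set_index("temp", inplace=True)  # modifies df itself

# sort_values and drop return new objects; df is unchanged unless reassigned.
sorted_df = df.sort_values(by="random", ascending=False)
dropped = df.drop(["random"], axis=1)

print(list(df.columns))          # ['name', 'random']
print(list(dropped.columns))     # ['name']
print(sorted_df.index.tolist())  # [84, 34, 39]
```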


In [ ]:

df.head()

In [ ]:

df.iloc[[0,1]]

In [ ]:

df.iloc[1:3,1]

In [ ]:

df.iloc[[True, True, False] + [False] * (len(df) - 3)]  # a boolean mask needs one entry per row

In [ ]:

df.head()

In [ ]:

df.loc[:,:]

In [ ]:

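df.loc[[39,84,34]]  # 39, 84, 34 were temp values in one run of the random data; use labels present in your df.index

In [ ]:

df.loc[[39,84],'name':'random']  # same caveat: the row labels must exist in df.index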

In [ ]:

df.loc[[True, True, False, True] + [False] * (len(df) - 4)]  # a boolean mask needs one entry per row

In [ ]:

df.loc[df.random > 13]

In [ ]:

df.loc[(df.random > 13) | (df.random == 10),:]
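
To make the `iloc` and `loc` cells above reproducible, here is a sketch on a fixed four-row frame (the rows are made up; the real notebook uses random data). `iloc` selects by position, `loc` by label or boolean mask:

```python
import pandas as pd

df = pd.DataFrame(
    {"name": ["Abhay", "Ankit", "Teclov", "Geekshub"], "random": [10, 14, 12, 13]},
    index=pd.Index([39, 84, 34, 61], name="temp"),
)

# Position-based: the first two rows, whatever their labels are.
by_position = df.iloc[[0, 1]].index.tolist()
print(by_position)  # [39, 84]

# Label-based: rows whose index label is 84 or 34.
by_label = df.loc[[84, 34]].index.tolist()
print(by_label)  # [84, 34]

# A boolean mask must have exactly one entry per row.
mask = [True, True, False, True]
by_mask = df.loc[mask].index.tolist()
print(by_mask)  # [39, 84, 61]

# Conditions are combined with | (or) and & (and), each wrapped in parentheses.
picked = df.loc[(df["random"] > 13) | (df["random"] == 10), :]
print(picked.index.tolist())  # [39, 84]
```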

In [ ]:

# Merging & concat

d1 = pd.DataFrame([['a', 1], ['b', 2]],columns=['col1', 'number'])
d2 = pd.DataFrame([['c', 3, 'lion'], ['d', 4, 'tiger']],columns=['letter', 'numbe

In [ ]:

pd.concat([d1,d2],axis =0)

In [ ]:

pd.concat([d1,d2], axis =0, ignore_index=True)

In [ ]:

pd.concat([d1,d2], axis = 1)
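
A self-contained sketch of the three `concat` calls above. The frames `left` and `right` are hypothetical stand-ins with matching columns, chosen so the effect of `axis` and `ignore_index` is easy to see:

```python
import pandas as pd

left = pd.DataFrame([["a", 1], ["b", 2]], columns=["letter", "number"])
right = pd.DataFrame([["c", 3], ["d", 4]], columns=["letter", "number"])

# axis=0 stacks rows and keeps each frame's original index labels.
stacked = pd.concat([left, right], axis=0)
print(stacked.index.tolist())  # [0, 1, 0, 1]

# ignore_index=True renumbers the rows 0..n-1.
renumbered = pd.concat([left, right], axis=0, ignore_index=True)
print(renumbered.index.tolist())  # [0, 1, 2, 3]

# axis=1 places the frames side by side, aligned on the row index.
side_by_side = pd.concat([left, right], axis=1)
print(side_by_side.shape)  # (2, 4)
```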

In [ ]:

d1 = pd.DataFrame({
"city" : ["lucknow","kanpur","agra","delhi"],
"temperature" : [32,45,30,40]
})

In [ ]:

d2 = pd.DataFrame({
"city" : ["delhi","lucknow","kanpur"],
"humidity" : [68,65,75]
})

In [ ]:

df = pd.merge(d1,d2, on='city')

In [ ]:

pd.merge(d1,d2, on=['city'], how ='outer')

In [ ]:

pd.merge(d1, d2, on =['city'], how='left')
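
The three merges above differ only in which cities survive. A sketch using the same two frames; the variable names `inner`, `outer` and `left` are introduced here for illustration:

```python
import pandas as pd

d1 = pd.DataFrame({
    "city": ["lucknow", "kanpur", "agra", "delhi"],
    "temperature": [32, 45, 30, 40],
})
d2 = pd.DataFrame({
    "city": ["delhi", "lucknow", "kanpur"],
    "humidity": [68, 65, 75],
})

inner = pd.merge(d1, d2, on="city")               # default how="inner": cities in both
outer = pd.merge(d1, d2, on="city", how="outer")  # cities in either frame
left = pd.merge(d1, d2, on="city", how="left")    # every city from d1

print(len(inner), len(outer), len(left))  # 3 4 4

# agra has no row in d2, so its humidity is missing (NaN) in the left join.
agra_missing = left.loc[left["city"] == "agra", "humidity"].isna().all()
print(agra_missing)  # True
```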


In [ ]:

# dataset from https://github.com/codebasics/py/blob/master/pandas/6_handling_miss

In [ ]:

df1 = pd.read_csv("weather_data.csv")

In [ ]:

df1

In [ ]:

# pip3 install openpyxl

df1.to_excel('df_xl.xlsx', sheet_name = 'weather_data')

In [ ]:

# reading .xlsx files uses openpyxl (xlrd 2.0 and later only reads legacy .xls files)

df2 = pd.read_excel('df_xl.xlsx')

In [ ]:

df2

In [ ]:

df2.to_csv('file.csv')

In [ ]:

df2.to_csv('file_noindex.csv', index = False)
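
The difference between the two `to_csv` calls above shows up when the file is read back. A sketch that writes to in-memory buffers instead of disk; the two-row frame is made up for illustration:

```python
import io
import pandas as pd

df = pd.DataFrame({"day": ["d1", "d2"], "temperature": [32, 35]})

# Default: the row index is written as an unnamed first column.
with_index = io.StringIO()
df.to_csv(with_index)
with_index.seek(0)
cols_with = pd.read_csv(with_index).columns.tolist()
print(cols_with)  # ['Unnamed: 0', 'day', 'temperature']

# index=False leaves the row index out of the file.
no_index = io.StringIO()
df.to_csv(no_index, index=False)
no_index.seek(0)
cols_without = pd.read_csv(no_index).columns.tolist()
print(cols_without)  # ['day', 'temperature']
```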

In [ ]:

df_group = df2.groupby("event")

In [ ]:

df_group

In [ ]:

for event, group in df_group:  # each item is a (key, sub-DataFrame) pair
    print(event)
    print(group)

In [ ]:

df_group.get_group('Rain')

In [ ]:

df_group.describe()
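
Since `weather_data.csv` is not included here, the following sketch uses a small hypothetical frame with the same `event` and `temperature` column names to show what iterating a groupby and calling `get_group` return:

```python
import pandas as pd

# Hypothetical stand-in for the weather data.
weather = pd.DataFrame({
    "event": ["Rain", "Sunny", "Rain", "Snow"],
    "temperature": [28, 35, 30, 24],
})
groups = weather.groupby("event")

# Iterating yields (key, sub-DataFrame) pairs, with keys in sorted order.
sizes = {event: len(group) for event, group in groups}
print(sizes)  # {'Rain': 2, 'Snow': 1, 'Sunny': 1}

# get_group returns the rows for a single key.
rain = groups.get_group("Rain")
print(rain["temperature"].tolist())  # [28, 30]

# Aggregations are computed per group.
rain_mean = groups["temperature"].mean()["Rain"]
print(rain_mean)  # 29.0
```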


In [ ]:

def hot_temp(x):
    return x > 30

In [ ]:

df2['hot_temp'] = df2['temperature'].apply(hot_temp)

In [ ]:

df2

In [ ]:

df2['hot_temp'] = df2['temperature'].apply(lambda x: x > 30)
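
For a simple comparison like this, `apply` is not strictly needed: comparing the whole column at once should give the same flags. A sketch on made-up temperatures:

```python
import pandas as pd

temps = pd.Series([28, 35, 30, 31])  # illustrative values

via_apply = temps.apply(lambda x: x > 30)  # calls the function once per element
vectorized = temps > 30                    # compares the whole column at once

print(vectorized.tolist())  # [False, True, False, True]
same = via_apply.tolist() == vectorized.tolist()
print(same)  # True
```

The vectorized form is usually the more idiomatic choice; `apply` remains useful when the per-row logic cannot be written as column operations.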

In [ ]:

df2

In [ ]:

#pivot table

In [ ]:

df2.pivot_table(values = 'temperature', index = 'event', aggfunc = 'mean')
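
A pivot table with a single `index` column and `aggfunc='mean'` is closely related to a groupby mean. A sketch on a hypothetical frame that reuses the `event` and `temperature` column names:

```python
import pandas as pd

# Hypothetical stand-in for the weather data.
weather = pd.DataFrame({
    "event": ["Rain", "Sunny", "Rain", "Snow"],
    "temperature": [28, 35, 30, 24],
})

pivot = weather.pivot_table(values="temperature", index="event", aggfunc="mean")
grouped = weather.groupby("event")["temperature"].mean()

print(pivot["temperature"].to_dict())  # {'Rain': 29.0, 'Snow': 24.0, 'Sunny': 35.0}
same = pivot["temperature"].tolist() == grouped.tolist()
print(same)  # True
```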

In [ ]:

df2.pivot_table(columns = 'temperature')

In [ ]:

help(pd.DataFrame.pivot_table)

In [ ]:

df2.to_csv("/home/apiiit-rkv/Desktop/dsp unit-3")  # the original cell used df3, which is never defined in this notebook; df2 is assumed

In [ ]:

import pandas as pd

In [ ]:

df = pd.read_excel("/home/apiiit-rkv/Desktop/marks.xlsx")  # read_excel already returns a DataFrame
df


Correlation

Correlation coefficients quantify the association between variables or features of a dataset. These statistics are of high importance for science and technology, and Python has tools that you can use to calculate them. SciPy, NumPy, and pandas correlation methods are fast, comprehensive, and well-documented.

What Pearson, Spearman, and Kendall correlation coefficients are
How to use SciPy, NumPy, and pandas correlation functions
How to visualize data, regression lines, and correlation matrices with Matplotlib

1. Negative correlation (red dots): In the plot on the left, the y values tend to decrease as the x values increase. This shows strong negative correlation, which occurs when large values of one feature correspond to small values of the other, and vice versa.

2. Weak or no correlation (green dots): The plot in the middle shows no obvious trend. This is a form of weak correlation, which occurs when an association between two features is not obvious or is hardly observable.

3. Positive correlation (blue dots): In the plot on the right, the y values tend to increase as the x values increase. This illustrates strong positive correlation, which occurs when large values of one feature correspond to large values of the other, and vice versa.
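
The three cases can be reproduced numerically. A minimal sketch with hand-made series (the data is an assumption chosen to give clear-cut results, not the data behind the plots described above):

```python
import pandas as pd

x = pd.Series(range(10))

y_negative = 20 - 2 * x          # falls steadily as x rises
y_positive = 3 * x + 1           # rises steadily as x rises
y_weak = pd.Series([1, -1] * 5)  # alternates, with no clear trend

r_neg = x.corr(y_negative)
r_weak = x.corr(y_weak)
r_pos = x.corr(y_positive)

print(round(r_neg, 3), round(r_weak, 3), round(r_pos, 3))  # -1.0 -0.174 1.0
```

Values near -1 or +1 indicate a strong linear relationship; values near 0 indicate a weak one.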

In [ ]:

import pandas as pd
x = pd.Series(range(10, 20))
x

In [ ]:

y = pd.Series([2, 1, 4, 5, 8, 12, 18, 25, 96, 48])

In [ ]:

x.corr(y) # Pearson's r

In [ ]:

y.corr(x)

In [ ]:

x.corr(y, method='spearman')  # Spearman's rho

In [ ]:

x.corr(y, method='kendall')  # Kendall's tau
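
To see what these calls compute, the sketch below rebuilds Pearson's r from its definition and obtains Spearman's rho as Pearson's r on the ranks, using the same `x` and `y`. The helper names `r_manual` and `rho_manual` are introduced here for illustration:

```python
import numpy as np
import pandas as pd

x = pd.Series(range(10, 20))
y = pd.Series([2, 1, 4, 5, 8, 12, 18, 25, 96, 48])

# Pearson's r: sum of products of deviations from the mean,
# divided by the square root of the product of the summed squared deviations.
dx = x - x.mean()
dy = y - y.mean()
r_manual = (dx * dy).sum() / np.sqrt((dx ** 2).sum() * (dy ** 2).sum())
print(np.isclose(r_manual, x.corr(y)))  # True

# Spearman's rho is Pearson's r computed on the ranks of the values.
rho_manual = x.rank().corr(y.rank())
print(round(rho_manual, 4))  # 0.9758
```

Because rho depends only on rank order, the single large value 96 affects it far less than it affects Pearson's r.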
