Introduction to Pandas Library

Pandas is an open-source Python library designed for easy manipulation of relational or labeled data, built on top of NumPy. It provides data structures like Series and DataFrames for handling numerical data and time series, and includes functionalities for data creation, selection, and file I/O operations. Users can install Pandas via pip or conda and utilize its features for efficient data analysis.

Uploaded by

Dhrubajeet Gogoi

Pandas in Python

Pandas is an open-source library built mainly for working with relational or labeled
data easily and intuitively.

It provides data structures and operations for manipulating numerical data and time
series, and it is built on top of the NumPy library.

Because its core operations are vectorized through NumPy, Pandas is fast on typical tabular
workloads and lets users do a lot with little code.

Install and import

Pandas is an easy package to install. Open up your terminal program (for Mac users) or
command line (for PC users) and install it using either of the following commands:

conda install pandas

pip install pandas

Alternatively, if you're currently viewing this article in a Jupyter notebook you can run this
cell:

!pip install pandas

The ! at the beginning tells Jupyter to run the rest of the line as a shell command, as if it
were typed in a terminal.

Because pandas is used so often, it is conventionally imported under the shorter alias pd:

import pandas as pd

Core components of pandas: Series and DataFrames

The two primary components of pandas are the Series and the DataFrame.

A Series is essentially a column, and a DataFrame is a two-dimensional table made up of
a collection of Series.

Creating a Pandas Series

A Pandas Series is a one-dimensional labeled array capable of holding data of any type
(integer, string, float, Python objects, etc.). The axis labels are collectively called the index.
A Pandas Series is comparable to a single column in an Excel sheet.
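
As a minimal sketch of what "labeled array" means, the snippet below builds a small Series and reads back its labels, its data, and one value by label. The names and marks are hypothetical example data, not part of the original notes.

```python
import pandas as pd

# Hypothetical example data: three marks labeled by student name.
marks = pd.Series([82, 91, 77], index=["asha", "ben", "chen"])

print(marks.index.tolist())   # the axis labels (the index)
print(marks.values.tolist())  # the underlying data
print(marks["ben"])           # look up one value by its label
```

The index travels with the data, so a lookup uses the label rather than the position.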

Creating a series from array:

To create a Series from an array, import the numpy module and build the array with its
array() function.

In [1]: # to use pandas
import pandas as pd

# to use numpy
import numpy as np

# simple array
data = np.array(['n','i','e','l','i','t'])
ser = pd.Series(data)
print(ser)

0 n
1 i
2 e
3 l
4 i
5 t
dtype: object

Creating a series from Lists:

To create a Series from a list, first create the list and then pass it to the Series
constructor.

In [2]: import pandas as pd

# a simple list (named letters so it does not shadow the built-in list)
letters = ['n','i','e','l','i','t']

# create series from a list
ser = pd.Series(letters)
print(ser)

0 n
1 i
2 e
3 l
4 i
5 t
dtype: object

Creating a series from Dictionary:

To create a Series from a dictionary, first create the dictionary and then pass it to the
Series constructor. The dictionary keys are used to construct the index.

In [3]: import pandas as pd

# a simple dictionary (named so it does not shadow the built-in dict)
letters_by_label = {"a)": 'n', "b)": 'i', "c)": 'e', 'd)': 'l', 'e)': 'i', 'f)': 't'}

# create series from dictionary
ser = pd.Series(letters_by_label)

print(ser)

a) n
b) i
c) e
d) l
e) i
f) t
dtype: object
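
A small sketch of how dictionary keys interact with an explicit index, assuming hypothetical keys and values: when an index is passed along with a dictionary, pandas picks the values by key, follows the order of the index, and fills labels that have no matching key with NaN.

```python
import pandas as pd

# Hypothetical dictionary; the keys "a" and "b" become candidate labels.
counts = {"a": 1, "b": 2}

# Ask for the labels in a different order, plus one label with no key.
ser = pd.Series(counts, index=["b", "a", "z"])
print(ser)
```

Because NaN is a floating-point value, the integers are stored as floats in the result.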

Creating a series from array with index :

To create a Series from an array with a custom index, provide an index with the same
number of elements as the array.

In [4]: import pandas as pd
import numpy as np

data1 = np.array(['n', 'i', 'e', 'l', 'i', 't']) # simple array

# providing an index
ser = pd.Series(data1, index=[10, 11, 12, 13, 14, 15])
print(ser)

print("The data at 13th index is", ser[13]) # accessing a value by its index label

10 n
11 i
12 e
13 l
14 i
15 t
dtype: object
The data at 13th index is l

In [5]: # Combining two series
import pandas as pd
import numpy as np

data1 = np.array(['n', 'i', 'e', 'l', 'i', 't']) # simple array
data2 = ["j", "b", "c"]

# providing an index
ser1 = pd.Series(data1, index=[10, 11, 12, 13, 14, 15])
ser2 = pd.Series(data2, index=[16, 17, 18])

# Series.append is deprecated and was removed in pandas 2.0; use pd.concat instead
ser = pd.concat([ser1, ser2])
print(ser)

10 n
11 i
12 e
13 l
14 i
15 t
16 j
17 b
18 c
dtype: object
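
A minimal sketch of combining Series with pd.concat, the replacement for the deprecated Series.append. The two short Series are hypothetical; they deliberately share labels to show what the ignore_index option changes.

```python
import pandas as pd

first = pd.Series(["n", "i"], index=[0, 1])
second = pd.Series(["e", "l"], index=[0, 1])  # labels overlap with `first`

# By default the original labels are kept, so duplicates can appear.
kept = pd.concat([first, second])

# ignore_index=True discards the old labels and renumbers from 0.
renumbered = pd.concat([first, second], ignore_index=True)

print(kept.index.tolist())
print(renumbered.index.tolist())
```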

In [6]: # dropping elements from a series
import pandas as pd
import numpy as np

data = np.array(['n', 'i', 'e', 'l', 'i', 't']) # simple array
ser = pd.Series(data)
print(ser)

ser1 = ser.drop(index=[3, 4]) # drop the rows labeled 3 and 4
print(ser1)

ser2 = ser.truncate(1, 4) # truncate(before=1, after=4)
print(ser2)

0 n
1 i
2 e
3 l
4 i
5 t
dtype: object
0 n
1 i
2 e
5 t
dtype: object
1 i
2 e
3 l
4 i
dtype: object
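
One detail worth a short sketch: drop() returns a new Series and leaves the original unchanged, which is why the example above assigns the result to a new name. The snippet reuses the same letters as the cell above.

```python
import pandas as pd

ser = pd.Series(["n", "i", "e", "l", "i", "t"])

# drop() builds a new Series without the rows labeled 3 and 4.
shorter = ser.drop(index=[3, 4])

print(len(ser), len(shorter))   # the original keeps all six rows
print(shorter.index.tolist())   # the remaining labels are not renumbered
```
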

DataFrame

A DataFrame is a two-dimensional data structure, i.e., data is aligned in a tabular fashion in
rows and columns.

A Pandas DataFrame consists of three principal components: the data, the rows, and the columns.
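
As a rough sketch, the three components can be inspected directly on a small DataFrame. The two-row table below is hypothetical example data.

```python
import pandas as pd

df = pd.DataFrame({"Name": ["Tom", "nick"], "Age": [20, 21]})

print(df.index.tolist())      # row labels
print(df.columns.tolist())    # column labels
print(df.to_numpy().tolist()) # the data itself, row by row
print(df.shape)               # (number of rows, number of columns)
```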

Create Pandas DataFrame

1. Creating a DataFrame from a dict of ndarrays/lists.
2. Creating a DataFrame from a list of lists.
3. Creating an indexed DataFrame using arrays.
4. Creating a DataFrame from a list of dicts.
5. Creating a DataFrame using the zip() function.
6. Creating a DataFrame from a dict of Series.

In [7]: # DataFrame from dict of ndarrays / lists
# The default index is 0, 1, 2, ...

import pandas as pd

# initialise data as a dict of lists
data = {'Name':['Tom', 'nick', 'krish', 'jack'], 'Age':[20, 21, 19, 18]}

# Create DataFrame
df = pd.DataFrame(data)

# Display the DataFrame
df

Out[7]:
    Name  Age
0    Tom   20
1   nick   21
2  krish   19
3   jack   18

In [8]: # Import pandas library
import pandas as pd

# initialize list of lists
data = [['tom', 10], ['nick', 15], ['juli', 14]]

# Create the pandas DataFrame
df = pd.DataFrame(data, columns=['Name', 'Age'])

# display the DataFrame
df

Out[8]:
   Name  Age
0   tom   10
1  nick   15
2  juli   14

In [9]: # pandas DataFrame with a custom index,
# built from a dict of lists

import pandas as pd

# initialise data as a dict of lists
data = {'Name':['Tom', 'Jack', 'nick', 'juli'], 'marks':[99, 98, 95, 90]}

# Create the pandas DataFrame with row labels
df = pd.DataFrame(data, index=['rank1', 'rank2', 'rank3', 'rank4'])

# display the data
df

Out[9]:
       Name  marks
rank1   Tom     99
rank2  Jack     98
rank3  nick     95
rank4  juli     90

In [10]: # Pandas DataFrame from a list of dicts.

import pandas as pd

# initialise data as a list of dicts (one dict per row)
data = [{'a': 1, 'b': 2, 'c': 3}, {'a': 10, 'b': 20, 'c': 30}]

# Create the DataFrame
df = pd.DataFrame(data)

# display the data
df

Out[10]:
    a   b   c
0   1   2   3
1  10  20  30

In [ ]: # pandas DataFrame from lists using zip.

import pandas as pd

# List1
Name = ['tom', 'krish', 'nick', 'juli']

# List2
Age = [25, 30, 26, 22]

# pair the two lists element by element with zip()
# to get a list of (name, age) tuples
list_of_tuples = list(zip(Name, Age))

# convert the list of tuples into a pandas DataFrame
df = pd.DataFrame(list_of_tuples, columns=['Name', 'Age'])

# display the data
df

In [13]: # Pandas DataFrame from a dict of Series.

import pandas as pd

# initialise data as a dict of Series
d = {'one': pd.Series([10, 20, 30, 40], index=['a', 'b', 'c', 'd']),
     'two': pd.Series([10, 20, 30, 40], index=['a', 'b', 'c', 'd'])}

# create the DataFrame
df = pd.DataFrame(d)

# display the data
df

Out[13]:
   one  two
a   10   10
b   20   20
c   30   30
d   40   40
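
In the cell above both Series share the same index. A minimal sketch of what happens when they do not, using hypothetical values: pandas aligns the Series on the union of their labels and marks the gaps with NaN.

```python
import pandas as pd

one = pd.Series([10, 20, 30], index=["a", "b", "c"])
two = pd.Series([1, 2, 3], index=["b", "c", "d"])  # labels only partly overlap

df = pd.DataFrame({"one": one, "two": two})
print(df)

# "a" exists only in `one` and "d" only in `two`, so those cells are missing.
print(df.isna().sum().to_dict())
```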

Column Selection: To select a column in a Pandas DataFrame, access it by its column name,
for example df["Name"] for one column or df[["Name", "Age"]] for several.

Row Selection: Pandas provides the DataFrame.loc[] indexer to retrieve rows from a
DataFrame by label.
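
A short sketch of the difference between single and double brackets when selecting columns, on hypothetical two-row data: a single column name gives back a Series, while a list of names gives back a DataFrame, even if the list holds only one name.

```python
import pandas as pd

df = pd.DataFrame({"Name": ["Jai", "Princi"], "Age": [27, 24]})

name_series = df["Name"]    # single brackets: a Series
name_frame = df[["Name"]]   # double brackets: a one-column DataFrame

print(type(name_series).__name__, type(name_frame).__name__)
```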

In [14]: # Import pandas package
import pandas as pd

# Define a dictionary containing employee data
data = {'Name':['Jai', 'Princi', 'Gaurav', 'Anuj'],
        'Age':[27, 24, 22, 32],
        'Address':['Delhi', 'Kanpur', 'Allahabad', 'Kannauj'],
        'Qualification':['Msc', 'MA', 'MCA', 'Phd']}

# Convert the dictionary into DataFrame
df = pd.DataFrame(data)
print(df[["Name","Age"]]) # two columns selected by name
print(df[df.columns[0:2]]) # the first two columns, selected by position
print(df[["Age"]]) # a single column, kept as a DataFrame

Name Age
0 Jai 27
1 Princi 24
2 Gaurav 22
3 Anuj 32
Name Age
0 Jai 27
1 Princi 24
2 Gaurav 22
3 Anuj 32
Age
0 27
1 24
2 22
3 32

In [15]: # select two rows

first = df.loc[1] # loc --> location (label-based)
second = df.loc[3]
print(first, "\n \n", second, "\n")

tb = df.loc[1:3] # rows labeled 1 through 3, both ends included
print(tb, "\n")

Name Princi
Age 24
Address Kanpur
Qualification MA
Name: 1, dtype: object

Name Anuj
Age 32
Address Kannauj
Qualification Phd
Name: 3, dtype: object

     Name  Age    Address Qualification
1  Princi   24     Kanpur            MA
2  Gaurav   22  Allahabad           MCA
3    Anuj   32    Kannauj           Phd
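
The slice df.loc[1:3] above returns three rows because label-based slices include both endpoints. A minimal sketch comparing it with the position-based iloc indexer, which follows the usual Python rule of excluding the end (the four-row table is example data):

```python
import pandas as pd

df = pd.DataFrame({"Name": ["Jai", "Princi", "Gaurav", "Anuj"],
                   "Age": [27, 24, 22, 32]})

by_label = df.loc[1:3]      # labels 1, 2 and 3
by_position = df.iloc[1:3]  # positions 1 and 2 only

print(len(by_label), len(by_position))
```
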

Pandas read and write from CSV file

Reading:

The read_csv() function returns a Pandas DataFrame that contains the data of the CSV file.

Writing:

Create a Pandas DataFrame first, then use to_csv() to write the DataFrame to a CSV file.
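
A minimal round-trip sketch of to_csv() and read_csv(). To keep it self-contained it writes to an in-memory text buffer (io.StringIO) instead of a file on disk; that buffer and the two-row table are assumptions for illustration, and a file path works the same way.

```python
import io

import pandas as pd

df = pd.DataFrame({"Name": ["Jai", "Princi"], "Age": [27, 24]})

# Write the DataFrame as CSV text; index=False leaves out the row labels.
buffer = io.StringIO()
df.to_csv(buffer, index=False)

# Rewind the buffer and read the CSV text back into a new DataFrame.
buffer.seek(0)
restored = pd.read_csv(buffer)
print(restored)
```

Without index=False the row labels are written as an extra unnamed first column, which then shows up as a column when the file is read back.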

Pandas read and write from Excel file

Install these packages:

pip install xlwt openpyxl xlsxwriter xlrd

Write an Excel File: Once you have those packages installed, you can save your
DataFrame in an Excel file with .to_excel().

Read an Excel File: You can load data from Excel files with read_excel().

In [ ]: # importing pandas as pd
import pandas as pd

# Creating the dataframe

df = pd.read_csv("[Link]")

df
[Link](["Name", "City"])

In [ ]: # Import pandas package

import pandas as pd

# Define a dictionary containing employee data

data = {'Name':['Jai', 'Princi', 'Gaurav', 'Anuj'],
'Age':[27, 24, 22, 32],
'Address':['Delhi', 'Kanpur', 'Allahabad', 'Kannauj'],
'Qualification':['Msc', 'MA', 'MCA', 'Phd']}

# Convert the dictionary into DataFrame

df = pd.DataFrame(data)
df.to_csv('write_siru.csv', index=False)

#df.to_csv('write_demo2.csv')
# saving the dataframe
df.to_csv(r'D:\[Link]', index=False)

Accessing data from DataFrames

In [ ]: # importing pandas as pd
import pandas as pd

# Creating the dataframe

df = pd.read_csv("[Link]")
print(df)

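In [ ]: print(df.loc[2]) # the whole row labeled 2
# or
print(df.loc[2, "City"]) # one value, by row label and column name
# or
print(df.iloc[2, 2]) # one value, by row position and column position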

In [ ]: # importing pandas as pd
import pandas as pd

# Creating the dataframe

df = pd.read_csv("[Link]")
print(df,"\n")
df.drop([0, 4], inplace=True) # deleting the rows labeled 0 and 4

# display
print(df)

In [ ]: # importing pandas as pd
import pandas as pd

# Creating the dataframe

df = pd.read_csv("[Link]")
print(df,"\n")

df.drop(["Age"], axis=1, inplace=True) # deleting the "Age" column

# display
print(df)
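
The two cells above modify df in place. As a hedged alternative sketch on hypothetical inline data (so no CSV file is needed), drop() can also return a new DataFrame, and the index= and columns= keywords state explicitly whether rows or columns are being removed.

```python
import pandas as pd

df = pd.DataFrame({"Name": ["Jai", "Princi", "Gaurav", "Anuj"],
                   "Age": [27, 24, 22, 32],
                   "City": ["Delhi", "Kanpur", "Allahabad", "Kannauj"]})

without_rows = df.drop(index=[0, 2])    # remove the rows labeled 0 and 2
without_age = df.drop(columns=["Age"])  # remove the "Age" column

print(without_rows.index.tolist())
print(without_age.columns.tolist())
print(df.shape)  # the original is unchanged
```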

Joining two DataFrame

In [ ]: # importing pandas as pd
import pandas as pd

df1 = pd.DataFrame({"Int_Rate":[2,1,2,3], "IND_GDP":[50,45,45,67]}, index=[2

df2 = pd.DataFrame({"Low_Tier_HPI":[50,45,67,34],"Unemployment":[1,3,5,6]},

print(df1,"\n")
print(df2,"\n")
joined_df = df1.join(df2)
print(joined_df)
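
A runnable sketch of an index-based join using the same column names as the cell above. The index values here (years 2001 to 2005) are hypothetical, chosen only for illustration because the original index lists are cut off. By default join() keeps every row of the left DataFrame and fills in NaN where the right one has no matching label.

```python
import pandas as pd

# Hypothetical index labels; the originals are not recoverable.
df1 = pd.DataFrame({"Int_Rate": [2, 1, 2, 3], "IND_GDP": [50, 45, 45, 67]},
                   index=[2001, 2002, 2003, 2004])
df2 = pd.DataFrame({"Low_Tier_HPI": [50, 45, 67, 34], "Unemployment": [1, 3, 5, 6]},
                   index=[2001, 2003, 2004, 2005])

# Left join on the index: rows come from df1, columns from both.
joined_df = df1.join(df2)
print(joined_df)
```

Here the label 2002 has no partner in df2, so its Low_Tier_HPI and Unemployment cells are NaN, and the label 2005 is dropped because it does not occur in df1.
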

In [ ]: import pandas as pd
import numpy as np
a = pd.Series(['Java', 'C', 'C++', np.nan])
b = a.replace({np.nan: "Python"}) # replace the missing value with "Python"
print(b)
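
Missing values such as np.nan can also be handled with the dedicated isna() and fillna() methods. A minimal sketch on the same four values:

```python
import numpy as np
import pandas as pd

languages = pd.Series(["Java", "C", "C++", np.nan])

missing_count = int(languages.isna().sum())  # how many values are missing
filled = languages.fillna("Python")          # new Series with the gap filled

print(missing_count)
print(filled.tolist())
```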
