0% found this document useful (0 votes)

5 views22 pages

Pandas DataFrame1

A pandas DataFrame is a two-dimensional mutable data structure that can hold different types of data in labeled rows and columns. It can be created from various inputs such as lists, dictionaries, and other DataFrames, and allows for operations like arithmetic on its rows and columns. The document provides examples of creating DataFrames, selecting, adding, and deleting columns.

Uploaded by

khushi.chadha2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views22 pages

Pandas DataFrame1

Uploaded by

khushi.chadha2020

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

PANDAS_DATAFRAME

A DataFrame is a two-dimensional data

structure, i.e., data is aligned in a tabular fashion in
rows and columns.
Features of DataFrame

• Potentially columns are of different types

• Size – Mutable

• Labelled axes (rows and columns)

• Can Perform Arithmetic operations on rows and columns

Structure
Let us assume that we are creating a data frame with student’s data.
pandas.DataFrame

A pandas DataFrame can be created using the following constructor −

pandas.DataFrame( data, index, columns, dtype, copy)
The parameters of the constructor are as follows −

Sr.No Parameter & Description

1 data
data takes various forms like ndarray, series, map, lists, dict,
constants and also another DataFrame.

2 index
For the row labels, the Index to be used for the resulting frame is
Optional Default np.arange(n) if no index is passed.

3 columns
For column labels, the optional default syntax is - np.arange(n).
This is only true if no index is passed.

4 dtype
Data type of each column.

5 copy
This command (or whatever it is) is used for copying of data, if the
default is False.
Create DataFrame

A pandas DataFrame can be created using various inputs like −

• Lists
• dict
• Series
• Numpy ndarrays
• Another DataFrame
In the subsequent sections of this chapter, we will see how to create a
DataFrame using these inputs.

Create an Empty DataFrame

A basic DataFrame, which can be created is an Empty Dataframe.

Example
Live Demo

#import the pandas library and aliasing as pd

import pandas as pd
df = pd.DataFrame()
print df
Its output is as follows −
Empty DataFrame
Columns: []
Index: []
Create a DataFrame from Lists

The DataFrame can be created using a single list or a list of lists.

Example 1
Live Demo

import pandas as pd
data = [1,2,3,4,5]
df = pd.DataFrame(data)
print df

Its output is as follows −

0
0 1
1 2
2 3
3 4
4 5
Example 2
Live Demo

import pandas as pd
data = [['Alka',10],['Sonia',12],['Charu',13]]
df = pd.DataFrame(data,columns=['Name','Age'])
print df
Its output is as follows −
Name Age
0 Alka 10
1 Sonia 12
2 Charu 13
Example 3
Live Demo

import pandas as pd
data = [['Alex',10],['Bob',12],['Clarke',13]]
df = pd.DataFrame(data,columns=['Name','Age'],dtype=float)
print df
Its output is as follows −
Name Age
0 Alex 10.0
1 Bob 12.0
2 Clarke 13.0
Note − Observe, the dtype parameter changes the type of Age column
to floating point.

Create a DataFrame from Dict of ndarrays / Lists

All the ndarrays must be of same length. If index is passed, then the
length of the index should equal to the length of the arrays.
If no index is passed, then by default, index will be range(n),
where n is the array length.
Example 1
Live Demo

import pandas as pd
data = {'Name':['Tom', 'Jack', 'Steve', 'Ricky'],'Age':[28,34,29,42]}
df = pd.DataFrame(data)
print df
Its output is as follows −
Age Name
0 28 Tom
1 34 Jack
2 29 Steve
3 42 Ricky
Note − Observe the values 0,1,2,3. They are the default index assigned to each
using the function range(n).
Example 2
Let us now create an indexed DataFrame using arrays.
Live Demo

import pandas as pd
data = {'Name':['Tom', 'Jack', 'Steve', 'Ricky'],'Age':[28,34,29,42]}
df = pd.DataFrame(data, index=['rank1','rank2','rank3','rank4'])
print df
Its output is as follows −
Age Name
rank1 28 Tom
rank2 34 Jack
rank3 29 Steve
rank4 42 Ricky
Note − Observe, the index parameter assigns an index to each row.

Example 3:

Live Demo

import pandas as pd
data = {'Name':['Tom', 'Jack', 'Steve', 'Ricky'],'Age':[28,34,29,42]}
df = pd.DataFrame(data,columns=['Name','Age'])
print df
Its output is as follows −
Name Age
0 Tom 28
1 Jack 34
2 Steve 29
3 Ricky 42
Create a DataFrame from List of Dicts

List of Dictionaries can be passed as input data to create a DataFrame.

The dictionary keys are by default taken as column names.
Example 1
The following example shows how to create a DataFrame by passing
a list of dictionaries.
Live Demo

import pandas as pd
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]
df = pd.DataFrame(data)
print df
Its output is as follows −
a b c
0 1 2 NaN
1 5 10 20.0
Note − Observe, NaN (Not a Number) is appended in missing areas.
Example 2
The following example shows how to create a DataFrame by passing
a list of dictionaries and the row indices.
Live Demo

import pandas as pd
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]
df = pd.DataFrame(data, index=['first', 'second'])
print df
Its output is as follows −
a b c
first 1 2 NaN
second 5 10 20.0
Example 3
The following example shows how to create a DataFrame with a list
of dictionaries, row indices, and column indices.
Live Demo

import pandas as pd
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]

#With two column indices, values same as dictionary keys

df1 = pd.DataFrame(data, index=['first', 'second'], columns=['a', 'b'])

#With two column indices with one index with other name
df2 = pd.DataFrame(data, index=['first', 'second'], columns=['a', 'b1'])
print df1
print df2
Its output is as follows −
#df1 output
a b
first 1 2
second 5 10

#df2 output

a b1
first 1 NaN
second 5 NaN
Note − Observe, df2 DataFrame is created with a column index other
than the dictionary key; thus, appended the NaN’s in place. Whereas,
df1 is created with column indices same as dictionary keys, so NaN’s
appended.
Example 4
The following example shows how to create a DataFrame with a list
of dictionaries, row indices, and column indices.
Live Demo

import pandas as pd
data = [{'a': 1, 'b': 2},{'a': 5, 'b': 10, 'c': 20}]

#With two column indices, values same as dictionary keys

df1 = pd.DataFrame(data, index=['first', 'second'], columns=['a',
'b','c'])

#With two column indices with one index with other name
df2 = pd.DataFrame(data, index=['first', 'second'], columns=['a',
'b1','b'])
print df1
print df2
Output
a b c
first 1 2 NaN
second 5 10 20.0

a b1 b
first 1 NaN 2
second 5 NaN 10
Create a DataFrame from Dict of Series

Dictionary of Series can be passed to form a DataFrame. The resultant

index is the union of all the series indexes passed.
Example
Live Demo

import pandas as pd

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c'],dtype = float),

'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'],dtype = float)}

df = pd.DataFrame(d)
print df
Its output is as follows −
one two
a 1.0 1.0
b 2.0 2.0
c 3.0 3.0
d NaN 4.0
Note − Observe, for the series one, there is no
label ‘d’ passed, but in the result, for
the d label, NaN is appended with NaN.
Let us now understand column selection, addition,
and deletion through examples.

Column Selection

We will understand this by selecting a particular column (like first

column) from the DataFrame.
Example
Live Demo

import pandas as pd

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),

'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}
df = pd.DataFrame(d)
print df ['one']
Its output is as follows −
a 1.0
b 2.0
c 3.0
d NaN
Name: one, dtype: float64
Columns can also be displayed using iloc()
method.

Eg. 1 To access the elements of second and third column of the

dataframe.

Example
Live Demo

import pandas as pd
d = {'one' : pd.Series([1, 15, 3], index=['a', 'b', 'c']),
'two' : pd.Series([56, 2, 67, 4], index=['a', 'b', 'c', 'd']),
'three' : pd.Series([10, 20, 3, 4], index=['a', 'b', 'c', 'e'])
}
df = pd.DataFrame(d)
print df.iloc[:,[1,2]]
Its output is as follows –
three two
a 10.0 56.0
b 20.0 2.0
c 3.0 67.0
d NaN 4.0
e 4.0 NaN
Eg . 2 To access the elements of first and third column of the
dataframe.
Example
Live Demo

import pandas as pd
d = {'one' : pd.Series([1, 15, 3], index=['a', 'b', 'c']),
'two' : pd.Series([56, 2, 67, 4], index=['a', 'b', 'c', 'd']),
'three' : pd.Series([10, 20, 3, 4], index=['a', 'b', 'c', 'e'])
}
df = pd.DataFrame(d,columns=['one','two','three'])
print df.iloc[:,[0,2]]
Output :
one three
a 1.0 10.0
b 15.0 20.0
c 3.0 3.0
d NaN NaN
e NaN 4.0
Eg.3 To access the second and third column of the dataframe.
Example
Live Demo

import pandas as pd
d = {'one' : pd.Series([1, 15, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 15, 3], index=['a', 'b', 'c']),
'three' : pd.Series([56, 2, 67, 4], index=['a', 'b', 'c', 'd']),
'four' : pd.Series([10, 20, 3, 4], index=['a', 'b', 'c', 'e'])
}
df = pd.DataFrame(d,columns=['one','two','three','four'])
print df.iloc[:,[1,2]]

Its output is as follows −

two three
a 1.0 56.0
b 15.0 2.0
c 3.0 67.0
d NaN 4.0
e NaN NaN
Eg.4. To access first three columns of the dataframe.
To select range we need to use : (colon) sign and remember in the
range (start value : last value-1)
Example
Live Demo

import pandas as pd
import pandas as pd
d = {'one' : pd.Series([1, 15, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 15, 3], index=['a', 'b', 'c']),
'three' : pd.Series([56, 2, 67, 4], index=['a', 'b', 'c', 'd']),
'four' : pd.Series([10, 20, 3, 4], index=['a', 'b', 'c', 'e'])
}
df = pd.DataFrame(d,columns=['one','two','three','four'])
print df.iloc[:,1:2]
Its output is as follows −
two
a 1.0
b 15.0
c 3.0
d NaN
e NaN
Column Addition
We will understand this by adding a new column to an existing data
frame.
Example
Live Demo

import pandas as pd

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),

'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)

# Adding a new column to an existing DataFrame object with column

label by passing new series

print ("Adding a new column by passing as Series:")

df['three']=pd.Series([10,20,30],index=['a','b','c'])
print df

print ("Adding a new column using the existing columns in

DataFrame:")
df['four']=df['one']+df['three']

print df
Its output is as follows −
Adding a new column by passing as Series:
one two three
a 1.0 1 10.0
b 2.0 2 20.0
c 3.0 3 30.0
d NaN 4 NaN

Adding a new column using the existing columns in DataFrame:

one two three four
a 1.0 1 10.0 11.0
b 2.0 2 20.0 22.0
c 3.0 3 30.0 33.0
d NaN 4 NaN NaN

Column Deletion

Columns can be deleted or popped; let us take an example to

understand how.
Example
Live Demo

# Using the previous DataFrame, we will delete a column

# using del function and pop function
import pandas as pd

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),

'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd']),
'three' : pd.Series([10,20,30], index=['a','b','c'])}

df = pd.DataFrame(d)
print ("Our dataframe is:")
print df
# using del keyword
print ("Deleting the first column using DEL function:")
del df['one']
print df

# using pop function

print ("Deleting another column using POP function:")
df.pop('two')
print df

Its output is as follows −

Our dataframe is:
one three two
a 1.0 10.0 1
b 2.0 20.0 2
c 3.0 30.0 3
d NaN NaN 4

Deleting the first column using DEL function:

three two
a 10.0 1
b 20.0 2
c 30.0 3
d NaN 4

Deleting another column using POP function:

three
a 10.0
b 20.0
c 30.0
d NaN
Row Selection, Addition, and Deletion

We will now understand row selection, addition and deletion through

examples. Let us begin with the concept of selection.

Selection by Label

Rows can be selected by passing row label to a loc function.

Live Demo

import pandas as pd
d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)
#print df
print df.loc['b']
Its output is as follows −
one 2.0
two 2.0
Name: b, dtype: float64
The result is a series with labels as column names of the DataFrame.
And, the Name of the series is the label with which it is retrieved.
Selection by integer location

Rows can be selected by passing integer location to an iloc function.

Live Demo

import pandas as pd

d = {'one' : pd.Series([1, 2, 6], index=['a', 'b', 'c']),

'two' : pd.Series([1, 20, 30, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)
print df.iloc[3]
Its output is as follows −
one NaN
two 4.0
Name: d, dtype: float64

Slice Rows
Multiple rows can be selected using ‘ : ’ operator.
Live Demo

import pandas as pd
d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),
'two' : pd.Series([1, 2, 3, 4], index=['a', 'b', 'c', 'd'])}

df = pd.DataFrame(d)
print df[2:4]
Its output is as follows −
one two
c 3.0 3
d NaN 4
Addition of Rows

Add new rows to a DataFrame using the append function. This

function will append the rows at the end.

Live Demo

import pandas as pd

df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])

#print df
df2 = pd.DataFrame([[5, 6], [7, 8]], columns = ['a','b'])
df = df.append(df2)
print df
Its output is as follows −
a b
0 1 2
1 3 4
0 5 6
1 7 8
Deletion of Rows

Use index label to delete or drop rows from a DataFrame. If label is

duplicated, then multiple rows will be dropped.
If you observe, in the above example, the labels are duplicate. Let us
drop a label and will see how many rows will get dropped.

Live Demo

import pandas as pd

df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])

df2 = pd.DataFrame([[5, 6], [7, 8]], columns = ['a','b'])

df = df.append(df2)

# Drop rows with label 0

df = df.drop(0)

print df

Its output is as follows −

ab
134
178
In the above example, two rows were dropped because those two
contain the same label 0.

Homework 4 Coursera 07 PDF
89% (9)
Homework 4 Coursera 07 PDF
11 pages
IPDirector Userman IPEdit
No ratings yet
IPDirector Userman IPEdit
270 pages
Cheat Sheet: The Pandas Dataframe Object: Column Index (DF - Columns)
No ratings yet
Cheat Sheet: The Pandas Dataframe Object: Column Index (DF - Columns)
6 pages
Python Pandas New Sylabus
No ratings yet
Python Pandas New Sylabus
53 pages
Python Pandas ch-2
No ratings yet
Python Pandas ch-2
56 pages
SBLC 1
No ratings yet
SBLC 1
23 pages
Python Pandas - DataFrame
No ratings yet
Python Pandas - DataFrame
12 pages
lecture-9-pandas
No ratings yet
lecture-9-pandas
176 pages
Block 1-Data Handling Using Pandas DataFrame
No ratings yet
Block 1-Data Handling Using Pandas DataFrame
17 pages
Dataframe Ip
No ratings yet
Dataframe Ip
75 pages
Pandas
No ratings yet
Pandas
16 pages
DataFrame Notes1
No ratings yet
DataFrame Notes1
32 pages
09_Pandas slides
No ratings yet
09_Pandas slides
33 pages
data frame CREATION
No ratings yet
data frame CREATION
7 pages
14_Pandas
No ratings yet
14_Pandas
25 pages
Python UnitIV
No ratings yet
Python UnitIV
20 pages
ACFrOgCuxzI7id1LCXi9yoyuvISxGard75NvAshCzyRkhz0Fv_jimN6GuJsUI3qR2_jr7vxbRmHlwJPmcpRa7v3zCXyCokAXM23U17GlLnoA-5jSOz-osgZwdAL-ghXvjz5yld44_1rLLZaDMrebwXv-HRUry-kJjWFBo4Jkhw==
No ratings yet
ACFrOgCuxzI7id1LCXi9yoyuvISxGard75NvAshCzyRkhz0Fv_jimN6GuJsUI3qR2_jr7vxbRmHlwJPmcpRa7v3zCXyCokAXM23U17GlLnoA-5jSOz-osgZwdAL-ghXvjz5yld44_1rLLZaDMrebwXv-HRUry-kJjWFBo4Jkhw==
12 pages
3. DATAFRAME.pdf
No ratings yet
3. DATAFRAME.pdf
14 pages
The Pandas Library
No ratings yet
The Pandas Library
39 pages
Lab 9
No ratings yet
Lab 9
9 pages
CSL-410-L15
No ratings yet
CSL-410-L15
29 pages
p.no 35 to 52
No ratings yet
p.no 35 to 52
18 pages
Class Notes: Class: XII Date: 7-Apr-2020 Subject: Informatics Practices Topic: 2. Python Pandas
No ratings yet
Class Notes: Class: XII Date: 7-Apr-2020 Subject: Informatics Practices Topic: 2. Python Pandas
4 pages
Python Pandas Module - Introduction-07-11-2023
No ratings yet
Python Pandas Module - Introduction-07-11-2023
84 pages
Chapter 1 Python Pandas - I
No ratings yet
Chapter 1 Python Pandas - I
35 pages
IP Practical File - Reference
No ratings yet
IP Practical File - Reference
98 pages
Pandas Shan Ver2
No ratings yet
Pandas Shan Ver2
25 pages
Pandas & Numpy
No ratings yet
Pandas & Numpy
32 pages
Chapter 2 Data Handling using pandas - I(DATA FRAME)
No ratings yet
Chapter 2 Data Handling using pandas - I(DATA FRAME)
15 pages
UNIT - 3 Pandas
No ratings yet
UNIT - 3 Pandas
21 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
38 pages
Unit 4
No ratings yet
Unit 4
36 pages
Dataframe Notes
No ratings yet
Dataframe Notes
26 pages
Dataframe Notes
No ratings yet
Dataframe Notes
39 pages
Cheat Sheet
No ratings yet
Cheat Sheet
10 pages
Python 3rd unit question and answer
No ratings yet
Python 3rd unit question and answer
25 pages
12 IP Unit 1 Python Pandas I (Part 3 Dataframes) Notes
100% (1)
12 IP Unit 1 Python Pandas I (Part 3 Dataframes) Notes
24 pages
Python pandas
No ratings yet
Python pandas
34 pages
Pandas
No ratings yet
Pandas
27 pages
Pandas python
No ratings yet
Pandas python
11 pages
L1_DataFrames_I
No ratings yet
L1_DataFrames_I
24 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
12 pages
P03 Introduction To Pandas Ans
No ratings yet
P03 Introduction To Pandas Ans
45 pages
Pandas
No ratings yet
Pandas
82 pages
Class 12 Practical File
No ratings yet
Class 12 Practical File
29 pages
Pandas DataFrame Notes
67% (3)
Pandas DataFrame Notes
13 pages
UNIT-4 Important Q-A
No ratings yet
UNIT-4 Important Q-A
28 pages
Data Handing Using Pandas-I
100% (2)
Data Handing Using Pandas-I
46 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
10 pages
Pandas DataFrameObject
No ratings yet
Pandas DataFrameObject
4 pages
Pandas 1
No ratings yet
Pandas 1
89 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
6 pages
Pandas.ipynb - Colab (1)
No ratings yet
Pandas.ipynb - Colab (1)
8 pages
Python Pandas Interview Questions
100% (1)
Python Pandas Interview Questions
17 pages
Class 12 Panda Project
No ratings yet
Class 12 Panda Project
13 pages
Data Handling Using Pandas-I-ORG
No ratings yet
Data Handling Using Pandas-I-ORG
44 pages
Data Handlinng Using Pandas
No ratings yet
Data Handlinng Using Pandas
46 pages
Pandas - Dataframe - Introduction
No ratings yet
Pandas - Dataframe - Introduction
16 pages
Pandas DataFrame
No ratings yet
Pandas DataFrame
70 pages
Pandas Library
No ratings yet
Pandas Library
5 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
OAT March 25
No ratings yet
OAT March 25
1 page
ALMA Connect List 22-25
No ratings yet
ALMA Connect List 22-25
2 pages
ALMA Connect List 22-25
No ratings yet
ALMA Connect List 22-25
15 pages
Room Wise Seating Plan April 2025 Paper 1
No ratings yet
Room Wise Seating Plan April 2025 Paper 1
10 pages
Proposal IT Role Cloudologic
No ratings yet
Proposal IT Role Cloudologic
1 page
Marketing Ws
No ratings yet
Marketing Ws
2 pages
CPA 2019
No ratings yet
CPA 2019
5 pages
IC5 - L2 - WQ - U7to8 EXAMEN RESUELTO
No ratings yet
IC5 - L2 - WQ - U7to8 EXAMEN RESUELTO
2 pages
4 in 1 Backlit Users Guide
No ratings yet
4 in 1 Backlit Users Guide
10 pages
TC2987en-Ed01 How To Implement SNMP To Monitor OXO Connect
No ratings yet
TC2987en-Ed01 How To Implement SNMP To Monitor OXO Connect
18 pages
Circular Queue
No ratings yet
Circular Queue
3 pages
HD 13 Numerical Integration of MDOF 2008
No ratings yet
HD 13 Numerical Integration of MDOF 2008
18 pages
GSA BIM Guide 05 Version 2.1
No ratings yet
GSA BIM Guide 05 Version 2.1
76 pages
13 - m1 - Linux Basic Commands - Edureka VM PDF
No ratings yet
13 - m1 - Linux Basic Commands - Edureka VM PDF
3 pages
Strategical Brand Audit On KFC
No ratings yet
Strategical Brand Audit On KFC
17 pages
Vianney
No ratings yet
Vianney
41 pages
Isaac Madan, Shaurya Saluja, Aojia Zhao, Automated Bitcoin Trading Via Machine Learning Algorithms
No ratings yet
Isaac Madan, Shaurya Saluja, Aojia Zhao, Automated Bitcoin Trading Via Machine Learning Algorithms
10 pages
Zte Mf100 Hsdpa Usb Stick
No ratings yet
Zte Mf100 Hsdpa Usb Stick
10 pages
Arcpad Forms
No ratings yet
Arcpad Forms
27 pages
Devlist
No ratings yet
Devlist
7 pages
Financial Consolidations MS AX2009
100% (1)
Financial Consolidations MS AX2009
11 pages
Artificial Intelligence Slide
No ratings yet
Artificial Intelligence Slide
14 pages
Cybersecurity Issues in Generative AI
No ratings yet
Cybersecurity Issues in Generative AI
4 pages
Salesforce Pages Developers Guide
No ratings yet
Salesforce Pages Developers Guide
569 pages
Mail Merging2025
No ratings yet
Mail Merging2025
22 pages
1990 Experiences With Defect Prevention
No ratings yet
1990 Experiences With Defect Prevention
30 pages
04 trobleshooting-H5500AWXXH
No ratings yet
04 trobleshooting-H5500AWXXH
30 pages
Gener Ol
No ratings yet
Gener Ol
59 pages
Chapter 1
No ratings yet
Chapter 1
8 pages
A Literature Study On Agricultural Production System Using Iot As Inclusive Technology
No ratings yet
A Literature Study On Agricultural Production System Using Iot As Inclusive Technology
5 pages
Trustless Education? A Blockchain System For University Grades PDF
No ratings yet
Trustless Education? A Blockchain System For University Grades PDF
4 pages
Atlantic Computer Case
100% (1)
Atlantic Computer Case
5 pages
Overall Summary:: SAP MM Certified Associate & SAP Certification ID: 0019350978
No ratings yet
Overall Summary:: SAP MM Certified Associate & SAP Certification ID: 0019350978
6 pages
Cisco Lead2pass 200-201 Sample Question 2022-Dec-26 by Barton 88q Vce
No ratings yet
Cisco Lead2pass 200-201 Sample Question 2022-Dec-26 by Barton 88q Vce
12 pages
SDPB Manual
No ratings yet
SDPB Manual
14 pages

Pandas DataFrame1

Uploaded by

Pandas DataFrame1

Uploaded by

PANDAS_DATAFRAME

A DataFrame is a two-dimensional data

• Potentially columns are of different types

• Labelled axes (rows and columns)

• Can Perform Arithmetic operations on rows and columns

A pandas DataFrame can be created using the following constructor −

Sr.No Parameter & Description

A pandas DataFrame can be created using various inputs like −

Create an Empty DataFrame

A basic DataFrame, which can be created is an Empty Dataframe.

#import the pandas library and aliasing as pd

The DataFrame can be created using a single list or a list of lists.

Its output is as follows −

Create a DataFrame from Dict of ndarrays / Lists

List of Dictionaries can be passed as input data to create a DataFrame.

#With two column indices, values same as dictionary keys

#With two column indices, values same as dictionary keys

Dictionary of Series can be passed to form a DataFrame. The resultant

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c'],dtype = float),

We will understand this by selecting a particular column (like first

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),

Eg. 1 To access the elements of second and third column of the

Its output is as follows −

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),

# Adding a new column to an existing DataFrame object with column

print ("Adding a new column by passing as Series:")

print ("Adding a new column using the existing columns in

Adding a new column using the existing columns in DataFrame:

Columns can be deleted or popped; let us take an example to

# Using the previous DataFrame, we will delete a column

d = {'one' : pd.Series([1, 2, 3], index=['a', 'b', 'c']),

# using pop function

Its output is as follows −

Deleting the first column using DEL function:

Deleting another column using POP function:

We will now understand row selection, addition and deletion through

Rows can be selected by passing row label to a loc function.

Rows can be selected by passing integer location to an iloc function.

d = {'one' : pd.Series([1, 2, 6], index=['a', 'b', 'c']),

Add new rows to a DataFrame using the append function. This

df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])

Use index label to delete or drop rows from a DataFrame. If label is

df = pd.DataFrame([[1, 2], [3, 4]], columns = ['a','b'])

# Drop rows with label 0

Its output is as follows −

You might also like