0% found this document useful (0 votes)

3 views

Data frames pandas, handout 1 (1)

Uploaded by

ayaqassas21

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Data frames pandas, handout 1 (1)

Uploaded by

ayaqassas21

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

1

Data Frames
Pandas library
1-Pandas library
2-Notepade.csv
3-Data frames
4-Reading from excel file and .csv file
5-The writer function

Reading data from excel file

You have to create this excel file, then read it and display
it in the output using python

Book4.xlsx excel file

a b c
0 1 5 3
1 1 6 8
2 2 3 7
3 3 1 8

Code to read the file, from excel

Page 1
2

You need to import some libraries that are

shown below. To install these libraries use
the pip3
#use pip3 in command window to install
# 1- pandas 2- openpyxl
# pip3 install pandas
# pip3 install openyxl
Example

Example Write the script

below to read

For excel file

import pandas as pd
df = pd.read_excel('book4.xlsx')
print(df)
print('done')

Page 2
3

###################################
For notepad with .csv extension
Create notepad commas separated data
as shown below.
When you save it by going to all
files, then add the extension .csc

Example The code to read notepad.csv

import pandas as pd
df = pd.read_csv('data2.csv')
print(df)
print('done')

Page 3
4

Data Frames
Python dataframe is a data structure constructed with rows and columns,
similar to a database or Excel spreadsheet. It consists of a dictionary of lists in
which the list each have their own identifiers or keys, such as “last name” or
“food group.”

—------------
Example
d={"Duration":{
"0":60,
"1":60,
"2":60,
"3":45,

},
"Maxpulse":{
"0":130,
"1":145,
"2":135,
"3":175,
},
"Calories":{
"0":409.1,
"1":479.0,
"2":340.0,
"3":282.4,

}
}
import json

with open("dd.json", 'w') as fout:

json_dumps_str = json.dumps(d, indent=4)
print(json_dumps_str, file=fout)

import pandas as pd
df = pd.read_json('dd.json')

Page 4
5

print(df.to_string())

Output
Duration Maxpulse Calories
0 60 130 409.1
1 60 145 479.0
2 60 135 340.0
3 45 175 282.4

To create a dataframe you must first create a dictionary.

These are the dictionary methods we use

Method Usage

Values( Return a list of all values in the dictionary

)

Update( Updates the dictionary with the specified key-value pairs

)

setdefa Returns the value of the specified key. If the key does not exist
ult() insert the key, with the specified value

Page 5
6

clear() Removes all the elements from the dictionary

keys() Returns a list containing the keys of the dictionary

pop() Removes the element with the specified key

popitem Removes the last inserted key-value pair

()

get() Returns the value of the specified key

items() Returns a list containing a tuple for each key value pair

copy() Returns a copy of the dictionary

Page 6
7

fromkey Returns a dictionary with the specified keys and value

s()

This how we create a

dictionary and print header
import pandas as pd
import numpy as np
df={
'First name':['Ali','Fred'],
'Last name':['Baba','Json'],
'Visit':['march,19','March,25'],

Page 7
8

'Leave':['April,20,Aprile,26']
}

# Get the list of all column names from headers

for column_headers in df:
print(column_headers, end='\t')

Page 8
9

Example:column by column
generate a data frame
import pandas as pd
import numpy as np

technologies= {
'Courses':["Spark","PySpark","Hadoop","Python","Pandas"],
'Fee' :[22000,25000,23000,24000,26000],
'Duration':['30days','50days','30days', None,np.nan],
'Discount':[1000,2300,1000,1200,2500]
}
df = pd.DataFrame(technologies)
print(df)

#notice the column generated in the output

# To Get the list of all column

names from headers
column_headers = list(df.columns.values)
print("The Column Header :", column_headers)
Output:
The Column Header : ['Courses', 'Fee', 'Duration', 'Discount']

Page 9
10

#Example Get the list of all column names

from headers

column_headers = df.columns.values.tolist()
print("The Column Header :", column_headers)

output
The Column Header : ['Courses', 'Fee', 'Duration', 'Discount']

Page 10
11

To reset, and delete

rows
import pandas as pd
# Create DataFrame from dict
df =
pd.DataFrame({'Courses':['Spark','PySpark','Java','PHP'
],
'Fee':[20000,20000,15000,10000],
'Duration':['35days','35days','40days','30days']})
print(df)
df=df.drop([2])
print(df)
df2=df.reset_index()
print(df2)

output
Courses Fee Duration
0 Spark 20000 35days
1 PySpark 20000 35days
2 Java 15000 40days
3 PHP 10000 30days

Courses Fee Duration

0 Spark 20000 35days
1 PySpark 20000 35days
3 PHP 10000 30days
index Courses Fee Duration
0 0 Spark 20000 35 days
1 1 PySpark 20000 35 days

Page 11
12

2 3 PHP 10000 30 days

#Use .scv file to drop a colun

import pandas as pd
df = pd.read_csv('data2.csv')
print(df)
df2=df.drop([2])
print(df2)
print('done')

a b c
0 1 2 3
1 2 5 7
2 3 8 3

a b c
0 1 2 3
1 2 5 7
Done
Page 12
13

0 1 5 3
1 1 6 8
2 2 3 7
3 3 1 8
a b c
0 1 5 3
1 1 6 8
3 3 1 8

Example Write Excel with Python

Pandas. Excel file will be created
You can write any data (lists, strings, numbers etc) to Excel, by first converting
it into a Pandas DataFrame and then writing the DataFrame to Excel.

To export a Pandas DataFrame as an Excel file (extension: .xlsx, .xls), use

the to_excel() method.

Page 13
14

Install the following library

$ pip install xlwt
$ pip install openpyxl
Importing openpyxl is required if you want to append it to an existing Excel file
described at the end.

import pandas as pd
import openpyxl

df = pd.DataFrame([[11, 21, 31], [12, 22, 32], [31, 32, 33]],

index=['one', 'two', 'three'], columns=['a', 'b', 'c'])

print(df)
# a b c
# one 11 21 31
# two 12 22 32
# three 31 32 33

You can specify a path as the first argument of the to_excel() method.

Note: that the data in the original file is deleted when overwriting.

The argument new_sheet_name is the name of the sheet. If omitted, it will be

named Sheet1.

Output is the excel file that is created

Page 14
15

import pandas as pd
import openpyxl

df = pd.DataFrame([[11, 21, 31], [12, 22, 32], [31, 32,

33]],
index=['one', 'two', 'three'],
columns=['a', 'b', 'c'])

print(df)
#writing to excel
df.to_excel('pandas_to_excel.xlsx',
sheet_name='new_sheet_name')

#If you do not need to write index (row name), columns

(column name),
#the argument index, columns is False

#df.to_excel('xxx_no_index_header.xlsx', index=False,
header=False)

#then use the ExcelWriter() function like

this:

Page 15
16

with pd.ExcelWriter('pandas_to_excel.xlsx') as writer:

df.to_excel(writer, sheet_name='sheet1')
df.to_excel(writer, sheet_name='sheet2')
#You don’t need to call writer.save(), writer.close()
within the blocks.

Page 16

Course Structure - Agricultural Engineering
75% (4)
Course Structure - Agricultural Engineering
15 pages
Python Libraries
No ratings yet
Python Libraries
27 pages
dv_lab_manual_modified
No ratings yet
dv_lab_manual_modified
31 pages
ainotes
No ratings yet
ainotes
5 pages
Ainotes dataframe
No ratings yet
Ainotes dataframe
5 pages
Oxy Metre
No ratings yet
Oxy Metre
17 pages
dav 2 unit
No ratings yet
dav 2 unit
55 pages
Pandas DataFrame1
No ratings yet
Pandas DataFrame1
22 pages
Practical File Python
No ratings yet
Practical File Python
25 pages
Exercise 3
No ratings yet
Exercise 3
12 pages
Unit 2 notes-II
No ratings yet
Unit 2 notes-II
47 pages
Pandas_Dataframe_All_Operations_1735471870
No ratings yet
Pandas_Dataframe_All_Operations_1735471870
4 pages
MOD-3 Dap
No ratings yet
MOD-3 Dap
41 pages
ACFrOgCuxzI7id1LCXi9yoyuvISxGard75NvAshCzyRkhz0Fv_jimN6GuJsUI3qR2_jr7vxbRmHlwJPmcpRa7v3zCXyCokAXM23U17GlLnoA-5jSOz-osgZwdAL-ghXvjz5yld44_1rLLZaDMrebwXv-HRUry-kJjWFBo4Jkhw==
No ratings yet
ACFrOgCuxzI7id1LCXi9yoyuvISxGard75NvAshCzyRkhz0Fv_jimN6GuJsUI3qR2_jr7vxbRmHlwJPmcpRa7v3zCXyCokAXM23U17GlLnoA-5jSOz-osgZwdAL-ghXvjz5yld44_1rLLZaDMrebwXv-HRUry-kJjWFBo4Jkhw==
12 pages
Data Handling using pandas - I Q & ANS (1)
No ratings yet
Data Handling using pandas - I Q & ANS (1)
9 pages
99c949c0-5910-425f-9ac5-155882800fa5
No ratings yet
99c949c0-5910-425f-9ac5-155882800fa5
36 pages
Pandas
No ratings yet
Pandas
16 pages
Edx Course Lab Programs
No ratings yet
Edx Course Lab Programs
19 pages
Aryan Cs Project
No ratings yet
Aryan Cs Project
28 pages
Experiment 1 solution
No ratings yet
Experiment 1 solution
5 pages
Pandas Notes (1)
No ratings yet
Pandas Notes (1)
10 pages
Pandas Dataframe
No ratings yet
Pandas Dataframe
48 pages
Data Analysis and Visulaization Experiment
No ratings yet
Data Analysis and Visulaization Experiment
104 pages
Informatic Practices Hhw
No ratings yet
Informatic Practices Hhw
21 pages
UNIT 1 PYTHON PROGRAMMING-II
No ratings yet
UNIT 1 PYTHON PROGRAMMING-II
15 pages
Chapter 2 Data Handling using pandas - I(DATA FRAME)
No ratings yet
Chapter 2 Data Handling using pandas - I(DATA FRAME)
15 pages
SBLC 1
No ratings yet
SBLC 1
23 pages
Arpit
No ratings yet
Arpit
30 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
Python Lab ALL 10 Prgms
No ratings yet
Python Lab ALL 10 Prgms
16 pages
Experiment 678910
No ratings yet
Experiment 678910
12 pages
Python and Excel
No ratings yet
Python and Excel
11 pages
Pandas
No ratings yet
Pandas
13 pages
exp3 python (1)
No ratings yet
exp3 python (1)
15 pages
Pandas CheatSheet
No ratings yet
Pandas CheatSheet
18 pages
Python For DS Cheat Sheet
100% (2)
Python For DS Cheat Sheet
6 pages
Pandas
No ratings yet
Pandas
29 pages
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
No ratings yet
Pierian Data - Python For Finance & Algorithmic Trading Course Notes
11 pages
pyspark classes and function
No ratings yet
pyspark classes and function
20 pages
Python_for_DataScience
No ratings yet
Python_for_DataScience
47 pages
Lab Manual
No ratings yet
Lab Manual
7 pages
DATA HANDLING AND CSV 2024- 2025
No ratings yet
DATA HANDLING AND CSV 2024- 2025
12 pages
PythonForMachineLearning
No ratings yet
PythonForMachineLearning
66 pages
09_Pandas slides
No ratings yet
09_Pandas slides
33 pages
Python pandas
No ratings yet
Python pandas
34 pages
Movie Ticket Data Analysis System (Ip Class 12) (2024-25)
No ratings yet
Movie Ticket Data Analysis System (Ip Class 12) (2024-25)
26 pages
Informatic Practices Hhw (3)
No ratings yet
Informatic Practices Hhw (3)
59 pages
Text to Word
No ratings yet
Text to Word
5 pages
unit 4 Spark SQL
No ratings yet
unit 4 Spark SQL
49 pages
Pandas PDF(2)
No ratings yet
Pandas PDF(2)
25 pages
Chapter 1 Python Pandas - I
No ratings yet
Chapter 1 Python Pandas - I
35 pages
python interviews
No ratings yet
python interviews
154 pages
CSL-410-L15
No ratings yet
CSL-410-L15
29 pages
12th IP PRACTICALS
No ratings yet
12th IP PRACTICALS
18 pages
Dev Lab Manual Org
No ratings yet
Dev Lab Manual Org
28 pages
cookbook.rst
No ratings yet
cookbook.rst
28 pages
IP Book 12 Question Bank
No ratings yet
IP Book 12 Question Bank
20 pages
Pandas Library Documentation
No ratings yet
Pandas Library Documentation
16 pages
EDS - Python Cheat Sheet
No ratings yet
EDS - Python Cheat Sheet
3 pages
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Pardo, Orense and Sarmah (2018) - Cyclic Strength of Sand Mixed With Biochar Some Preliminary Results
No ratings yet
Pardo, Orense and Sarmah (2018) - Cyclic Strength of Sand Mixed With Biochar Some Preliminary Results
7 pages
Modbus Map-Vortex Crowcon
No ratings yet
Modbus Map-Vortex Crowcon
44 pages
听觉空间注意力haho
No ratings yet
听觉空间注意力haho
5 pages
Suppl.B HPK-L - 1136 - 00 - 03 S2-EN - Final
No ratings yet
Suppl.B HPK-L - 1136 - 00 - 03 S2-EN - Final
34 pages
KVL and KCL
100% (1)
KVL and KCL
14 pages
Class 9 B
No ratings yet
Class 9 B
2 pages
Week 23 RD Grade Math
No ratings yet
Week 23 RD Grade Math
4 pages
3D Woven Fabric
No ratings yet
3D Woven Fabric
9 pages
Next Generation Financial Protocol: Velo Team
No ratings yet
Next Generation Financial Protocol: Velo Team
27 pages
Application: Name:Plate-Shaped RF Power Ceramic Capacitor Item#.: CCG81 Series
No ratings yet
Application: Name:Plate-Shaped RF Power Ceramic Capacitor Item#.: CCG81 Series
3 pages
Module 2 Exercise2
No ratings yet
Module 2 Exercise2
2 pages
Fundamentals of Computer - 100 MCQ Questions MCQ Sets
100% (1)
Fundamentals of Computer - 100 MCQ Questions MCQ Sets
26 pages
Oil Recovery by Imbibition Austin and Buds Water Flooding in The Formations
No ratings yet
Oil Recovery by Imbibition Austin and Buds Water Flooding in The Formations
7 pages
Pharmaceutical Calculations and Techniques PHCL111: Our Lady of Fatima University
No ratings yet
Pharmaceutical Calculations and Techniques PHCL111: Our Lady of Fatima University
24 pages
Mechanisms: © 2011 Project Lead The Way, Inc. Automation and Robotics VEX
No ratings yet
Mechanisms: © 2011 Project Lead The Way, Inc. Automation and Robotics VEX
30 pages
Firestone Air Gripper Catalog
No ratings yet
Firestone Air Gripper Catalog
42 pages
2.3 Load Combination's: Notation
No ratings yet
2.3 Load Combination's: Notation
24 pages
Smart Actuator II - Installation & Operation Manual.v3.8a
No ratings yet
Smart Actuator II - Installation & Operation Manual.v3.8a
103 pages
Ek Khwaab Ne Aankhein Kholi Hain Kya Mod Aaya Hai Kahaani Mein Wo Bheeg Rahi Hai Baarish Mein Aur Aag Lagi Hai Paani Mein
No ratings yet
Ek Khwaab Ne Aankhein Kholi Hain Kya Mod Aaya Hai Kahaani Mein Wo Bheeg Rahi Hai Baarish Mein Aur Aag Lagi Hai Paani Mein
3 pages
DesignBuilder Simulation Training Slides
No ratings yet
DesignBuilder Simulation Training Slides
27 pages
Algorithm Complexity
No ratings yet
Algorithm Complexity
35 pages
TAC 1000 Site Readiness Template (1)
No ratings yet
TAC 1000 Site Readiness Template (1)
6 pages
CU7008-Ultra Wideband Communication
100% (2)
CU7008-Ultra Wideband Communication
16 pages
Determiners
No ratings yet
Determiners
14 pages
HEMATOLOGY-LAB-PREFINALS
No ratings yet
HEMATOLOGY-LAB-PREFINALS
9 pages
JWT Handbook
No ratings yet
JWT Handbook
77 pages
Teachers Resource Chapter 3 Number Relationships Sample
No ratings yet
Teachers Resource Chapter 3 Number Relationships Sample
94 pages
Orthogonality
No ratings yet
Orthogonality
21 pages
General Physiology Test Paper and answer key Batch 2024
No ratings yet
General Physiology Test Paper and answer key Batch 2024
2 pages

Data frames pandas, handout 1 (1)

Uploaded by

Data frames pandas, handout 1 (1)

Uploaded by

1

Reading data from excel file

Book4.xlsx excel file

Code to read the file, from excel

You need to import some libraries that are

Example Write the script

For excel file

Example The code to read notepad.csv

with open("dd.json", 'w') as fout:

To create a dataframe you must first create a dictionary.

Values( Return a list of all values in the dictionary

Update( Updates the dictionary with the specified key-value pairs

clear() Removes all the elements from the dictionary

keys() Returns a list containing the keys of the dictionary

pop() Removes the element with the specified key

popitem Removes the last inserted key-value pair

get() Returns the value of the specified key

copy() Returns a copy of the dictionary

fromkey Returns a dictionary with the specified keys and value

This how we create a

# Get the list of all column names from headers

#notice the column generated in the output

# To Get the list of all column

#Example Get the list of all column names

To reset, and delete

Courses Fee Duration

2 3 PHP 10000 30 days

#Use .scv file to drop a colun

Example Write Excel with Python

To export a Pandas DataFrame as an Excel file (extension: .xlsx, .xls), use

Install the following library

df = pd.DataFrame([[11, 21, 31], [12, 22, 32], [31, 32, 33]],

The argument new_sheet_name is the name of the sheet. If omitted, it will be

Output is the excel file that is created

df = pd.DataFrame([[11, 21, 31], [12, 22, 32], [31, 32,

#If you do not need to write index (row name), columns

#then use the ExcelWriter() function like

with pd.ExcelWriter('pandas_to_excel.xlsx') as writer:

You might also like