DataAnalysisHandout7 DataFiltering

This document provides examples of how to filter pandas DataFrames by text, numeric values, dates, and columns. The examples cover equality, inequality, membership (isin), date ranges, and parts of dates. Multiple conditions can be combined with logical operators such as & (AND).
Pandas DataFrame – Data filtering

Example - Filter by text (==)

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep only the rows whose Area is exactly 'Colombo'
customers1 = customers.loc[customers['Area'] == 'Colombo']
print(customers1)

Example - Filter by text (!=)

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep every row whose Area is anything other than 'Colombo'
customers1 = customers.loc[customers['Area'] != 'Colombo']
print(customers1)
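
The examples in this handout read Customer.xlsx, which is not included here. As a minimal sketch for trying the == and != filters without that file, the block below builds a small DataFrame in memory. The names, areas, and amounts are made-up sample data, not the contents of the real workbook.

```python
import pandas as pd

# Hypothetical sample data standing in for Customer.xlsx (values are invented)
customers = pd.DataFrame({
    "Name": ["Kamal", "Nimal", "Kumari", "Sunil"],
    "Area": ["Colombo", "Gampaha", "Kandy", "Colombo"],
    "DueAmount": [75000, 20000, 60000, 45000],
})

# The comparison produces a boolean Series; .loc keeps the rows where it is True
in_colombo = customers.loc[customers["Area"] == "Colombo"]
not_colombo = customers.loc[customers["Area"] != "Colombo"]

print(in_colombo)
print(not_colombo)
```

Every row lands in exactly one of the two results, so the two filters together partition the table.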

Example - Filter with isin

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep the rows whose Area matches any value in the list
customers1 = customers.loc[customers['Area'].isin(['Colombo', 'Gampaha'])]
print(customers1)

Example - Filter by a value (>)

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep the rows whose DueAmount is greater than 50000
customers1 = customers.loc[customers['DueAmount'] > 50000]
print(customers1)
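
The same pattern extends to a numeric range. The sketch below, again on made-up sample data rather than the real Customer.xlsx, shows two equivalent ways to keep amounts between two bounds: Series.between, which includes both endpoints by default, and two explicit comparisons joined with &.

```python
import pandas as pd

# Hypothetical sample data (values are invented for illustration)
customers = pd.DataFrame({
    "Name": ["Kamal", "Nimal", "Kumari", "Sunil"],
    "DueAmount": [75000, 20000, 60000, 45000],
})

# between() is inclusive on both ends by default
mid_range = customers.loc[customers["DueAmount"].between(40000, 60000)]

# The same range written as two comparisons; each needs its own parentheses
explicit = customers.loc[
    (customers["DueAmount"] >= 40000) & (customers["DueAmount"] <= 60000)
]

print(mid_range)
```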

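Example - Filter by two conditions (&)

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Both conditions must hold. Each condition is wrapped in parentheses
# because & binds more tightly than comparison operators such as >.
customers1 = customers.loc[(customers['Area'].isin(['Colombo', 'Gampaha'])) &
                           (customers['DueAmount'] > 50000)]
print(customers1)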
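
Besides & for AND, pandas boolean masks also support | for OR and ~ for NOT. The following is a small sketch on made-up sample data (not the real Customer.xlsx) showing both operators.

```python
import pandas as pd

# Hypothetical sample data (values are invented for illustration)
customers = pd.DataFrame({
    "Name": ["Kamal", "Nimal", "Kumari", "Sunil"],
    "Area": ["Colombo", "Gampaha", "Kandy", "Colombo"],
    "DueAmount": [75000, 20000, 60000, 45000],
})

# OR: keep a row if at least one of the conditions is True
either = customers.loc[
    (customers["Area"] == "Kandy") | (customers["DueAmount"] > 50000)
]

# NOT: ~ inverts a boolean mask, here giving "Area is not in the list"
not_listed = customers.loc[~customers["Area"].isin(["Colombo", "Gampaha"])]

print(either)
print(not_listed)
```

Python's own and, or, and not keywords do not work element by element on a Series, which is why the symbolic operators are used here.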

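Example - Filter by a date

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep the rows with a LastTransaction after 19 May 2020.
# Assumes LastTransaction was read as a datetime column; the date is
# written as YYYY-MM-DD so day and month cannot be confused.
customers1 = customers.loc[customers['LastTransaction'] > '2020-05-19']
print(customers1)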

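Example - Filter with a date range

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep the rows strictly between 19 May 2020 and 19 October 2020.
# Assumes LastTransaction was read as a datetime column.
customers1 = customers.loc[(customers['LastTransaction'] > '2020-05-19') &
                           (customers['LastTransaction'] < '2020-10-19')]
print(customers1)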
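
A date range can also be written with explicit pd.Timestamp bounds, which avoids relying on how a date string is parsed. The sketch below uses made-up sample dates (not the real Customer.xlsx) and contrasts the strict comparison with Series.between, which includes both endpoints by default.

```python
import pandas as pd

# Hypothetical sample data (names and dates are invented for illustration)
customers = pd.DataFrame({
    "Name": ["Kamal", "Nimal", "Kumari", "Sunil"],
    "LastTransaction": pd.to_datetime(
        ["2020-03-15", "2020-06-01", "2019-12-31", "2020-10-19"]
    ),
})

start = pd.Timestamp("2020-05-19")
end = pd.Timestamp("2020-10-19")

# Strict comparison: the end date itself is excluded
strict = customers.loc[
    (customers["LastTransaction"] > start) & (customers["LastTransaction"] < end)
]

# between(): both endpoints are included by default
inclusive = customers.loc[customers["LastTransaction"].between(start, end)]

print(strict)
print(inclusive)
```

The two results differ only in the row that falls exactly on the end date, so the choice depends on whether the boundaries should count.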

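Example - Filter by part of a date (month)

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep the rows whose LastTransaction falls in March of any year.
# The .dt accessor requires a datetime column.
customers1 = customers.loc[customers['LastTransaction'].dt.month == 3]
print(customers1)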

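Example - Filter by part of a date (year)

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep the rows whose LastTransaction falls in 2019.
# The .dt accessor requires a datetime column.
customers1 = customers.loc[customers['LastTransaction'].dt.year == 2019]
print(customers1)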
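
The .dt accessor only works on datetime columns. If the dates arrive as text, they can be converted first with pd.to_datetime. The sketch below assumes made-up sample data with dates stored as strings, converts them, and then combines the year and month filters with & to select a single month of a single year.

```python
import pandas as pd

# Hypothetical sample data with dates stored as text (values are invented)
customers = pd.DataFrame({
    "Name": ["Kamal", "Nimal", "Kumari", "Sunil"],
    "LastTransaction": ["2019-03-10", "2020-03-22", "2019-07-05", "2019-03-28"],
})

# Convert the text column to datetime so that .dt becomes available
customers["LastTransaction"] = pd.to_datetime(customers["LastTransaction"])

# Combine two date parts: transactions in March 2019 only
march_2019 = customers.loc[
    (customers["LastTransaction"].dt.year == 2019)
    & (customers["LastTransaction"].dt.month == 3)
]

print(march_2019)
```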

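Example - Filter by first letter of a text column

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# .str[0] takes the first character of each Name; keep the rows where it is 'K'
customers1 = customers.loc[customers['Name'].str[0] == 'K']
print(customers1)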
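
An alternative way to express the same first-letter filter is Series.str.startswith, which also handles prefixes longer than one character. The sketch below uses made-up sample names (including one missing value) and passes na=False so that missing names are explicitly treated as non-matching.

```python
import pandas as pd

# Hypothetical sample data; one name is missing on purpose
customers = pd.DataFrame({
    "Name": ["Kamal", "Nimal", None, "Kumari"],
})

# na=False makes rows with a missing Name count as "does not start with K"
starts_with_k = customers.loc[customers["Name"].str.startswith("K", na=False)]

print(starts_with_k)
```

The match is case-sensitive, so a name beginning with a lowercase k would not be selected.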

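Example - Filter columns

import pandas as pd

# Load the customer data from the Excel workbook
customers = pd.read_excel('Customer.xlsx')
print(customers)

# Keep only the Name and DueAmount columns (all rows are retained)
customers1 = customers.filter(items=['Name', 'DueAmount'])
print(customers1)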
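
Column selection can also be written with a list inside square brackets. The sketch below, on made-up sample data, shows that both forms return the same columns, and illustrates one behavioral difference: filter(items=...) quietly skips names that do not exist, while bracket selection raises a KeyError. The column name "Missing" is hypothetical and chosen only to trigger that case.

```python
import pandas as pd

# Hypothetical sample data (values are invented for illustration)
customers = pd.DataFrame({
    "Name": ["Kamal", "Nimal"],
    "Area": ["Colombo", "Gampaha"],
    "DueAmount": [75000, 20000],
})

# Two equivalent ways to keep only some columns
by_filter = customers.filter(items=["Name", "DueAmount"])
by_brackets = customers[["Name", "DueAmount"]]

# filter() ignores a column name that is not present
skipped = customers.filter(items=["Name", "Missing"])

# Bracket selection raises KeyError for the same request
try:
    customers[["Name", "Missing"]]
    raised = False
except KeyError:
    raised = True

print(by_filter)
print(skipped.columns.tolist(), raised)
```

Bracket selection is the stricter choice when a misspelled column name should be caught early.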
