
BHARATIYA VIDYA BHAVAN’S

SARDAR PATEL INSTITUTE OF TECHNOLOGY


(Empowered Autonomous Institute Affiliated to University of Mumbai)
[Knowledge is Nectar]

Department of Computer Science Engineering

Course - Data Analytics

UID 2021600022, 2021600033

Name Mahek Gupta, Shruti Kedari

Class and Batch BE AIML Batch B

Date 10-11-2024

Lab 9

Aim To perform association rule mining on a dataset (Apriori Algorithm)

Objective Association rule mining identifies patterns or relationships between items in large datasets.
In market basket analysis, it uncovers frequent item combinations, such as customers
buying bread and butter also purchasing milk. These insights help businesses optimize
product placements, promotions, and inventory management.

Theory
1. Association Rule Mining:

Association rule mining is a data mining technique used to find rules that predict the
occurrence of an item based on the occurrences of other items in a transaction. The rules
are typically represented in the form of "If-Then" statements, where if a particular set of
items (antecedent) is present, then there is a likelihood that another item or set of items
(consequent) will also be present in the same transaction.

Each rule has two main components:

● Support: Measures the frequency of occurrence of an itemset in the dataset. A higher support indicates that the rule is more frequently applicable.
● Confidence: Measures the likelihood that items in the consequent will also be present in transactions containing the antecedent.

For example: In a supermarket dataset, a rule like "If a customer buys bread and butter,
then they are 70% likely to buy milk" could be a common association.
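These two measures can be computed directly by counting transactions. A minimal sketch on a toy set of transactions (the item names here are illustrative, not taken from the lab dataset):

```python
# Compute support and confidence for the rule {bread, butter} -> {milk}
transactions = [
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"bread", "milk"},
    {"butter", "milk"},
    {"bread", "butter", "milk"},
]

antecedent = {"bread", "butter"}
consequent = {"milk"}

# Support: fraction of all transactions containing the full itemset
both = sum(1 for t in transactions if (antecedent | consequent) <= t)
support = both / len(transactions)

# Confidence: among transactions containing the antecedent,
# the fraction that also contain the consequent
ante_count = sum(1 for t in transactions if antecedent <= t)
confidence = both / ante_count

print(support)     # 0.4
print(confidence)  # ~0.667
```

Here 2 of 5 transactions contain all three items (support 0.4), and 2 of the 3 transactions containing bread and butter also contain milk (confidence ≈ 0.667).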

2. Apriori Algorithm:

The Apriori algorithm is a widely used algorithm in association rule mining for generating
frequent itemsets. It works by identifying the frequent individual items and extending them
to larger itemsets as long as they meet a minimum support threshold. This approach
reduces computational complexity by avoiding the generation of non-frequent itemsets.

Key steps in the Apriori algorithm:

1. Identify Frequent 1-itemsets: Find all individual items that meet the minimum
support threshold.
2. Generate Candidates for k-itemsets: From the (k-1)-itemsets that meet the
support threshold, create candidate k-itemsets by combining pairs of (k-1)-itemsets
that share a common prefix.
3. Prune Non-frequent Itemsets: Remove any candidate k-itemsets that do not meet
the minimum support.
4. Generate Association Rules: For each frequent itemset, generate association
rules and calculate their confidence. If the confidence meets the threshold, keep the
rule; otherwise, discard it.

Apriori Principle: This principle states that any subset of a frequent itemset must also be
frequent. The algorithm uses this property to prune the search space, reducing the number
of candidate itemsets.
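The steps above can be sketched in plain Python. This is a simplified illustration, not the optimized implementation used below: candidates are generated by taking unions of frequent (k-1)-itemsets, whereas the classic algorithm joins only itemsets sharing a common prefix.

```python
from itertools import combinations

def apriori_frequent(transactions, min_support):
    """Return all frequent itemsets (as frozensets) meeting min_support."""
    n = len(transactions)

    def support(itemset):
        return sum(1 for t in transactions if itemset <= t) / n

    # Step 1: frequent 1-itemsets
    items = {i for t in transactions for i in t}
    level = [frozenset([i]) for i in items if support(frozenset([i])) >= min_support]
    frequent = list(level)

    k = 2
    while level:
        # Step 2: candidate k-itemsets from unions of frequent (k-1)-itemsets
        candidates = {a | b for a, b in combinations(level, 2) if len(a | b) == k}
        # Step 3: prune candidates below the minimum support
        level = [c for c in candidates if support(c) >= min_support]
        frequent.extend(level)
        k += 1
    return frequent

# Example: with min_support = 0.5, all singles and all pairs survive,
# but the 3-itemset appears in only 1 of 4 transactions and is pruned
transactions = [
    {"milk", "bread"},
    {"milk", "butter"},
    {"bread", "butter"},
    {"milk", "bread", "butter"},
]
print(sorted(sorted(s) for s in apriori_frequent(transactions, 0.5)))
```

The Apriori principle is what makes the pruning in step 3 safe: no superset of a pruned candidate needs to be examined, since it cannot be frequent either.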

Implementation / Code

# Import necessary libraries
import pandas as pd
from mlxtend.frequent_patterns import apriori, association_rules
import matplotlib.pyplot as plt

# Step 1: Create the dataset
data = {
    'Milk': [1, 0, 1, 1, 0, 1, 1, 0, 1, 1],
    'Bread': [1, 1, 0, 1, 0, 1, 0, 1, 1, 1],
    'Butter': [1, 1, 1, 1, 1, 0, 1, 1, 0, 1],
    'Cheese': [0, 1, 0, 1, 0, 1, 0, 1, 0, 1],
    'Eggs': [1, 1, 0, 1, 1, 0, 1, 1, 1, 0],
    'Apples': [0, 1, 1, 0, 1, 1, 1, 0, 1, 1],
    'Bananas': [1, 0, 1, 1, 0, 1, 0, 1, 0, 1]
}

# Convert the dictionary to a DataFrame
df = pd.DataFrame(data)
df.index.name = 'Transaction'

# Save the DataFrame to a CSV file
df.to_csv('grocery_transactions.csv')
print("Dataset created and saved as 'grocery_transactions.csv'.")

# Step 2: Perform Association Rule Mining
# Load the dataset
basket = pd.read_csv('grocery_transactions.csv', index_col=0)

# Apply the Apriori algorithm to find frequent itemsets with a
# minimum support of 0.3 (30%)
frequent_itemsets = apriori(basket, min_support=0.3, use_colnames=True)

# Generate the association rules with a minimum confidence of 0.7 (70%)
rules = association_rules(frequent_itemsets, metric="confidence",
                          min_threshold=0.7)

# Display the generated rules
print("Association Rules:")
print(rules[['antecedents', 'consequents', 'support', 'confidence', 'lift']])

# Step 3: Visualization
# Scatter plot for Support vs. Confidence
plt.figure(figsize=(10, 6))
plt.scatter(rules['support'], rules['confidence'], alpha=0.5, color='purple')
plt.xlabel('Support')
plt.ylabel('Confidence')
plt.title('Support vs Confidence')
plt.show()

# Histogram of Lift values
plt.figure(figsize=(10, 6))
plt.hist(rules['lift'], bins=10, alpha=0.7, color='blue')
plt.xlabel('Lift')
plt.ylabel('Frequency')
plt.title('Distribution of Lift Values')
plt.show()

Output

Conclusion In conclusion, association rule mining, particularly through the Apriori algorithm, is a
powerful tool for discovering meaningful relationships and patterns in large datasets. By
identifying frequently occurring itemsets and generating association rules, it provides
valuable insights that can drive strategic business decisions, optimize product offerings,
and enhance customer experiences. This technique is widely applied in areas like retail,
healthcare, and marketing, where understanding item correlations is crucial for success.

References https://www.youtube.com/watch?v=zi_ydmbWfAs
