Dar Lec 15 Association Rules

Association rules are a data mining technique used to identify relationships within large datasets, particularly in market basket analysis. Key components include antecedents and consequents, with measurements such as support, confidence, and lift to assess the strength of these associations. The document also outlines steps to create association rules using R, including the use of the apriori function to generate and filter rules based on specified criteria.

Uploaded by

sharmahemant3610


Association rules

Compiled and presented by

Dr. Chetna Arora
• Association rules are a data mining technique
used to uncover interesting relationships,
patterns, or associations within large datasets.
They are widely used in market basket analysis
to identify products frequently purchased
together.
• Components of an Association Rule
• Antecedent (If):
– These are the items or conditions in the "if" part of the rule.
– Example: In a supermarket, bread is the antecedent in the rule: "If a
customer buys bread, they are likely to buy butter."
• Consequent (Then):
– These are the items or outcomes in the "then" part of the rule.
– Example: Butter is the consequent in the rule: "If a customer buys
bread, they are likely to buy butter."
• Rule Example:
• If {bread} then {butter}
• This means customers who buy bread are also likely to buy butter.
Key Measurements for Association Rules

• Support:
• Definition: Measures how frequently an itemset
appears in the dataset.
• Formula:
Support = (Number of transactions containing both antecedent and consequent) / (Total number of transactions)
• Example:
If bread and butter appear together in 40 out of 1,000
transactions:
• Support = 40/1000 = 0.04 (4% of transactions)
• Confidence:
• Definition: Measures how often the rule is true when the
antecedent occurs.
• Formula:
Confidence = (Number of transactions containing both antecedent and consequent) / (Number of transactions containing antecedent)
• Example:
If 50 transactions contain bread, and 40 of these also contain
butter:
• Confidence = 40/50 = 0.8 (80% confidence). This means
80% of the time, customers who buy bread also buy butter.
• Lift:
• Definition: Measures how much more likely the antecedent
and consequent occur together compared to if they were
independent.
• Formula: Lift=Confidence/Support of consequent
• Example:
If butter appears in 100 out of 1,000 transactions:
Support of butter=100/1000=0.1
• Using the previous confidence (0.8): Lift=0.80/0.1=8
• A lift of 8 means that customers buying bread are 8 times
as likely to buy butter as customers in general (80% versus 10%).
• Lift > 1: This indicates that the antecedent and consequent
are positively associated—the occurrence of the
antecedent makes the consequent more likely to occur
than by random chance.
• Lift = 1: This indicates no association—the antecedent and
consequent occur together as frequently as they would if
they were independent.
• Lift < 1: This indicates a negative association—the
occurrence of the antecedent makes the consequent less
likely to occur than by random chance.
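The three measures above can be checked with a few lines of base R. This is a minimal sketch that uses the counts from the bread and butter example; the variable names and the interpret_lift() helper are illustrative assumptions, not part of the original slides.

```r
# Counts taken from the bread/butter example above
n_total  <- 1000  # all transactions
n_both   <- 40    # transactions containing bread and butter
n_bread  <- 50    # transactions containing bread (antecedent)
n_butter <- 100   # transactions containing butter (consequent)

support    <- n_both / n_total                   # 40/1000
confidence <- n_both / n_bread                   # 40/50
lift       <- confidence / (n_butter / n_total)  # confidence / support of consequent

# Hypothetical helper: map a lift value to the three cases listed above.
# A small tolerance is used because lift is a floating-point number.
interpret_lift <- function(lift, tol = 1e-9) {
  if (abs(lift - 1) < tol) {
    "no association"
  } else if (lift > 1) {
    "positive association"
  } else {
    "negative association"
  }
}

print(c(support = support, confidence = confidence, lift = lift))
print(interpret_lift(lift))
```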
• Example:
Transaction ID    Items Bought
1                 Bread, Butter, Milk
2                 Bread, Butter
3                 Bread, Milk
4                 Butter, Milk
5                 Bread, Butter, Eggs

Rule: {Bread} → {Butter}
Support: Bread and Butter appear together in 3 out of 5
transactions. Support = 3/5 = 0.6 (60%)
Confidence: Bread appears in 4 transactions, and 3 of those
include Butter. Confidence = 3/4 = 0.75 (75%)
Lift: Butter appears in 4 out of 5 transactions, so the support of
the consequent is 4/5 = 0.8. Lift = 0.75/0.8 = 0.9375
A lift slightly below 1 suggests a weak negative association: buying
Bread makes Butter slightly less likely than its overall rate of 80%.
Note: The support of the consequent is how frequently Butter appears
in the dataset, regardless of whether Bread is purchased or not.
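The same figures can be reproduced directly from the five transactions. The sketch below uses base R only; support_of() and rule_metrics() are hypothetical helper names introduced here for illustration.

```r
# The five transactions from the example table
transactions <- list(
  c("Bread", "Butter", "Milk"),
  c("Bread", "Butter"),
  c("Bread", "Milk"),
  c("Butter", "Milk"),
  c("Bread", "Butter", "Eggs")
)

# Fraction of transactions that contain every item in `items`
support_of <- function(items, transactions) {
  mean(vapply(transactions, function(tx) all(items %in% tx), logical(1)))
}

# Support, confidence and lift for the rule antecedent -> consequent
rule_metrics <- function(antecedent, consequent, transactions) {
  supp <- support_of(c(antecedent, consequent), transactions)
  conf <- supp / support_of(antecedent, transactions)
  lift <- conf / support_of(consequent, transactions)
  c(support = supp, confidence = conf, lift = lift)
}

print(rule_metrics("Bread", "Butter", transactions))
```

Swapping the arguments, as in rule_metrics("Butter", "Bread", transactions), gives the metrics for the reverse rule. Lift is symmetric, so it comes out the same in both directions, while confidence generally differs.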
Metric | Meaning | Interpretation
Support | How often the itemset (both antecedent and consequent) occurs in the dataset. | Higher support indicates a frequent pattern.
Confidence | The likelihood of the consequent occurring when the antecedent is present. | Higher confidence means the rule is more reliable.
Lift | How much the presence of the antecedent increases the likelihood of the consequent. | Lift > 1 indicates a positive association, Lift = 1 means independence, Lift < 1 indicates a negative association.
• Steps to Create Association Rules in R
• Install and load the arules package.
• Load a transactional dataset.
• Use the apriori() function to generate rules.
• Inspect and interpret the rules.
• Example in R
• Here’s a step-by-step guide with a simple
example:
• Step 1: Install and Load Required Package
• install.packages("arules")
• # Install only if not already installed
• library(arules)
• Step 2: Load Dataset
• We’ll use the built-in Groceries dataset from
the arules package.
• data("Groceries")
• summary(Groceries)
• Step 3: Generate Association Rules
• rules <- apriori(Groceries, parameter = list(support = 0.01, confidence = 0.5))
• # Adjust values as needed
• apriori()
• apriori() is a function from the arules package in R used to apply the Apriori
algorithm to a dataset. The Apriori algorithm finds frequent itemsets and generates
association rules based on the given support and confidence thresholds.
• In simple terms, it finds patterns like "if item X is bought, item Y is likely to be bought
too."
• support = 0.01: This specifies the minimum support threshold.
• Support measures how frequently an item or itemset appears in the dataset. With
support = 0.01, only itemsets that appear in at least 1% of the total transactions
are considered.
• confidence = 0.5: This specifies the minimum confidence threshold.
• Confidence measures how often the consequent (the item in the "then" part of the
rule) appears in transactions where the antecedent (the item in the "if" part)
appears.
• If you set confidence = 0.5, you're looking for rules where, when the antecedent is
bought, the consequent is bought at least 50% of the time.
• For example, a rule like {Bread} → {Butter} is kept only if at least 50% of the
transactions that contain bread also contain butter.
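To see how the two thresholds act on a dataset small enough to check by hand, apriori() can be run on the five toy transactions from the earlier worked example. This is a sketch under assumptions: the thresholds (support = 0.5, confidence = 0.5) are chosen only to suit a five-row dataset, minlen = 2 is added to skip rules with an empty antecedent, and the object names are illustrative.

```r
library(arules)

# The five toy transactions from the earlier worked example
baskets <- list(
  c("Bread", "Butter", "Milk"),
  c("Bread", "Butter"),
  c("Bread", "Milk"),
  c("Butter", "Milk"),
  c("Bread", "Butter", "Eggs")
)

# Convert the list of item vectors into an arules transactions object
trans <- as(baskets, "transactions")

# Thresholds here are illustrative values for a tiny dataset
rules_small <- apriori(
  trans,
  parameter = list(support = 0.5, confidence = 0.5, minlen = 2),
  control   = list(verbose = FALSE)
)

# One row per rule, with the quality measures computed by arules
rule_table <- data.frame(rule = labels(rules_small), quality(rules_small))
print(rule_table)
```

Raising support or confidence shrinks the rule set, and lowering them grows it, which is why the slides suggest adjusting the values as needed.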
• Step 4: View and Inspect Rules
• # View the top 5 rules
• inspect(head(rules, 5))
• This prints 5 rows, each showing one rule (if, then) with its support, confidence, and lift.
• Step 5: Filter Rules (Optional)
• Filter rules to focus on specific criteria, like high lift:
• filtered_rules <- subset(rules, lift > 1.5)
• inspect(head(filtered_rules, 5))
• This code keeps only the rules in the rules object whose lift
is greater than 1.5.
• Why 1.5?: A lift threshold keeps only the rules in which the
antecedent makes the consequent noticeably more likely. With
lift > 1.5, the consequent is at least 50% more likely than its
overall rate. A threshold set too low (close to 1) lets through
many weak associations. The value 1.5 is a rule of thumb, so
adjust it to the dataset.
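Filtering is often combined with sorting and with a condition on the items themselves. The sketch below assumes the same Groceries rules as in Step 3. The item "whole milk" and the object names are illustrative choices, not taken from the slides.

```r
library(arules)
data("Groceries")

# Same thresholds as in Step 3
rules <- apriori(
  Groceries,
  parameter = list(support = 0.01, confidence = 0.5),
  control   = list(verbose = FALSE)
)

# Keep rules above the lift threshold, then order them strongest first
strong_rules <- sort(subset(rules, lift > 1.5), by = "lift")

# Optionally narrow down to rules that predict one item of interest
milk_rules <- subset(strong_rules, rhs %in% "whole milk")

inspect(head(strong_rules, 5))
```

Conditions can be combined inside subset(), for example lift > 1.5 & confidence > 0.55, when one measure alone leaves too many rules.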
• Real-World Example
• If a rule says: {milk} => {bread} [support=0.02,
confidence=0.8, lift=3]
• 2% of transactions include milk and bread
together.
• 80% of the time, bread is bought when milk is
bought.
• Customers who buy milk are 3 times as likely to buy bread
as customers in general.
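The three numbers in such a rule also determine how common each item is on its own, which helps when judging whether a rule is plausible. A short base R sketch, using only the definitions of support, confidence, and lift given earlier (variable names are illustrative):

```r
# Values reported for the rule {milk} => {bread}
support    <- 0.02
confidence <- 0.8
lift       <- 3

# confidence = support / support(milk)   =>  support(milk)  = support / confidence
support_milk <- support / confidence

# lift = confidence / support(bread)     =>  support(bread) = confidence / lift
support_bread <- confidence / lift

# Joint frequency that would be expected if milk and bread were independent
expected_if_independent <- support_milk * support_bread

print(c(support_milk = support_milk,
        support_bread = support_bread,
        expected_if_independent = expected_if_independent))
```

The observed support (0.02) divided by the independence baseline gives back the lift of 3, which is another way of reading "3 times more likely".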
