Ch-4 Data Mining Knowledge Representation Premitives
Ch-4 Data Mining Knowledge Representation Premitives
PRIMITIVES
• Data Mining:
Task-Relevant Data
Interestingness Measures
[2.2%,60%]
BACKGROUND KNOWLEDGE : CONCEPT
HIERARCHIES
Schema hierarchies
Set grouping hierarchies
Operation-derived hierarchies
certainty ,
utility and
novelty.
confidence(A=>B)=
#_tuples_containing_both_A_and_B
#_tuples_containing_A
• UTILITY:
It can be estimated by a utility function such as support. The
support of an associated pattern refers to the percentage of task-relevant
data tuples for which the pattern is true .for associated rules of the form
“A=>B” where A and B are set of items,
support(A=>B) = #_tuples_containing_both_A_and_B
total_#_of_tuples
• NOVELTY:
It contribute new information or increased performed to the given
pattern set. Novelty is removed redundant patterns. For eg: a data
exception may be considered novel in it differs from that based on
statistical model or user beliefs.
location(X,”CANADA”) => buys( X,”SONY_TV”) [8%, 70%]
PRESENTATION AND VISUALIZATION OF
DISCOVERED PATTERNS