Data Mining
Data Mining
DATA MINING
1
What is Data Mining
3
Why do we need Data Mining?
5
Major Characteristics of Data Mining
Data are often buried deep within very large databases, which
sometimes contain data from several years.
7
Data Mining Tasks\Algorithms(fall Into Four
Broad Categories):
Classification
Clustering
Association Rule Discovery
Sequential Pattern Discovery
8
Data Mining Tasks\Algorithms
condition
Data Mining algorithms(Fall into four broad
categories):
Classification is sorting cases into groups so that members of the same group
are strongly associated in some meaningful way.
Cluster analysis identifying the common characteristics shared by members
of groups in transactions, and interpret that into a case.
11
Data Mining algorithms(Fall into four broad
categories):
3. Association
– Establishes relationship about items that
occur together in a given record Placing batteries in the
– Determining associations among items toys
that sell together If a customer buys bread,
– Often called market basket analysis as they are also likely to
buy milk
the primary applications is the analysis of
sales transactions
– Application example : Market basket
analysis 12
Data Mining algorithms(Fall into four broad
categories):
Unemployed
consumer who
4. Sequence discovery purchased pre paid
– The identification of association over telco service are
time most likely to
convert to postpaid
– Some sequence discovery techniques upon being employed
keep track of elapsed time between
associated events and the frequency of Purchase of
occurrences machinery will later
be followed by the
– Application example : Market basket purchase of
analysis over time, customer life maintenance service
13
cycle analysis
14
Types of data mining (Two types)
Business Use
Banking
Forecasting levels of bad loans, fraud in credit card usage,
Where data mining is beneficial (the intent in most of these examples is to
credit card spending pattern, new loans
identify a business opportunity and create a sustainable competitive advantage).
Retailing andblanks.Predicting sales, determining correct inventory levels and
Fill in the
sales distribution schedules
Manufacturing Predicting when to expect machinery failures
and production
Marketing Predicting which customers will respond to Internet
banners or buy a particular products
16
Use in Business
Business Use
17
Understanding Customer Behavior
– Cluster/group the terms e.g. the term spillage and associate with
other key terms such as coffee, tea, soup, drink
– Can identify incidents that might lead to trouble and help 21
25
Types of Web Mining
26