OLAM and Data Mining: Concepts and Techniques
[Figure: layered diagram. The potential to support business decisions increases from the data sources at the base (paper, files, database systems, OLTP, WWW) up to the end user making decisions at the top.]
An OLAM Architecture
[Figure: OLAM architecture diagram. Labeled components: mining query and mining result; OLAM engine and OLAP engine; data cube API; MDDB and metadata; filtering and integration; database API; filtering.]
Data Mining: Confluence of Multiple Disciplines
• Database systems, data warehouse and OLAP
• Statistics
• Machine learning
• Visualization
• Information science
• High performance computing
• Other disciplines:
– Neural networks, mathematical modeling, information
retrieval, pattern recognition, etc.
Data Mining: On What Kind of Data?
• Relational databases
• Data warehouses
• Transactional databases
• Advanced DB systems and information
repositories
– Object-oriented and object-relational databases
– Spatial databases
– Time-series data and temporal data
– Text databases and multimedia databases
– Heterogeneous and legacy databases
– WWW
Data Mining Functionality
Data mining methods may be classified into six
basic classes:
• Associations
– Finding rules like “if a customer buys mustard,
sausage, and beer, then the probability that he/she also
buys chips is 50%”
• Classifications
– Classify data based on the values of the decision
attribute, e.g. classify patients based on their “state”
• Clustering
– Group data to form new classes, e.g. cluster customers
based on their behavior to find common patterns
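The 50% figure in the association rule quoted above is usually read as the rule's confidence: among the transactions that contain the antecedent items, the fraction that also contain the consequent. The following is a minimal sketch under that assumption. The function names `support` and `confidence` and the `baskets` data are hypothetical, invented for illustration, and not taken from the slides.

```python
from typing import FrozenSet, List


def support(transactions: List[FrozenSet[str]], itemset: FrozenSet[str]) -> float:
    """Fraction of all transactions that contain every item in `itemset`."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(
    transactions: List[FrozenSet[str]],
    antecedent: FrozenSet[str],
    consequent: FrozenSet[str],
) -> float:
    """Estimate P(consequent | antecedent) from transaction counts."""
    base = sum(1 for t in transactions if antecedent <= t)
    if base == 0:
        return 0.0  # the rule never fires, so there is no evidence for it
    both = sum(1 for t in transactions if (antecedent | consequent) <= t)
    return both / base


# Hypothetical market baskets, made up for this example.
baskets = [
    frozenset({"mustard", "sausage", "beer", "chips"}),
    frozenset({"mustard", "sausage", "beer"}),
    frozenset({"mustard", "sausage", "beer", "chips", "bread"}),
    frozenset({"mustard", "sausage", "beer", "bread"}),
    frozenset({"beer", "chips"}),
    frozenset({"bread", "milk"}),
]

rule_conf = confidence(
    baskets, frozenset({"mustard", "sausage", "beer"}), frozenset({"chips"})
)
rule_supp = support(baskets, frozenset({"mustard", "sausage", "beer", "chips"}))
print(f"confidence = {rule_conf:.2f}, support = {rule_supp:.2f}")
```

In this toy data, four baskets contain mustard, sausage, and beer, and two of those also contain chips, so the rule's confidence is 0.5. Support is reported alongside it because a high-confidence rule that applies to very few transactions is rarely useful.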
Data Mining Functionality
• Sequential patterns
– Finding rules like “if a customer buys a TV and, a few
days later, a camera, then the probability that he/she
will buy a video within 1 month is 50%”
• Time-Series similarities
– Finding similar sequences (or subsequences) in time
series (e.g. stock analysis)
• Outlier detection
– Finding anomalies/exceptions/deviations in data
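As one illustration of outlier detection, the sketch below flags values that lie far from the mean, measured in standard deviations (a z-score test). This is only one of many possible techniques and is not necessarily the one the slides have in mind. The function name `find_outliers`, the threshold of 2.0, and the sample data are all assumptions made for this example.

```python
from statistics import mean, pstdev
from typing import List, Sequence


def find_outliers(values: Sequence[float], threshold: float = 2.0) -> List[int]:
    """Return indices of values whose absolute z-score exceeds `threshold`."""
    if len(values) < 2:
        return []
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation
    if sigma == 0:
        return []  # all values are identical, so nothing deviates
    return [i for i, v in enumerate(values) if abs(v - mu) / sigma > threshold]


# Hypothetical daily purchase counts with one obvious anomaly at index 7.
daily_purchases = [10, 11, 9, 10, 12, 10, 11, 50]
outlier_indices = find_outliers(daily_purchases, threshold=2.0)
print(outlier_indices)
```

A z-score test is simple but sensitive to the outliers themselves, because extreme values inflate both the mean and the standard deviation. Robust alternatives based on the median are often preferred for small or heavily contaminated samples.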