Definition of Data Mining
Definition of Data Mining
Table of Contents
INTRODUCTION...........................................................................................................................................2
Definition of Data mining..........................................................................................................................3
Here we can see some role of data mining as example [2]:.................................................................3
Tasks in Data Mining:...............................................................................................................................4
Definition Of Data Warehouse..................................................................................................................5
Key characteristics of a data warehouse..............................................................................................5
OLTP vs. OLAP: Powerhouses for Different Data Processing Needs [5]:.............................................5
Purpose:.................................................................................................................................................5
Data Model:.............................................................................................................................................6
Performance:...........................................................................................................................................6
Users:.......................................................................................................................................................6
OLTP vs. OLAP: A Tale of Two Data Processing Systems...............................................................................7
CONCLUSSION.........................................................................................................................................7
References...................................................................................................................................................8
1. INTRODUCTION
In this topic I’m going to discuss the definition of data mining the role of data mining and the
tasks of data mining. The second point is discussed in this topic is data warehouse the character
of data warehouse and the difference and similarity of LTP (On-Line Transaction Processing) and
OLAP (On-Line Analytical Processing
Definition of Data mining
Data mining is the process of extracting meaningful patterns and trends from large datasets.
Imagine sifting through a mountain of unrefined ore to discover valuable minerals. Similarly,
data mining helps unearth valuable insights hidden within vast quantities of data [1].
From this definition data mining means gathering searching or digging of meaningful data.
o Data Preparation: This crucial step involves cleaning and pre-processing the data to
ensure its accuracy and consistency. Think of organizing your messy mining site before
you start digging!
o Data Integration: Data from various sources might need to be combined and reconciled
for a holistic view. Imagine combining data from different mine shafts to get a complete
picture.
o Data Selection: Focusing on relevant subsets of data can be more efficient for specific
tasks. Just like targeting specific areas within the mine for the minerals you seek.
o Data Transformation: Data might need to be transformed into a format suitable for
analysis techniques. This could involve scaling numerical values or converting data types.
Similar to crushing and grinding the ore to make it easier to process.
o Modeling and Pattern Discovery: Here's where the magic happens! Various algorithms
are applied to identify patterns and relationships within the data. Imagine using
specialized tools to separate the valuable minerals from the rest of the material.
o Evaluation and Interpretation: The discovered patterns need to be validated and assessed
for their business significance. Just like evaluating the quality and quantity of the
extracted minerals before you celebrate!
Simply from this point of definition I simply concluded that Data Warehouse is a centralized
hub for data analysis [3].
Subject-oriented: Data is organized around business subjects (e.g., sales, customers, products)
rather than by application source.
Integrated: Data from multiple sources is transformed and cleansed to ensure consistency.
Read-optimized: Designed for querying and analysis, not for real-time transactions.
Data warehouses are a cornerstone of Business Intelligence (BI) systems, providing a foundation
for data exploration, reporting, and data mining activities.
OLTP vs. OLAP: Powerhouses for Different Data Processing Needs [5]:
LTP (On-Line Transaction Processing) and OLAP (On-Line Analytical Processing) are two
fundamental data processing systems, but they serve distinct purposes.
Purpose:
OLTP: Focuses on handling a high volume of short, concurrent transactions in real-time.
Imagine processing an online bank transfer or updating inventory levels after a sale.
OLAP: Concentrates on analyzing large datasets to identify trends and patterns. Think
about analyzing sales data over years to understand customer buying habits.
Data Model:
OLTP: Employs normalized database structures to minimize data redundancy and ensure
data integrity for transactions. This avoids inconsistencies when multiple users modify
the same data.
OLAP: Often utilizes denormalized or multidimensional data models. These models may
contain some redundancy but allow faster retrieval and analysis of complex relationships
within the data.
Performance:
OLTP: Prioritizes fast response times for individual transactions. This ensures users
experience minimal delays when performing tasks like placing orders or checking
account balances.
OLAP: Focuses on efficient retrieval of large datasets for analysis. Query times might
be longer compared to OLTP, but the goal is to provide comprehensive insights.
Users:
OLTP: Supports operational tasks. Users include tellers, customer service
representatives, and anyone involved in day-to-day transactions.
OLAP: Empowers data analysis. Users include business analysts, data scientists, and
managers who seek insights from historical data to make informed decisions.
Example:
Online Banking:
OLTP: When you transfer funds online, the system performs an OLTP transaction,
debiting your account and crediting the recipient's in real-time.
OLAP: Later, a data analyst might use OLAP to analyze historical transaction data to
identify trends in customer spending habits or detect potential fraudulent activity.
In essence, OLTP and OLAP work together. OLTP systems provide the raw data for everyday
operations, which is then fed into data warehouses for OLAP analysis. Understanding these
differences is crucial for designing data processing solutions tailored to an organization's specific
needs.
OLTP vs. OLAP: A Tale of Two Data Processing Systems
CONCLUSSION
Finaly, I try to conclude what is data mining and data warehouse and about OLTP AND OLAP.
Data mining with simple understanding means gathering or extracting or searching or digging of
valuable data something like that. While data warehouse means repository part of stored data
from a certain source in organization the last concept but not the least is about OLTP and OLAP:
LTP (On-Line Transaction Processing) and OLAP (On-Line Analytical Processing) are two
fundamental data processing systems.
References
[1] M. S. V. K. Pang-Ning Tan, Pang-Ning Tan, Michael Steinbach, Vipin Kumar. Introduction to Data
Mining (Second Edition).