Data Mining
Fall 2024
Data Mining
• Data mining is a set of methods applied to large and complex databases in
order to filter out randomness and discover hidden patterns. Because these
methods are almost always computationally intensive, we rely on data mining
tools, methodologies, and theories to reveal patterns in data.
• Data mining is a complex process that involves intensive data warehousing
as well as powerful computational technologies.
• Furthermore, data mining is not limited to the extraction of data; it is
also used for transformation, cleaning, data integration, and pattern
analysis. Another term for data mining is Knowledge Discovery.
Key features of Data Mining
• Predicting patterns based on trends in the data.
• Calculating predictions for likely outcomes.
• Creating information in response to the analysis.
• Focusing on larger databases.
• Clustering the data and presenting the groups visually.
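As a small illustration of predicting from a trend, the sketch below fits a straight line to a short series by least squares and extends it one step. The `monthly_sales` figures and the `fit_trend` helper are hypothetical and chosen only for the example; real data mining tools use far richer models.

```python
def fit_trend(values):
    """Fit a least-squares line through the points (0, values[0]), (1, values[1]), ...

    Returns (slope, intercept). Needs at least two values.
    """
    n = len(values)
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(values))
    var = sum((x - mean_x) ** 2 for x in range(n))
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical monthly sales figures (illustrative only).
monthly_sales = [100, 110, 120, 130]

slope, intercept = fit_trend(monthly_sales)

# Extend the fitted line one step beyond the last observation.
forecast = intercept + slope * len(monthly_sales)
print(f"trend: +{slope:.1f} per month, next month forecast: {forecast:.1f}")
```

A straight line is the simplest possible trend model; it is used here only to make the idea of "prediction based on trends" concrete.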
Data Mining Steps
• Step 1: Data Cleaning – In this step, the data is cleaned so that no noise or irregularities remain in it.
• Step 2: Data Integration – In the process of Data Integration, we combine multiple data sources into one.
• Step 3: Data Selection – In this step, we select the data relevant to the analysis from the database.
• Step 4: Data Transformation – In this step, we transform the data by performing summary analysis and aggregation operations.
• Step 5: Data Mining – In this step, we extract useful patterns from the pool of existing data.
• Step 6: Pattern Evaluation – We analyze the patterns that are present in the data.
• Step 7: Knowledge Representation – In the final step, we present the knowledge to the user in the form of trees, tables, graphs, and matrices.
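The seven steps above can be sketched as a toy pipeline in plain Python. Everything here is an assumption made for illustration: the purchase records, the function names, and the choice of "items bought together" as the pattern to mine. It is a minimal sketch of the flow, not how any particular tool implements it.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: two sources of (customer, item) purchase records.
store_a = [("c1", "bread"), ("c1", "milk"), ("c2", "bread"),
           ("c2", None), ("c1", "gift card")]
store_b = [("c2", "milk"), ("c3", "bread"), ("c3", "eggs"),
           ("c4", "milk"), ("c4", "eggs"), ("c1", "milk")]


def clean(records):
    """Step 1: drop records with missing fields and exact duplicates."""
    seen, out = set(), []
    for rec in records:
        if None in rec or rec in seen:
            continue
        seen.add(rec)
        out.append(rec)
    return out


def integrate(*sources):
    """Step 2: combine multiple data sources into one list."""
    return [rec for src in sources for rec in src]


def select(records, items_of_interest):
    """Step 3: keep only the records relevant to the analysis."""
    return [rec for rec in records if rec[1] in items_of_interest]


def transform(records):
    """Step 4: aggregate records into one basket (set of items) per customer."""
    baskets = {}
    for customer, item in records:
        baskets.setdefault(customer, set()).add(item)
    return baskets


def mine(baskets):
    """Step 5: count how often each pair of items occurs in the same basket."""
    counts = Counter()
    for items in baskets.values():
        counts.update(combinations(sorted(items), 2))
    return counts


def evaluate(pair_counts, n_baskets, min_support):
    """Step 6: keep pairs whose support (share of baskets) meets the threshold."""
    return {pair: count / n_baskets
            for pair, count in pair_counts.items()
            if count / n_baskets >= min_support}


def represent(patterns):
    """Step 7: present the surviving patterns as simple table rows."""
    return [f"{a} + {b}: support {s:.2f}"
            for (a, b), s in sorted(patterns.items())]


records = integrate(clean(store_a), clean(store_b))
records = select(records, {"bread", "milk", "eggs"})
baskets = transform(records)
patterns = evaluate(mine(baskets), len(baskets), min_support=0.5)
rows = represent(patterns)
for row in rows:
    print(row)
```

With this toy data, only the bread and milk pair appears in at least half of the baskets, so it is the only pattern that survives the evaluation step.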
Data Mining Applications
• Market and Stock Analysis
• Fraud Detection
• Risk Management and Corporate Analysis
• Analyzing the Customer Lifetime Value
Data Mining Tools
• RapidMiner
• Weka
• KNIME
• Oracle Data Mining
• Teradata
• Orange
Advantages of Data Mining
• Marketing / Retail
• Finance / Banking
• Government Agencies
• Law Enforcement
• Researchers
• Increases Website Optimization
• Beneficial for Marketing Campaigns
• Increases Brand Loyalty
• To Predict Future Trends
• Quick Fraud Detection
Issues with Data Mining
• Requires a skilled person to carry out data mining
• Privacy Issues
• Security Issues
• Additional irrelevant information gathered
• Misuse of information