DATA Mining
DATA Mining
• Relational databases
• Data warehouses
• Advanced DB and information repositories
• Object-oriented and object-relational databases
• Transactional and Spatial databases
• Heterogeneous and legacy databases
• Multimedia and streaming database
• Text databases
• Text mining and Web mining
Data Mining Implementation Process
Data Mining implementation
process in details...
Business understanding
Sequential Association
Prediction
Patterns Rules
1.Classification: This analysis is used to retrieve important and relevant information about data, and metadata. This
data mining method helps to classify data in different classes.
2.Clustering: Clustering analysis is a data mining technique to identify data that are like each other. This process
helps to understand the differences and similarities between the data.
3.Regression: Regression analysis is the data mining method of identifying and analyzing the relationship between
variables. It is used to identify the likelihood of a specific variable, given the presence of other variables.
4.Association Rules: This data mining technique helps to find the association between two or more Items. It
discovers a hidden pattern in the data set.
5.Outer detection: This type of data mining technique refers to observation of data items in the dataset which do
not match an expected pattern or expected behavior. This technique can be used in a variety of domains, such as
intrusion, detection, fraud or fault detection, etc. Outer detection is also called Outlier Analysis or Outlier mining.
6.Sequential Patterns: This data mining technique helps to discover or identify similar patterns or trends in
transaction data for certain period.
7.Prediction: Prediction has used a combination of the other techniques of data mining like trends, sequential
patterns, clustering, classification, etc. It analyzes past events or instances in a right sequence for predicting a future
event.
Challenges of Implementation of Data mine
R-language: is an open source tool for statistical computing and graphics. R has a wide variety of statistical,
classical statistical tests, time-series analysis, classification and graphical techniques. It offers effective data
handing and storage facility.
Oracle Data Mining: popularly knowns as ODM is a module of the Oracle Advanced Analytics Database. This
Data mining tool allows data analysts to generate detailed insights and makes predictions. It helps predict customer
behavior, develops customer profiles, identifies cross-selling opportunities.
Benefits of Data Mining:
• There are chances of companies may sell useful information of their customers to other
companies for money. For example, American Express has sold credit card purchases of their
customers to the other companies.
• Many data mining analytics software is difficult to operate and requires advance training to
work on.
• Different data mining tools work in different manners due to different algorithms employed in
their design. Therefore, the selection of correct data mining tool is a very difficult task.
• The data mining techniques are not accurate, and so it can cause serious consequences in
certain conditions.
Data Mining Applications
Applications Usage
Communications Data mining techniques are used in communication sector to predict customer behavior to offer highly targeted and relevant campaigns.
Insurance Data mining helps insurance companies to price their products profitable and promote new offers to their new or existing customers.
Education Data mining benefits educators to access student data, predict achievement levels and find students or groups of students which need extra
attention. For example, students who are weak in maths subject.
Manufacturing With the help of Data Mining Manufacturers can predict wear and tear of production assets. They can anticipate maintenance which helps them
reduce them to minimize downtime
Banking Data mining helps finance sector to get a view of market risks and manage regulatory compliance. It helps banks to identify probable defaulters to
decide whether to issue credit cards, loans, etc.
Retail Data Mining techniques help retail malls and grocery stores identify and arrange most sellable items in the most attentive positions. It helps store
owners to comes up with the offer which encourages customers to increase their spending.
Service providers like mobile phone and utility industries use Data Mining to predict the reasons when a customer leaves their company. They
Service Providers analyze billing details, customer service interactions, complaints made to the company to assign each customer a probability score and offers
incentives.
E-commerce websites use Data Mining to offer cross-sells and up-sells through their websites. One of the most famous names is Amazon, who use
E-Commerce
Data mining techniques to get more customers into their eCommerce store.
Data Mining allows supermarket’s develope rules to predict if their shoppers were likely to be expecting. By evaluating their buying pattern, they
Super Markets could find woman customers who are most likely pregnant. They can start targeting products like baby powder, baby shop, diapers and so on.
Crime Data Mining helps crime investigation agencies to deploy police workforce (where is a crime most likely to happen and when?), who to search at
Investigation a border crossing etc.
Bioinformatics Data Mining helps to mine biological data from massive datasets gathered in biology and medicine.