DWDM
Subject Title Data Warehousing and Data Mining Techniques in Business and Commerce
Credit Value 3
Level 4
Objectives
This subject aims at equipping students with the latest knowledge and skills to:
• Create a clean, consistent repository of data within a data warehouse for large
corporations;
• Utilize various techniques developed for data mining to discover interesting
patterns in large databases;
• Use existing commercial or public-domain tools to perform data mining tasks
to solve real problems in business and commerce;
• Expose students to new techniques and ideas that can be used to improve the
effectiveness of current data mining tools.
Intended Learning Outcomes
Upon completion of the subject, students will be able to:
Professional/academic knowledge and skills
(a) understand why there is a need for a data warehouse in addition to traditional
operational database systems;
(c) design a data warehouse and understand the process required to construct one;
(d) understand why there is a need for data mining and in what ways it is different
from traditional statistical techniques;
(f) solve real data mining problems by using the right tools to find interesting
patterns;
(h) obtain hands-on experience with some popular data mining software;
(i) solve real-world problems in business and commerce using data mining and
data warehousing tools;
(j) learn independently and search for relevant information to write reports to
recommend appropriate data warehousing and data mining tools;
(k) solve complex problems individually or in groups and develop group work
skills directly and indirectly.
8. Clustering 6
Clustering; k-means algorithm; hierarchical algorithm;
Condorcet; neural network and genetic algorithms based
approach; evaluation of effectiveness.
9. Sequential data mining 3
Sequential data mining; time dependent data and temporal
data; time series analysis; sub-sequence matching;
classification and clustering of temporal data; prediction.
10. Other techniques 6
Computational intelligence techniques; fuzzy logic, genetic
algorithms and neural networks for data mining.
Total 42
Laboratory Experiment:
Topic  Duration of Laboratory (Hrs.)
1. Knowledge discovery lifecycle using CRISP-DM 2
2. Discover association rules and sequential patterns using Clementine 2
3. Discover classification rules using Clementine 2
4. Discover clusters using Clementine 1
Total 7
Case Study:
• Application of data mining techniques to solve real business problems.
• Attributes leading to success and failure of data warehousing projects.
Tutorials when appropriate.
Teaching/Learning Methodology
This subject consists mainly of class lectures and laboratory sessions. In the
class lectures, various cases will be presented to help students understand why
a data warehouse needs to be built and why data mining is important for
modern-day business intelligence. Students will be given time to participate in
discussions when the cases are presented.
All assignments and projects will also be given in the form of cases
collected to let students learn more about how data warehouses and data
mining can be and have been used in real business environments. For the projects
and assignments, students are expected to learn independently and think critically
with minimal guidance. They are expected to practice their writing skills through
project documentation and report writing. As students will work in teams on the
project, they are also expected to learn to work with each other collaboratively.
1. Assignments 55%
2. Project
3. Examination 45%
Total 100%
The assessment consists of written assignments, a group project and an examination. The
assignments and projects are designed to ensure that students are able to achieve
the learning outcomes intended for this subject. Students are expected to tackle a number of
cases drawn from different application areas in business and commerce so that they can
understand why there is a need for a data warehouse in addition to traditional operational
database systems and why data mining is important for modern-day business intelligence.
In addition, students will learn through the questions and cases when a particular data
warehouse architecture or data mining algorithm is useful and should be
used. Questions in the assignments are expected to help students learn the details of
the data mining algorithms and the use of popular data mining software. Students are also
expected to use popular tools such as Oracle Warehouse Builder to construct data
warehouses. For the projects, students are expected to work in groups of three to four to
tackle a real case involving the design of a data warehouse or the use of data mining to
mine very large databases. They are expected to learn how real-world problems in
business and commerce should be tackled using real-world tools such as Oracle’s Warehouse
Builder or IBM’s Clementine data mining system. They are expected to learn
independently and search for relevant information to write reports to recommend
appropriate data warehousing and data mining tools. Students are expected to practice
their writing skills through project documentation and report writing. They will learn to develop
critical thinking and teamwork skills.
Laboratory 7 Hrs.