07 Conceptual Design
07 Conceptual Design
04 QA ???
Conceptual Design
Conceptual Design
• Conceptual Design
– Transforms data requirements to conceptual model
Conceptual Design to abstract the users’ request to some information structures, which act as the bridge connecting the real world and the
machine world
Multidimensional Fact Identifying the fact of interest Comprehensive ME/R Dimensional Schema
Modeling Preliminary Identifying the dimensions ness ER Model Dimensional schema is
workload hierarchies Currency DFM designed to store data in a
Identifying measures Speed UML class way that:
Identifying aggregations diagram Emphasizes
understandibility
Enhances query
performance
Accommodates change
Basic Concept …
• A fact is a collection of data items related to business
transactions or represent business items. A facts consist of
measures and context data.
• A dimension is a collection of data related to one business
dimension. Contextual background for the facts are defined by the
dimensions; parameters to perform OLAP are also defined by
dimensions.
Basic Concept …
• A measure is a numerical attribute of a fact. Performance or behavior of the
business can be represented by a measure relative to the dimensions. An essential
decision in a measure definition is the lowest level of detail (sometimes called the
grain) in order to determine the type of analysis that can be performed.
• Aggregation is pre-calculated summaries of data came from the most granular
fact table. Aggregation is applied in the case when the analysis needs computation
through a number of dimensions and lots of rows of each dimension to calculate
metrics of fact table. Query performance can be improved using aggregate fact
tables without increasing overall storage space.
Modeling for DW
• For D W the models have to offer support for
multidimensional data
• In the relational model the classical goal is to
– Remove redundancy
– Allow efficient retrieval of individual records
• In the case of D W
– Redundancy is necessary to speed up queries
– OLAP queries usually involve multiple records
(range queries) and aggregates
Tools…
• Entity/relationship-based (E/R-based)
• Object-oriented
• Ad hoc models (Sen and Sinha, 2005).
Benefit of ER Extensions
• E/R has been tested for considerable time (years);
• E/R is a commonly used tool by many designers;
• a variety of application domains can be flexibly adapted by
E/R;
• substantial research results have been derived for E/R (Sapia
et al, 1999; Tryfona et al, 1999).
Benefits of OO Model
• the static and dynamic properties of information systems can be
better represented with these models;
• requirements and constraints can be expressed in a powerful
mechanism;
• data modelling is currently dominated by object-oriented
approaches;
• UML, in particular, is a standard and is extendable (Lujan-Mora et
al, 2002).
Multidim Conceptual Model
• Modeling business queries
– Define the purpose of the D W and decide on
the
subject(s) Time
– Identify questions of
interest
• Who bought the Customers
Busine
ss
Employees
products?
(customers and their structure)
Model