Dataset Documentation: Multivariate Data Analysis, Eighth Edition
A number of datasets are available to enable students and faculty to perform the multivariate
analyses described in the textbook. While some techniques require specialized datasets (e.g.,
multidimensional scaling, conjoint analysis and structural equation modeling), many of the
techniques are performed using conventional survey data.
HBAT
HBAT is a common dataset developed for use with many of the techniques, so that students can
see how the techniques relate to one another as well as how each works on its own. The HBAT
dataset comes in several forms used throughout the text:
HBAT – the primary dataset described in the text, with multiple metric and
nonmetric variables that allow its use in most of the multivariate techniques.
HBAT_200 – an expanded dataset, comparable to HBAT but with 200 rather than
100 respondents, that allows for multiple independent variables in MANOVA while
still maintaining adequate sample size in the individual cells.
HBAT_SPLITS – contains two variables that split the HBAT dataset into 50/50 and
60/40 subsamples. If desired, this dataset can be merged with the original HBAT
dataset to replicate the estimation and holdout/validation samples.
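The merge and split described for HBAT_SPLITS can be sketched as follows. This is only a minimal illustration using the Python standard library and toy stand-in data, not the actual files. The key column `id`, the split variable `split60`, the variable `x1`, and the 0/1 coding of the split variable are all hypothetical names and values chosen for the example; the real files may use different names, or may need to be matched by row order rather than by a key.

```python
def merge_on_key(hbat_rows, split_rows, key="id"):
    """Attach the split indicators to each HBAT row by matching on a shared key."""
    splits_by_key = {row[key]: row for row in split_rows}
    merged = []
    for row in hbat_rows:
        extra = splits_by_key[row[key]]  # raises KeyError if a respondent has no split row
        merged.append({**row, **{k: v for k, v in extra.items() if k != key}})
    return merged


def partition(rows, split_var, estimation_code=0):
    """Divide merged rows into estimation and holdout samples using one split variable."""
    estimation = [r for r in rows if r[split_var] == estimation_code]
    holdout = [r for r in rows if r[split_var] != estimation_code]
    return estimation, holdout


# Toy stand-in data (hypothetical): 10 respondents, 60/40 split coded 0/1.
hbat = [{"id": i, "x1": 5.0 + i} for i in range(1, 11)]
splits = [{"id": i, "split60": 0 if i <= 6 else 1} for i in range(1, 11)]

merged = merge_on_key(hbat, splits)
estimation, holdout = partition(merged, "split60")
print(len(estimation), len(holdout))  # prints "6 4"
```

With the real data, the same two steps would apply after loading both files; a data-frame library would typically replace the hand-written merge.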
Other HBAT Datasets
In addition to these primary datasets, there are several other datasets used with specific
techniques, including conjoint analysis, multidimensional scaling and structural equation
modeling. These datasets include:
HBAT_SEM – the original responses from 400 individuals, which form the basis for
the structural equation analyses of Chapters 10, 11 and 12. This dataset can be used
to derive the covariance matrices used as input to structural equation programs such
as LISREL, EQS or AMOS.
HBAT400_6CON – this dataset includes the 400 observations from the previous
analysis with the addition of indicators for a sixth construct as described below.
A Sixth Construct -- The HBAT researcher goes to a scales book and quickly finds
a six-item Supervisor Support scale. The scale consists of the following items:
There are several datasets used by other techniques covered in supplemental materials
available in the online resources. These techniques include conjoint analysis, multidimensional
scaling and correspondence analysis. The applicable datasets are:
Other Datasets
Finally, two additional datasets are provided to allow students access to data other than the
HBAT data files described in the textbook:
HATCO – this dataset was used in past editions of the textbook and provides
a simplified set of variables suitable for all of the basic multivariate techniques.
Dataset Formats
Given the widespread interchangeability of data formats among statistical programs, all of the
datasets are provided in two formats. The first is the .SAV format used in SPSS, which stores
variable descriptions and other documentation in a standard form. The second is an Excel
workbook containing all of the datasets, which can be imported into any statistical program.
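For readers working outside SPSS, either format can be read into Python. The sketch below is one possible approach, assuming pandas is installed along with its optional readers (pyreadstat for .SAV files, openpyxl for .xlsx workbooks). The function name `load_dataset` and the file names in the usage note are hypothetical, and the sheet names inside the workbook are not documented here, so the sheet argument defaults to the first sheet.

```python
from pathlib import Path


def load_dataset(path, sheet_name=0):
    """Load one dataset from either distributed format into a pandas DataFrame."""
    suffix = Path(path).suffix.lower()
    if suffix not in {".sav", ".xlsx", ".xls"}:
        raise ValueError(f"unsupported format: {suffix!r}")

    import pandas as pd  # imported lazily so the format check works without pandas

    if suffix == ".sav":
        # read_spss relies on the optional pyreadstat package;
        # convert_categoricals=False keeps the numeric codes instead of value labels.
        return pd.read_spss(path, convert_categoricals=False)

    # Excel workbook: sheet_name selects which dataset (sheet) to read.
    return pd.read_excel(path, sheet_name=sheet_name)
```

A call such as `load_dataset("HBAT.sav")` or `load_dataset("datasets.xlsx", sheet_name="HBAT")` would then return the data as a table; both file names and the sheet name are placeholders for whatever the downloaded files are actually called.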
Online Resources
All of the datasets, plus a wide range of associated materials, are available through our online
resources: Cengage Brain or the text-related website www.mvstats.com. We encourage users
to visit these sites, where some materials are available to all users and others are
reserved for adopters of the eighth and earlier editions.