
Dataset Documentation:

Multivariate Data Analysis, Eighth Edition

“The world’s leading authority on applied multivariate data analysis
based on number of citations, as reported by Google Scholar.”

A number of datasets are available to enable students and faculty to perform the multivariate
analyses described in the textbook. While some techniques require specialized datasets (e.g.,
multidimensional scaling, conjoint analysis and structural equation modeling), many of the
techniques are performed using conventional survey data.

HBAT

HBAT is a common dataset developed for use with many of the techniques to allow students to
see the interrelationships among techniques as well as the techniques themselves. The HBAT
dataset has several forms utilized throughout the text:

HBAT – the primary database described in the text which has multiple metric and
nonmetric variables allowing for use in most of the multivariate techniques.

HBAT_200 – an expanded dataset, comparable to HBAT except for 200 rather than
100 respondents, that allows for multiple independent variables in MANOVA while
still maintaining adequate sample size in the individual cells.

HBAT_MISSING – a reduced dataset with 70 respondents and missing data among
the variables. It is utilized in illustrating the techniques for diagnosis and remedy of
missing data described in Chapter 2.

HBAT_SPLITS – contains two variables that split the HBAT dataset into 50/50 and
60/40 subsamples. This dataset can be merged with the original HBAT dataset if
desired to replicate the estimation and holdout/validation datasets.
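
As a rough illustration of the merge described for HBAT_SPLITS, the sketch below attaches split indicators to the main data and separates estimation and holdout cases. The column names ("id", "satisfaction", "split60") and the 1/0 coding of the split variable are assumptions made for the example, not the names or codes used in the actual files, and the toy rows stand in for the real data.

```python
# Hypothetical column names: "id" identifies the respondent and "split60"
# is ASSUMED to be coded 1 = estimation sample, 0 = holdout sample.
def merge_splits(hbat_rows, split_rows, key="id"):
    """Attach the split indicators to each HBAT row by matching on `key`."""
    splits_by_key = {row[key]: row for row in split_rows}
    merged = []
    for row in hbat_rows:
        combined = dict(row)
        extra = splits_by_key[row[key]]
        # Copy every split variable except the key itself.
        combined.update({k: v for k, v in extra.items() if k != key})
        merged.append(combined)
    return merged


def partition(rows, split_var, estimation_code=1):
    """Return (estimation, holdout) lists based on the split variable."""
    estimation = [r for r in rows if r[split_var] == estimation_code]
    holdout = [r for r in rows if r[split_var] != estimation_code]
    return estimation, holdout


# Toy stand-in for the real files: five respondents, one metric variable.
hbat = [{"id": i, "satisfaction": 5.0 + i} for i in range(1, 6)]
splits = [{"id": i, "split60": 1 if i <= 3 else 0} for i in range(1, 6)]

merged = merge_splits(hbat, splits)
estimation, holdout = partition(merged, "split60")
print(len(estimation), len(holdout))  # 3 2
```

Matching on a respondent identifier rather than on row order keeps the merge correct even if the two files are sorted differently.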

Other HBAT Datasets

In addition to these primary datasets, there are several other datasets used with specific
techniques, including conjoint analysis, multidimensional scaling and structural equation
modeling. These datasets include:

HBAT_SEM – the original data responses from 400 individuals which are the basis for
the structural equation analyses of Chapters 10, 11 and 12. This dataset can be used
to derive the covariance matrices used as input to structural equation programs such
as LISREL, EQS or AMOS.

HBAT_SEM_NOMISSING – the original dataset of 400 responses has two individuals
with missing data. To facilitate SEM analysis without having to address issues of
missing data, this dataset replaces the missing values so that the resulting sample is
400 complete responses. It can be used to perform any SEM analysis, although it
should be noted that very small differences from the results obtained with
HBAT_SEM and reported in the text may occur due to elimination of these two
cases.

HBAT_SEM_FT_NOMISS and HBAT_SEM_PT_NOMISS – These two sub-samples of
the HBAT_SEM_NOMISSING dataset are defined by employees’ full-time or part-time
status (variable C2). These sub-samples are used in the multi-group analysis presented in
Chapter 12.

HBAT PLS-SEM_No Missing Data – a variant of HBAT_SEM_NOMISSING used for
SmartPLS estimation in Chapter 13 (Excel version only).

HBAT400_6CON – this dataset includes the 400 observations from the previous
analysis with the addition of indicators for a sixth construct as described below.

A Sixth Construct -- The HBAT researcher goes to a scales book and quickly finds
a six-item Supervisor Support scale. The scale consists of the following items:

1. SP1. A 100-point slider scale anchored by “I have no problems with my
supervisor” (scored 0) on the left and “My supervisor and I constantly
have problems” (scored 100) on the right.
2. SP2. A 9-point semantic differential scale anchored by 1 = “I personally
like my supervisor” to 9 = “I personally dislike my supervisor.”
3. SP3. A 5-point Likert scale expressing agreement with the statement “My
company often leaves me without all the resources I need to do my job.”
4. SP4. An 11-point scale asking agreement with “My supervisor helps me
resolve all my problems at work.”
5. SP5. A 9-point scale asking how often “My supervisor gives me a pat on
the back” ranging from 1 (never) to 9 (whenever I deserve one).
6. SP6. A 5-point Likert scale asking agreement with the statement
“Management supports me when I have a problem.”

HBAT.COV, HBATF.COV and HBATM.COV – These three covariance matrices
represent the overall sample, female respondents and male respondents,
respectively. They are used with LISREL software in the analyses described in
Chapters 10, 11 and 12.
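
The HBAT_SEM entry above notes that the raw responses can be used to derive covariance matrices for input to structural equation programs. A minimal sketch of that calculation follows, computed for an overall sample and for sub-samples defined by a grouping variable such as C2. The item names ("item1", "item2"), the 1/0 coding of C2, the toy values and the n - 1 denominator are all assumptions for illustration; check what your SEM program expects before using a matrix as input.

```python
def covariance_matrix(rows, variables):
    """Sample covariance matrix (n - 1 denominator) for the named variables."""
    n = len(rows)
    if n < 2:
        raise ValueError("need at least two complete cases")
    means = {v: sum(r[v] for r in rows) / n for v in variables}
    return [
        [
            sum((r[a] - means[a]) * (r[b] - means[b]) for r in rows) / (n - 1)
            for b in variables
        ]
        for a in variables
    ]


# Toy rows standing in for raw responses; C2 is ASSUMED to be coded
# 1 = full-time and 0 = part-time, and the item names are hypothetical.
rows = [
    {"C2": 1, "item1": 1.0, "item2": 2.0},
    {"C2": 1, "item1": 2.0, "item2": 4.0},
    {"C2": 1, "item1": 3.0, "item2": 6.0},
    {"C2": 0, "item1": 4.0, "item2": 1.0},
    {"C2": 0, "item1": 6.0, "item2": 3.0},
]
items = ["item1", "item2"]

overall = covariance_matrix(rows, items)
full_time = covariance_matrix([r for r in rows if r["C2"] == 1], items)
part_time = covariance_matrix([r for r in rows if r["C2"] == 0], items)

print(full_time)  # [[1.0, 2.0], [2.0, 4.0]]
print(part_time)  # [[2.0, 2.0], [2.0, 2.0]]
```

The function assumes complete cases, which is why a dataset without missing values is the more convenient starting point for this kind of calculation.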

Datasets For Other Techniques

There are several datasets used by other techniques covered in supplemental materials
available in the online resources. These techniques include conjoint analysis, multidimensional
scaling and correspondence analysis. The applicable datasets are:

HBAT_CPLAN and HBAT_CONJOINT – the datasets to perform the “full profile”
conjoint analysis available in SPSS. HBAT_CPLAN details the stimulus profile
descriptions and HBAT_CONJOINT contains the actual responses to the profiles.

HBAT_MDS, HBAT_CORRESP and HBAT_CORRESP_INDIV – the datasets used for the
multidimensional scaling and correspondence analyses in the text.

Other Datasets

Finally, two additional datasets are provided to allow students access to data other than the
HBAT data files described in the textbook:

HATCO – this dataset has been utilized in past versions of the textbook and provides
a simplified set of variables amenable to all of the basic multivariate techniques.

SALES – this dataset concerns sales training and comprises 80 respondents,
representing a portion of data that was collected by academic researchers.

Dataset Formats

Given the widespread interchangeability of data formats among statistical programs, all of the
datasets are provided in two formats. The first is the .SAV format used in SPSS, which allows for
documentation of variable descriptions, etc. in a standard format. Second, all of the datasets are
contained in an Excel workbook, which can be read by most statistical programs.
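
For readers working outside SPSS, either format can be read programmatically. The sketch below is one possible approach in Python and is not part of the textbook materials: it assumes the pandas library is installed (with pyreadstat for .SAV files and an Excel engine such as openpyxl), and the file names in the usage comments are hypothetical.

```python
from pathlib import Path


def load_dataset(path, sheet_name=0):
    """Load one dataset from either distributed format into a DataFrame."""
    suffix = Path(path).suffix.lower()
    if suffix == ".sav":
        import pandas as pd  # read_spss also needs the pyreadstat package
        return pd.read_spss(path)
    if suffix in {".xlsx", ".xls"}:
        import pandas as pd  # read_excel needs an engine such as openpyxl
        return pd.read_excel(path, sheet_name=sheet_name)
    raise ValueError(f"unsupported dataset format: {suffix!r}")


# Hypothetical usage; the actual file and sheet names may differ:
# hbat = load_dataset("HBAT.sav")
# hbat = load_dataset("datasets.xlsx", sheet_name="HBAT")
```

Reading the .SAV version preserves the variable documentation stored by SPSS more directly, while the Excel workbook is the simpler route when only the raw values are needed.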

Online Resources

All of the datasets plus a wide range of associated materials are available through our online
resources – Cengage Brain or the text-related website www.mvstats.com. We encourage users
to visit these sites where additional materials are available to all users as well as some
materials available to adopters of the eighth and earlier editions.
