A Robust Bootstrap Test for Mediation Analysis Online Appendix
A Robust Bootstrap Test for Mediation Analysis Online Appendix
Andreas Alfons
Data. The data for the illustrative case come from a larger research program on team
processes. Data were collected from 354 senior business administration students playing a
12-round business simulation game (two separate games of 6 rounds) in randomly assigned
4-person teams (92 teams in total) as part of their capstone strategy course at a Western
European university. Data on several individual- and team-level constructs were collected in
three survey waves: prior to, during, and after the simulation game, with different constructs
being surveyed in the different waves. The overall response rate was 93% (332 students).
Leaving out teams with less than 50% response rate yields n = 89 teams for further analysis.
Theory. Values are standards that guide thought and action (Schwartz, 1992). Values
predispose individuals to favor one ideology over another, determine how one judges oneself
and others, and cause taking certain positions on social issues (Rokeach, 1973). Schwartz’s
value theory proposes ten distinct universal values that are theoretically derived from human
nature; these ten values are power, achievement, hedonism, stimulation, self-direction,
universalism, benevolence, tradition, conformity, and security. When team members possess
different set of values – meaning a high team value diversity – teams can experience higher
levels of conflict in executing their tasks (Jehn, 1994), because the variety of worldviews
may cause different prioritizations of actions that need to be coherently conducted. Conflict
on the task content triggered by a difference in values can be detrimental to team outcomes
(Jehn, Northcraft, & Neale, 1999), such as team commitment (Mowday, Steers, & Porter,
1979). Team commitment is the strength of team members’ identification with, and
involvement in, a particular team (Bishop & Scott, 2000). Accordingly, we propose that team
Measures, validity and reliability. We operationalized task conflict with the intra-
group task conflict scale of (Jehn, 1995). The five items on the presence of conflict were
rated on a 5-point Likert scale (anchored by 1 = “None” and 5 = “A lot”). Sample items
measuring task conflict include the following: “How frequently are there conflicts about
ideas in your work unit?” and “How often do people in your work unit disagree about
(median 𝑟𝑊𝐺 = 0.95). We used the short version of Schwartz’s Value Survey (SVS) to
measure team members’ individual values (Lindeman & Verkasalo, 2005). Then we
operationalized value diversity with average of the coefficient of variation of each value
dimension among team members. Team commitment is measured by four items based on
Mowday, Steers, & Porter (1979). Sample items include “I feel proud to belong to this team”
and “I am willing to exert extra effort to help this team succeed”. Cronbach’s alpha is 0.78.
Individual responses were aggregated to team level (median 𝑟𝑊𝐺 = 0.93). Value diversity
was measured in survey 1, task conflict in survey 2, and team commitment in survey 3.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 4
alternative simulations as robustness checks. These simulations cover a wide range of settings
• OLS bootstrap: the bootstrap test following OLS estimation (Bollen & Stine,
1990; Shrout & Bolger, 2002; MacKinnon, Lockwood, & Williams, 2004;
• OLS Sobel: the Sobel test following OLS estimation (Sobel, 1982);
• Median bootstrap: Yuan & MacKinnon’s (2014) bootstrap test using median
regression;
• ROBMED: our proposed test using MM-estimation (Yohai, 1987) and the fast-
All bootstrap tests use 𝑅 = 5000 bootstrap samples, and report a bias-corrected and
accelerated percentile-based confidence interval (Davison & Hinkley, 1997) for the indirect
1. The first simulation design is a variation of the design from the manuscript,
but with varying sample sizes 𝑛 and varying effect sizes of 𝑎, 𝑏, and 𝑐. The
1
We exclude the SNT bootstrap (see Table 3 of the manuscript) from the additional simulations due to
its long computation time.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 5
aim of this design is to investigate how robust the findings of the manuscript
are across a range of typical sample sizes and effect sizes in organizational
research.
2. The second simulation design uses the same basic design from the manuscript,
but with varying probability of outliers and varying distance of the outliers
from the main part of the data. The aim of this design is to investigate how the
3. The third simulation design uses the same basic design from the manuscript,
parameter settings are used in the latter two cases. The aim of this design is to
investigate how the methods are affected by different levels of skewness and
kurtosis.
from the main part of the data. The aim of this design is to verify our findings
regarding outliers.
5. The fifth simulation design is taken from Yuan & MacKinnon (2014) with
different distributions of the error terms. This design also varies the number of
observations and the effect sizes. Its aim is to verify our findings regarding
For each simulation design, we compare the methods in two situations: (i) when there
is mediation, and (ii) when there is no mediation. In total, this yields 700 different parameter
settings, which allows us to draw robust conclusions about the performance of the methods.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 6
We generate 𝐾 = 1000 data sets for each setting. On each data set, two-sided tests with null
Note that the Box-Cox bootstrap applies a nonlinear transformation to each variable,
therefore the estimates are not comparable to the other methods (in particular under
deviations from the model assumptions). This already highlights a disadvantage of nonlinear
transformations, as in this case we no longer get an estimate of the actual model parameter,
but an estimate for a different model that is difficult to interpret (see also Becker, Robertson,
& Vandenberg, 2019). We therefore discuss the performance of the Box-Cox bootstrap
mainly in terms of the significance test of the indirect effect, and in general exclude it from
Simulation design 1: Robustness of findings across range of sample sizes and effect sizes
explore a range of effect sizes in two different situations: one with mediation (𝑎 = 𝑏 = 𝑐 =
0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, true indirect effect 𝑎𝑏 = 0.04, 0.09, 0.16, 0.25, 0.36, 0.49, 0.64),
and one where mediation does not exist (𝑎 = 𝑐 = 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 𝑏 = 0, true
indirect effect 𝑎𝑏 = 0). We consider the same four settings regarding the error distributions
and outliers as described in Table 5 in the manuscript. The parameter 𝜎1 is chosen such that
𝑀 has variance 1 in the setting with normally distributed errors, but unlike in the manuscript,
Varying effect size. We first give a detailed discussion of the results for varying
effect size for sample size 𝑛 = 100, which is the sample size used in the manuscript. The
conclusions are similar for the other sample sizes, except that the power of all methods
2
With this data generating process, the variance of 𝑌 is given by 𝜎𝑌2 = 𝑏 2 + 𝑐 2 + 𝑎𝑏𝑐 + 𝜎22 . Hence for
large enough values of 𝑎, 𝑏 and 𝑐, it is no longer possible to restrict the variance of 𝑌 to 1. We compensate for
this by adjusting the values of outliers to 𝑀𝑖∗ = 𝑀𝑖 /10 − 3 and 𝑌𝑖∗ = 𝑌𝑖 /10 + 3𝜎𝑌 .
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 7
increases with increasing sample size. The results for varying effect size in different sample
Simulations with mediation. Figure 2 displays the results for sample size 𝑛 = 100.
̂ − 𝑎𝑏 over
The top row shows the average bias of the indirect effect (that is, the average of 𝑎𝑏
the simulation runs) for increasing effect size, while the bottom row shows the rate of how
often the methods reject the null hypothesis and the corresponding estimate of the indirect
effect has the correct sign. Note that evaluating the methods by the rejection rate from the
two-sided tests alone does not provide a meaningful comparison in these simulation settings,
as outliers and other deviations from normality can push the estimated indirect effect from a
positive one towards a negative one. This incorrectly estimated negative indirect effect can be
large enough in magnitude to reject the null hypothesis of a two-sided test. However, while
the sign of the estimated effect is negative, the sign of the true effect is positive, which would
result in an incorrect interpretation of the indirect effect. By taking into account the sign of
the estimated indirect effect as well, we obtain a better measure of realized power of the tests.
With normal error terms, all methods estimate the indirect effect very accurately,
although it seems that the Box-Cox bootstrap has an increasing tendency to apply
unnecessary transformations as the effect sizes increase. For small effect sizes, ROBMED is
slightly less powerful than the OLS bootstrap, the winsorized bootstrap, or the Box-Cox
bootstrap, but it does not lose much power. Furthermore, it is more powerful than the OLS
In the presence of outliers, ROBMED is the only method that still gives accurate
estimates of the indirect effect. The OLS-based methods (i.e., OLS bootstrap and OLS Sobel)
are the most affected by the outliers, while the median bootstrap and the winsorized bootstrap
also show a considerable bias. For all of these methods, the bias increases with increasing
effect size. The results from estimation carry over to the realized power of the tests, with
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 8
ROBMED being clearly the most powerful test. The winsorized bootstrap is the only other
method that is not too far behind in terms of power, despite its bias in effect size.
Interestingly, the power of all methods except ROBMED first rises with increasing effect
size, but then decreases again for large effect sizes. A possible explanation is that different
effect sizes change the relative position of the outliers with respect to main data cloud, as the
shape of that data cloud is changed. This means that the influence of outliers on the methods
For skew-normal error terms, all methods are very accurate in estimating the indirect
effect. The Box-Cox bootstrap, the winsorized bootstrap, the OLS bootstrap, and ROBMED
are highly similar in terms of power, with only minor differences for the smallest investigated
effect size. As in the setting with normal error terms, the median bootstrap has slightly less
For t-distributed errors, all methods estimate the indirect effect accurately, although it
should be noted that the Box-Cox bootstrap is deceived into applying power transformations
that are not particularly suitable for symmetric heavy tails. In terms of power, ROBMED, the
winsorized bootstrap, the median bootstrap, and the Box-Cox bootstrap all have high power,
with ROBMED being marginally more powerful for small and moderate effect sizes. Only
Simulations with no mediation. Figure 7 shows the results for sample size 𝑛 = 100.
The top row of Figure 7 shows the average relative bias of the indirect effect with respect to
the effect size of 𝑎. Note that while 𝑏 = 0, estimates will be small but nonzero. When
multiplying this nonzero estimate of 𝑏 with the estimate of 𝑎, the resulting estimate of 𝑎𝑏
will naturally increase with increasing effect size 𝑎, and therefore also the bias compared to
̂ is obtained as the
the true value 𝑎𝑏 = 0. The same holds of course when the estimate 𝑎𝑏
̂ − 𝑎𝑏)/𝑎 is a more
average over the bootstrap replicates. Therefore, the relative bias (𝑎𝑏
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 9
meaningful evaluation of the estimates of the indirect effect across different effect sizes of 𝑎.
The bottom row of Figure 7 presents the rejection rate. Since the tests are performed with
nominal size 𝛼 = 0.05, the rejection rate should be as close as possible to this value. It can be
For normal, skew-normal, and t-distributed error terms, all methods accurately
estimate the indirect effect and the rejection rates of all bootstrap tests are close to the
nominal size 𝛼 = 0.05. However, the OLS Sobel test is undersized for smaller effect sizes.
In the setting with outliers, ROBMED again yields the most accurate estimates of the
indirect effect. Although there is some bias across the range of the effect size, it is far smaller
than that of any other method. All other methods suffer from considerable bias, in particular
the OLS-based methods. It is also noteworthy that the relative bias of all methods increases
for large effect sizes of 𝑎, although that of ROBMED remains stable the longest. In addition,
ROBMED is the only method with a rejection rate reasonably close to the nominal size 𝛼 =
0.05. While the rejection rate of the median bootstrap is not too far off the nominal size for
smaller effect sizes of 𝑎, it increases slightly throughout the range of the effect size. All other
tests have too large rejection rates, but interestingly the rejection rate of the winsorized
Concluding discussion. The findings from the manuscript are robust across a range of
effect sizes. In terms of estimating the indirect effect, ROBMED is the only method that is
accurate across the four settings for error distributions and outliers. It is the most powerful
test in the presence of outliers and heavy tails, and it does not lose much power under normal
3
Winsorization moves outliers onto a certain tolerance ellipsoid of the estimated covariance matrix
(here estimated via a Huber M-estimator; see Zu & Yuan, 2010). As the effect sizes of 𝑎 and 𝑐 become larger,
the tolerance ellipsoids become more concentrated, meaning that the outliers are moved closer to the center for
larger values of those effect sizes. That likely makes the influence of outliers decrease and the rejection rate
move closer to the nominal size.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 10
and skew-normal errors. Furthermore, it offers the best protection against falsely reporting
Increasing sample size. We now discuss how an increasing sample size affects the
Simulations with mediation. From Figures 1-5, it is clear the OLS-based methods
have insufficient power in the presence of outliers and heavy tails, even for large samples. As
the sample size increases, the performance of the winsorized bootstrap, the median bootstrap,
and the Box-Cox bootstrap becomes more similar to that of ROBMED under normal, skew-
normal, and t-distributed errors. The most notable differences between methods are observed
in the presence of outliers. Figure 11 therefore summarizes the results differently for this
setting, with the sample size on the horizontal axis and separate columns for different effect
sizes. Clearly, the Box-Cox transformation is not a suitable method to handle outliers. Hence
the Box-Cox bootstrap shows a similar bias to the OLS-based methods, and it suffers from a
severe loss of power for many effect sizes even when the sample size is large. ROBMED
clearly performs the best for all sample sizes, followed by the winsorized bootstrap and the
median bootstrap.
Simulations with no mediation. Figures 6-10 show that all bootstrap tests are fairly
well calibrated (rejection rate close to the nominal size 𝛼 = 0.05), except for the setting with
outliers. Figure 12 presents the results in the latter setting in a different manner, which reveals
one of the most interesting results of this simulation design. As the sample size increases,
across all effect sizes, ROBMED shows the smallest bias and it is the only method that is
reasonably well calibrated. For all other methods, the rejection rate increases with increasing
sample size. For 𝑛 = 1000, the rejection rate of the other methods ranges from about 50%
for the median bootstrap to almost 100% for the OLS bootstrap.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 11
Concluding discussion. ROBMED is the only method with reliable results across all
settings for error distributions and outliers, effect sizes, and sample sizes. In particular,
ROBMED is the only method that protects against false mediation discoveries in the presence
of outliers if the true indirect effect is 0, while overall being the most powerful across the
We use the same basic simulation design from the manuscript, but we vary the outlier
settings. The explanatory variable 𝑋 and the error terms 𝑒1 and 𝑒2 are generated from
situation with mediation (true indirect effect 𝑎𝑏 = 0.16), and 𝑏 = 0 for a situation where
mediation does not exist (true indirect effect 𝑎𝑏 = 0). The parameters 𝜎1 and 𝜎2 are chosen
such that 𝑀 and 𝑌 have variance 1, and the sample sizes are 𝑛 = 100 and 𝑛 = 250. In
addition to analyzing the clean data, we generate outliers in the following way. With
probability 𝑝 = 0.01, 0.02, 0.03, 0.04, observations are turned into outliers by setting 𝑀𝑖∗ =
𝑀𝑖 /10 − 𝑑 and 𝑌𝑖∗ = 𝑌𝑖 /10 + 𝑑, with outlier shift 𝑑 = 1, … ,6. Note that the outlier setting
Simulations with mediation. Table 1 contains the results for the average estimates
and standard errors of the indirect effect. For small values of the outlier shift 𝑑, the outliers
overlap with the main data cloud, making it impossible for any method to distinguish the
outliers from data points that follow the model. Hence all methods suffer from a bias in the
estimate of the indirect effect for small values of 𝑑, with the bias increasing for higher outlier
probabilities. As 𝑑 becomes larger and the outliers become separable from the main data
cloud, ROBMED is the only method for which the estimates recover from the bias. Its
estimates move again closer to the true value and remain approximately unbiased. In addition
to being the only method to recover from the bias, ROBMED also exhibits the smallest
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 12
maximum bias (over the range of 𝑑). The estimates of the OLS-based methods, and to a
lesser extent the estimates of the median bootstrap, continue to move away from the true
value for an increasing outlier shift 𝑑. The winsorized bootstrap is able to stop the bias from
increasing for reasonably small values of 𝑑, as winsorization cuts off the influence of outliers,
but the bias remains constant and does not decrease again. For further illustration, Figure 13
(top) visualizes this behavior of the average estimates for outlier probability 2%.
Table 2 lists how often the methods reject the null hypothesis and the corresponding
estimate of the indirect effect has the correct sign (our measure of realized power). The
results from the estimation of the indirect effect clearly carry over. All tests except ROBMED
continue to lose power for increasing outlier probability and increasing outlier shift 𝑑. Across
all outlier probabilities, ROBMED only loses some power for small values of 𝑑 when the
outliers overlap with the main data cloud. As 𝑑 increases further and the outliers become
separable from the main part of the data, their power goes back to the same level that is
observed on clean data (𝑑 = 0) and stays there for a broad range of 𝑑. Once its power
stabilizes, ROBMED is the most powerful test. These results are further illustrated in Figure
Simulations with no mediation. The results for the average estimates and standard
errors of the indirect effect are shown in Table 3. For increasing outlier probability and outlier
shift 𝑑, the outliers push the estimates towards a negative estimated effect. Across all outlier
probabilities, we observe the following behavior. ROBMED is again the only method for
which the estimates recover from this bias and stay approximately unbiased once the outliers
are separable from the main part of the data (large enough values of 𝑑). The winsorized
bootstrap is able to contain the bias but is unable to decrease the bias again for larger values
of the outlier shift 𝑑. The bias of the median bootstrap, and even more so the bias of the OLS-
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 13
based methods, keeps increasing for increasing 𝑑. For outlier probability 2%, this behavior is
The rejection rate of the tests (i.e., their realized size) is shown Table 4. The rejection
rate for the OLS-based tests quickly rises with increasing outlier probability and increasing
outlier shift 𝑑. The rejection rates of the Box-Cox bootstrap, the winsorized bootstrap, and
the median bootstrap test increase (somewhat) more slowly. ROBMED exhibits a fairly stable
rejection rate close to the nominal size 𝛼 = 0.05 except for a noticeable increase for smaller
values of 𝑑 where the outliers overlap with the main data cloud, which is more pronounced
for higher outlier probabilities. A visualization of these results for outlier probability 2% can
this simulation design. Across various outlier probabilities, it is the only method that can
recover from bias and loss of power as the outliers become separable from the main data
cloud. In addition, ROBMED does not lose much power to the OLS bootstrap when there are
no outliers, and it is the only method to effectively protect against falsely detecting mediation
when the true indirect effect is 0. Finally, the poor performance of the Box-Cox bootstrap
demonstrates that nonlinear transformations are not a suitable treatment for outliers.
We again use the same basic simulation design from the manuscript, but this time we
vary the error distributions for different levels of skewness and kurtosis. The explanatory
variable 𝑋 follows a standard normal distribution 𝑁(0,1). We set 𝑎 = 𝑐 = 0.4, and 𝑏 = 0.4
for a situation with mediation (true indirect effect 𝑎𝑏 = 0.16), as well as 𝑏 = 0 for a situation
where mediation does not exist (true indirect effect 𝑎𝑏 = 0). The parameters 𝜎1 and 𝜎2 are
chosen such that 𝑀 and 𝑌 have variance 1 in the case of normal error distributions, and the
sample sizes are 𝑛 = 100 and 𝑛 = 250. We investigate the following three settings for the
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 14
distributions of the error terms 𝑒1 and 𝑒2 : (i) a centered log-normal distribution, (ii) skew-t
(Fleishman, 1978).
Centered log-normal distribution. The error terms 𝑒1 and 𝑒2 follow a centered log-
normal distribution. That is, the error terms are generated from a log-normal distribution
𝐿𝑜𝑔𝑁(0,1) after which the expected value 𝑒1/2 is subtracted. The skewness of this
distribution is 6.185 and the excess kurtosis is 107.936, hence deviations from normality are
quite severe.
Simulations with mediation. The top row of Figure 15 shows boxplots of the
estimates of the indirect effect. Despite the strong deviation from normality, bias is very low
for all methods. Keep in mind that the estimates of the Box-Cox bootstrap are not comparable
to the true indirect effect due to the application of nonlinear transformations, which also
demonstrates that nonlinear transformations are not always a suitable treatment for non-
The bottom row of Figure 15 shows the rate of how often the methods reject the null
hypothesis and the corresponding estimate of the indirect effect has the correct sign (our
measure of realized power of the tests). For the smaller sample size (𝑛 = 100), ROBMED
and the Box-Cox bootstrap are the only methods that have realized power of (almost) 100%.
The winsorized bootstrap and the median bootstrap only have slightly lower power, but the
OLS based tests perform poorly. For the larger sample size (𝑛 = 250), all robust tests have
realized power of (almost) 100%, but the OLS-based tests still trail behind at about 80%.
Simulations with no mediation. Regarding the estimation of the indirect effect, the
top row of Figure 16 shows that the bias is fairly low for all methods. ROBMED has again
lower variance than the median bootstrap, the winsorized bootstrap, and the OLS-based
methods. The bottom row of Figure 16 shows that the rejection rates of the Box-Cox
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 15
bootstrap and the robust methods are close to the nominal size 𝛼 = 0.05 for the larger sample
size (𝑛 = 250), with the rejection rate of the OLS bootstrap being somewhat too high at
about 10%. For the smaller sample size (𝑛 = 100), all tests exhibit slightly elevated rejection
rates, with the winsorized bootstrap having the highest rejection rate.
Concluding discussion. ROBMED and the Box-Cox bootstrap outperform the other
tests in terms of power. Despite the severe deviations from normality, ROBMED estimates
the indirect effect very accurately. On the other hand, the estimates of the Box-Cox bootstrap
are not comparable with the true indirect effect due to the nonlinear transformations.
𝑆𝑇(𝜉, 1, 𝜆, 𝜐)4 (e.g., Azzalini & Capitano, 2014). Note that a standard normal distribution is
the parameters 𝜆 and 𝜐 is quite difficult, as their effect on skewness and kurtosis cannot be
decoupled (Arellano-Valle & Azzalini, 2013). We set the parameter 𝜆 such that the skewness
∞, 5, 2. For 𝜐 = 2, skewness and kurtosis are undefined, while 𝜐 = 5 is the smallest integer
value of 𝜐 for which both skewness and kurtosis are finite. The values of 𝜆, 𝜐, and the
corresponding values of skewness and excess kurtosis are reported together with the
simulation results in Tables 5–8. Finally, the location parameter 𝜉 is chosen such that the
Simulations with mediation. The results for the estimates of the indirect effect in
Table 5 show that the bias is close to 0 for all methods. We emphasize again that the estimates
of the Box-Cox bootstrap are not comparable with the true indirect effect due to the nonlinear
4
In the literature, the parameter 𝜆 of the skew-t distribution is usually denoted by 𝛼. We changed this
notation here to avoid confusion with the significance level 𝛼 of the mediation tests.
5
The skewness of the skew-normal distribution is bounded by the interval [−0.995, 0.995].
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 16
transformations. Table 6 reports how often the tests reject the null hypothesis and the
corresponding estimate of the indirect effect has the correct sign (our measure of realized
power). For most parameter settings, the differences in power between the bootstrap tests are
small. For the smaller sample size (𝑛 = 100), the median bootstrap in general has (slightly)
lower power than the other robust tests. Interestingly, all methods have lower power for
symmetric error distributions with heavier tails (𝜆 = 0, small 𝜐) than for distributions with
skewness and heavier tails (𝜆 ≠ 0, small 𝜐). A likely explanation is that the variability of the
skew-t distribution decreases as |𝜆| increases, meaning that the uncertainty in the model is the
highest for 𝜆 = 0.6 The most pronounced differences among the methods are found for 𝜐 =
2, where the tails are the heaviest: across all values of 𝜆, ROBMED has slightly higher power
than the other tests, while the OLS bootstrap suffers from a severe loss of power. For the
larger sample size (𝑛 = 250), ROBMED, the winsorized bootstrap, the median bootstrap,
and the Box-Cox bootstrap exhibit power of (close to) 100% across all parameter values,
while the OLS bootstrap still suffers from a considerable loss of power for 𝜐 = 2.
Simulations with no mediation. Tables 7 and 8 contain the results for the estimates of
the indirect effect and the rejection rates of the tests, respectively. Differences between the
methods are small. The bias is almost 0 for all methods. The rejection rates are close to the
nominal size 𝛼 = 0.05 for most parameter settings, but all tests tend to slightly overreject the
methods, including the Box-Cox bootstrap. ROBMED has comparable power to other
methods when deviations from normality are small, but (slightly) higher power when
6
It is not possible to fix the variance of the skew-t distribution across the different parameter settings
for 𝜆 and 𝜐, as the variance of the skew-t distribution is infinite for 𝜐 = 2.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 17
the error terms 𝑒1 and 𝑒2 from non-normal distributions. Fleishman’s method works as
follows. For each variable, observations are first generated by a standard normal distribution,
transformation are chosen such that the distribution of the transformed variable matches
given values of skewness and kurtosis. However, such a polynomial transformation does not
exist for all values of skewness and kurtosis, and Headrick & Kowalchuk (2007) note that not
(pdf).7 Accordingly, we only consider values of skewness and kurtosis that result in such a
valid pdf. The investigated values of skewness and excess kurtosis are reported together with
Simulations with mediation. Table 9 contains the results for the estimates of the
indirect effect, and Table 10 lists how often the tests reject the null hypothesis and the
corresponding estimate of the indirect effect has the correct sign (our measure of realized
power). As the deviations from normality are fairly small with Fleishman’s method, the bias
is close to 0 for all methods and power is (close to) 100%. For the smaller sample size (𝑛 =
100), only the median bootstrap suffers from loss of power for a small number of parameter
Simulations with no mediation. The results for the estimates of the indirect effect and
the rejection rates of the tests are shown in Tables 11 and 12, respectively. Bias is close to 0
for all methods, and rejection rates are similar for all tests. For the larger sample size (𝑛 =
250), all tests are well calibrated with rejection rates close to the nominal size 𝛼 = 0.05, but
for the smaller sample size (𝑛 = 100) all tests have a tendency to slightly overreject.
7
Among the conditions for a valid pdf is that percentage points and measures of central tendencies
(such as mean, median and mode) can be computed (see Headrick & Kowalchuk, 2007).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 18
Concluding discussion. Since deviations from normality are rather small, all tests
perform very well. Only the median bootstrap suffers from loss of power for a small number
of parameter settings.
This simulation design consists of several extensions of the design of Zu & Yuan
(2010). The explanatory variable 𝑋 and the error terms 𝑒1 and 𝑒2 follow a standard normal
distribution, and we set 𝜎1 = 𝜎2 = 1 such that the variance of the rescaled error terms
remains 1. We set 𝑎 = 𝑐 = 0.2, but we vary the value of 𝑏 to investigate different settings:
𝑏 = 0.2 yields a setting with mediation (with a true indirect effect 𝑎𝑏 = 0.04), while 𝑏 = 0
corresponds to a setting where mediation does not exist (true indirect effect 𝑎𝑏 = 0). The
sample sizes are 𝑛 = 100 and 𝑛 = 250. In addition to analyzing the clean data, we replace a
small percentage of observations with outliers by setting 𝑀𝑖∗ = 𝑀𝑖 − 𝑑 and 𝑌𝑖∗ = 𝑌𝑖 + 𝑑. The
original design of Zu & Yuan (2010) is obtained by setting 𝑛 = 250, 𝑑 = 6, and by replacing
2, 4, 6, 8, 10 observations by outliers.
Effect of the amount of outliers. Similar to Zu & Yuan (2010), we set the outlier
Simulations with mediation. In the top row of Figure 17, the average estimates of the
indirect effect are shown for an increasing percentage of outliers. Clearly, the OLS-based
methods show a large bias for the indirect effect in the presence of outliers, with this bias
continuously increasing for an increasing percentage of outliers. The median bootstrap and
the winsorized bootstrap are affected to a lesser extent, but also their bias keeps increasing as
the percentage of outliers increases. However, ROBMED remains stable and accurate in
The bottom row of Figure 17 displays the rate of how often the methods reject the
null hypothesis and the corresponding estimate of the indirect effect has the correct sign (our
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 19
measure of realized power). For the OLS-based tests, the results from the estimation of the
indirect effect clearly carry over. Their realized power quickly drops and reaches 0 when
there are about 1% of outliers. The Box-Cox bootstrap, the median bootstrap, and the
winsorized bootstrap also continuously lose power, and eventually their realized power drops
to (almost) 0. ROBMED is the only test that remains stable. It is the most powerful test when
Simulations with no mediation. In the top row of Figure 18, we observe that the
outliers push the estimates of the OLS-based methods towards a negative estimated effect. A
similar effect, although to a lesser extent, is visible for the estimates of the median bootstrap
and the winsorized bootstrap. ROBMED, on the other hand, remains stable and close to the
true value 𝑎𝑏 = 0.
The bottom row of Figure 18 presents the rejection rate of the tests (i.e., their realized
size). As expected, the rejection rate for the OLS-based tests quickly rises, but interestingly it
starts to fall again for higher percentages of outliers. This is likely because of the estimated
confidence intervals being even more affected by the outliers than the point estimates,
yielding very large confidence intervals for higher percentages of outliers. The rejection rates
of the Box-Cox bootstrap, the winsorized bootstrap, and the median bootstrap increase
(somewhat) more slowly. ROBMED is the only test unaffected by the outliers and its
simulation design. It remains accurate in estimating the indirect effect and powerful for
assessing its significance. In addition, ROBMED does not lose much power to the OLS
bootstrap when there are no outliers. ROBMED’s competitors show a loss of power in the
presence of outliers, and can be driven to falsely report mediation when the true indirect
effect is 0.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 20
Effect of the distance of outliers from the main bulk of the data. We fix the
Simulations with mediation. The top row of Figure 19 displays the average estimates
of the indirect effect for varying values of the outlier shift 𝑑. For small values of this shift,
where outliers overlap with the main bulk of the data, the bias of all methods increases. As
the outlier shift 𝑑 increases further and the outliers become separable from the main bulk of
the data, ROBMED is the only methods for which the bias goes back towards zero. For the
OLS bootstrap, and to a lesser extent for the median bootstrap, the bias keeps increasing. For
the winsorized bootstrap, the bias stabilizes but does not decrease again.
The bottom row of Figure 19 shows how often the methods reject the null hypothesis
and the corresponding estimate of the indirect effect has the correct sign (our measure of
realized power). As in the previous simulation design, the results from the estimation of the
indirect effect carry over. All tests except ROBMED continue to lose power for an increasing
outlier shift 𝑑. ROBMED only loses some power for small values of 𝑑 when the outliers
overlap with the main data cloud. As 𝑑 increases further and the outliers become separable
from the main part of the data, their power goes back to the same level that is observed on
clean data (𝑑 = 0) and stays there for a broad range of 𝑑. Once its power stabilizes,
Simulations with no mediation. From the top row of Figure 20, it is clear that the
outliers push the estimates towards a negative estimated effect. Otherwise, the results are
pretty similar to the case with mediation. ROBMED is again the only method for which the
estimates recover from this bias and stay approximately unbiased once the outliers are
separable from the main part of the data. The winsorized bootstrap is able to contain the bias
but is unable to decrease the bias again for larger values of the outlier shift 𝑑. The bias of the
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 21
median bootstrap, and even more so the bias of the OLS-based methods, keeps increasing for
increasing 𝑑.
The rejection rate of the tests (i.e., their realized size) is shown in the bottom row of
Figure 20. As in the previous simulation design, the rejection rate for the OLS-based tests
quickly rises, but starts to fall again for larger values of 𝑑. An explanation could again be that
the estimated confidence intervals are even more affected by large outliers than the point
estimates. The rejection rates of the Box-Cox bootstrap, the winsorized bootstrap, and the
median bootstrap test increase (somewhat) more slowly. While the rejection rate of the
winsorized bootstrap test levels off for reasonably small 𝑑, and to a lesser extent that of the
Box-Cox bootstrap as well, the rejection rate of the median bootstrap keeps increasing.
ROBMED exhibits a fairly stable rejection rate close to the nominal size 𝛼 = 0.05 except for
a small bump for smaller values of 𝑑 where the outliers overlap with the main data cloud.
simulation design. It is the only method that can recover from bias and loss of power as the
outliers become separable from the main data cloud. Furthermore, it is the only method to
effectively protect against falsely detecting mediation when the true indirect effect is 0.
The last simulation design is taken from Yuan & MacKinnon (2014). First, the
several settings where mediation exists: we set 𝑎 = 𝑏 = 𝑐 = 0.14 for small effect sizes
(yielding a true indirect effect 𝑎𝑏 = 0.0196), 𝑎 = 𝑏 = 𝑐 = 0.39 for medium effect sizes
(yielding a true indirect effect 𝑎𝑏 = 0.1521), and 𝑎 = 𝑏 = 𝑐 = 0.59 for larger effect sizes
(yielding a true indirect effect 𝑎𝑏 = 0.3481). To study situations where mediation does not
exist, we keep the same values for 𝑏 and 𝑐, but set 𝑎 = 0 for a true indirect effect 𝑎𝑏 = 0.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 22
The sample sizes are 𝑛 = 50, 100, 200, 500. Moreover, we set 𝜎1 = 𝜎2 = 1 and consider the
heavy tails.
3. A contaminated normal distribution 0.9 ∙ 𝑁(0,1) + 0.1 ∙ 𝑁(0, 102 ), i.e., with
10% probability, error terms are generated from a normal distribution with a
Even though results for all effect sizes are shown in Figures 21-28, we discuss only the
results for the medium effect sizes, as the results for the smaller and larger effect sizes are
qualitatively similar. The main difference is that, as expected, all methods have lower power
for the smaller effect sizes and higher power for the larger effect sizes.
Simulations with mediation. Figures 21–24 show the simulation results for sample
size 𝑛 = 50, 100, 200 and 500, respectively, and the following discussion focuses on the
setting with 𝑎 = 𝑏 = 𝑐 = 0.39 and true indirect effect 𝑎𝑏 = 0.1521. The top row of the
figures contains box plots of the estimates of the indirect effect, while the bottom row
displays rate of how often the methods reject the null hypothesis and the corresponding
estimate of 𝑎𝑏 has the correct sign (i.e., the realized power of the tests). The columns of the
For standard normal error terms, all methods estimate the indirect effect accurately.
As expected, the power is the highest for the OLS bootstrap and the winsorized bootstrap,
followed by ROBMED and the median bootstrap. The power of all bootstrap tests increases
with increasing sample size, and all tests already reach a 100% rejection rate for 𝑛 = 200 (or
For heavy tails in the errors (following a 𝑡 distribution with 2 degrees of freedom), the
Box-Cox bootstrap estimates and the OLS-based estimates of the indirect effect show larger
variability than the other methods, and the OLS-based tests have the lowest power. The other
methods are very similar in terms of estimating the indirect effect and remain accurate.
ROBMED thereby has the highest power, followed by the winsorized bootstrap, the Box-Cox
When the error terms are generated by a contaminated normal distribution, the results
are very similar to the setting with heavy tails. The OLS-based estimates and the Box-Cox
bootstrap estimates again show larger variability, and the OLS-based tests have the lowest
power. As before, ROBMED has the highest power, followed by the winsorized bootstrap,
Simulations with no mediation. Figures 25–28 show the simulation results for
sample size 𝑛 = 50, 100, 200 and 500, respectively, and the following discussion focuses on
the setting with 𝑎 = 0, 𝑏 = 𝑐 = 0.39 and true indirect effect 𝑎𝑏 = 0. The top row of the
figures again shows box plots of the estimates of the indirect effect, while the bottom row
displays the rejection rate (i.e., the realized size of the tests). For the most part, all methods
perform fairly similarly. The most interesting result is that the OLS-based estimates and the
Box-Cox bootstrap estimates again show larger variability for errors with heavy tails and
errors from a contaminated normal distribution. In addition, the realized size of the OLS
Concluding discussion. There are fewer differences between the methods than in
other simulation designs. ROBMED outperforms its competitors in the setting with heavy
tails in the errors, as well as in the setting where the error terms are generated by a
reports whether these articles (i) use OLS, (ii) report outliers, (iii) check model assumptions
(e.g., normality), (iv) use bootstrapping, and (v) use the PROCESS macro (Hayes, 2018) (or
References
Arellano-Valle, R. B., & Azzalini, A. (2013). The Centred Parameterization and Related
Azzalini, A., & Capitano, A. (2014). Azzalini, A., & Capitanio, A. (2014). The Skew-Normal
85(3), 439–450.
Bollen, K. A., & Stine, R. (1990). Direct and Indirect Effects: Classical and Bootstrap
Box, G. E., & Cox, D. R. (1964). An Analysis of Transformations. Journal of the Royal
Davison, A. C., & Hinkley, D. V. (1997). Bootstrap Methods and their Application.
43(4), 521–532.
Hawkins, D. M., & Weisberg, S. (2017). Combining the Box-Cox Power and Generalized
Headrick, T. C., & Kowalchuk, R. K. (2007). The Power Method Transformation: Its
Probability Density Function, and Its Further Use for Fitting Data. Journal of
Jehn, K. A., Northcraft, G. B., & Neale, M. A. (1999). Why Differences Make a Difference: A
Lindeman, M., & Verkasalo, M. (2005). Measuring Values With the Short Schwartz's Value
MacKinnon, D. P., Lockwood, C. M., & Williams, J. (2004). Confidence Limits for the
Mowday, R. T., Steers, R. M., & Porter, L. W. (1979). The Measurement of Organizational
Preacher, K. J., & Hayes, A. F. (2004). SPSS and SAS Procedures for Estimating Indirect
Preacher, K. J., & Hayes, A. F. (2008). Asymptotic and Resampling Strategies for Assessing
Rokeach, M. (1973). The Nature of Human Values. New York, NY: The Free Press.
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 27
Shrout, P. E., & Bolger, N. (2002). Mediation in Experimental and Nonexperimental Studies:
Yohai, V. J. (1987). High Breakdown-Point and High Efficiency Robust Estimates for
Yuan, Y., & MacKinnon, D. P. (2014). Robust Mediation Analysis Based on Median
Zu, J., & Yuan, K.-H. (2010). Local Influence and Robust Procedures for Mediation Analysis.
Tables
Table 1. Bias and standard deviation (in parenthesis) for simulation design 2 with outliers in
the setting with mediation (𝑎 = 𝑏 = 0.4).
Probability Outlier Box-Cox Winsorized Median
𝒏 of outliers shift 𝒅 OLS bootstrap OLS Sobel bootstrap bootstrap bootstrap ROBMED
100 0% 0 0.002 (0.052) 0.002 (0.052) 0.006 (0.066) 0.002 (0.052) 0.002 (0.059) 0.002 (0.056)
1% 1 -0.006 (0.051) -0.006 (0.051) -0.002 (0.064) -0.006 (0.052) -0.004 (0.058) -0.005 (0.056)
1% 2 -0.024 (0.053) -0.024 (0.053) -0.021 (0.063) -0.012 (0.053) -0.009 (0.058) 0.001 (0.058)
1% 3 -0.048 (0.063) -0.051 (0.065) -0.044 (0.066) -0.012 (0.053) -0.014 (0.059) 0.003 (0.057)
1% 4 -0.073 (0.079) -0.080 (0.083) -0.050 (0.071) -0.013 (0.053) -0.019 (0.060) 0.002 (0.056)
1% 5 -0.097 (0.097) -0.110 (0.104) -0.044 (0.072) -0.013 (0.053) -0.025 (0.062) 0.002 (0.056)
1% 6 -0.118 (0.114) -0.137 (0.125) -0.039 (0.072) -0.013 (0.053) -0.031 (0.064) 0.002 (0.056)
2% 1 -0.013 (0.050) -0.013 (0.050) -0.010 (0.062) -0.014 (0.051) -0.009 (0.058) -0.012 (0.055)
2% 2 -0.047 (0.053) -0.048 (0.054) -0.044 (0.059) -0.027 (0.054) -0.019 (0.058) 0.000 (0.058)
2% 3 -0.089 (0.066) -0.093 (0.068) -0.076 (0.059) -0.028 (0.055) -0.030 (0.059) 0.004 (0.057)
2% 4 -0.131 (0.083) -0.141 (0.086) -0.082 (0.065) -0.029 (0.055) -0.041 (0.062) 0.002 (0.056)
2% 5 -0.169 (0.101) -0.186 (0.104) -0.075 (0.069) -0.029 (0.055) -0.053 (0.067) 0.002 (0.056)
2% 6 -0.202 (0.118) -0.226 (0.121) -0.070 (0.072) -0.029 (0.055) -0.066 (0.074) 0.002 (0.056)
3% 1 -0.020 (0.049) -0.020 (0.049) -0.019 (0.060) -0.022 (0.050) -0.015 (0.057) -0.019 (0.054)
3% 2 -0.068 (0.053) -0.070 (0.054) -0.065 (0.056) -0.045 (0.057) -0.030 (0.058) -0.002 (0.060)
3% 3 -0.125 (0.068) -0.130 (0.068) -0.102 (0.054) -0.048 (0.059) -0.047 (0.061) 0.005 (0.058)
3% 4 -0.179 (0.085) -0.190 (0.086) -0.108 (0.062) -0.049 (0.061) -0.065 (0.067) 0.002 (0.056)
3% 5 -0.226 (0.102) -0.242 (0.101) -0.104 (0.070) -0.050 (0.061) -0.085 (0.077) 0.002 (0.056)
3% 6 -0.264 (0.118) -0.286 (0.116) -0.102 (0.076) -0.051 (0.062) -0.107 (0.089) 0.002 (0.056)
4% 1 -0.027 (0.048) -0.027 (0.048) -0.025 (0.058) -0.029 (0.049) -0.020 (0.056) -0.025 (0.053)
4% 2 -0.087 (0.053) -0.088 (0.053) -0.083 (0.054) -0.064 (0.059) -0.041 (0.058) -0.005 (0.063)
4% 3 -0.155 (0.068) -0.160 (0.068) -0.122 (0.051) -0.070 (0.066) -0.065 (0.063) 0.007 (0.058)
4% 4 -0.217 (0.085) -0.227 (0.084) -0.130 (0.062) -0.073 (0.070) -0.091 (0.074) 0.002 (0.056)
4% 5 -0.268 (0.103) -0.283 (0.100) -0.131 (0.074) -0.075 (0.073) -0.120 (0.090) 0.002 (0.056)
4% 6 -0.310 (0.120) -0.328 (0.116) -0.133 (0.082) -0.076 (0.075) -0.150 (0.105) 0.002 (0.056)
250 0% 0 0.002 (0.032) 0.002 (0.032) 0.001 (0.045) 0.002 (0.032) 0.002 (0.037) 0.001 (0.035)
1% 1 -0.006 (0.032) -0.006 (0.032) -0.006 (0.043) -0.006 (0.032) -0.003 (0.037) -0.005 (0.035)
1% 2 -0.024 (0.034) -0.024 (0.034) -0.024 (0.043) -0.011 (0.033) -0.008 (0.037) 0.002 (0.036)
1% 3 -0.050 (0.041) -0.052 (0.042) -0.049 (0.045) -0.012 (0.033) -0.012 (0.037) 0.002 (0.035)
1% 4 -0.081 (0.053) -0.085 (0.054) -0.055 (0.050) -0.012 (0.033) -0.017 (0.038) 0.001 (0.035)
1% 5 -0.113 (0.066) -0.120 (0.068) -0.043 (0.052) -0.012 (0.033) -0.022 (0.039) 0.001 (0.035)
1% 6 -0.144 (0.079) -0.154 (0.081) -0.035 (0.052) -0.012 (0.033) -0.027 (0.040) 0.001 (0.035)
2% 1 -0.013 (0.031) -0.013 (0.031) -0.013 (0.042) -0.014 (0.032) -0.008 (0.037) -0.012 (0.034)
2% 2 -0.048 (0.034) -0.049 (0.034) -0.048 (0.040) -0.027 (0.034) -0.018 (0.037) 0.002 (0.037)
2% 3 -0.095 (0.043) -0.097 (0.043) -0.081 (0.039) -0.028 (0.034) -0.027 (0.037) 0.004 (0.036)
2% 4 -0.145 (0.054) -0.150 (0.054) -0.080 (0.043) -0.028 (0.034) -0.037 (0.038) 0.001 (0.035)
2% 5 -0.193 (0.064) -0.201 (0.064) -0.069 (0.047) -0.028 (0.034) -0.048 (0.040) 0.001 (0.035)
2% 6 -0.236 (0.074) -0.247 (0.073) -0.063 (0.049) -0.028 (0.034) -0.058 (0.044) 0.001 (0.035)
3% 1 -0.020 (0.031) -0.020 (0.031) -0.019 (0.040) -0.021 (0.031) -0.013 (0.037) -0.018 (0.034)
3% 2 -0.068 (0.033) -0.069 (0.033) -0.067 (0.037) -0.043 (0.035) -0.027 (0.037) 0.001 (0.038)
3% 3 -0.130 (0.041) -0.132 (0.042) -0.104 (0.034) -0.044 (0.036) -0.042 (0.038) 0.005 (0.036)
3% 4 -0.191 (0.051) -0.196 (0.051) -0.103 (0.041) -0.045 (0.036) -0.057 (0.040) 0.001 (0.035)
3% 5 -0.246 (0.060) -0.253 (0.059) -0.096 (0.046) -0.046 (0.036) -0.073 (0.044) 0.001 (0.035)
3% 6 -0.293 (0.068) -0.302 (0.066) -0.094 (0.050) -0.046 (0.036) -0.091 (0.051) 0.001 (0.035)
4% 1 -0.026 (0.030) -0.026 (0.030) -0.026 (0.039) -0.028 (0.030) -0.018 (0.036) -0.024 (0.033)
4% 2 -0.087 (0.032) -0.088 (0.032) -0.085 (0.035) -0.061 (0.037) -0.037 (0.036) -0.001 (0.039)
4% 3 -0.160 (0.041) -0.162 (0.041) -0.124 (0.032) -0.064 (0.039) -0.057 (0.038) 0.006 (0.037)
4% 4 -0.228 (0.050) -0.232 (0.049) -0.127 (0.041) -0.066 (0.040) -0.079 (0.043) 0.002 (0.035)
4% 5 -0.286 (0.059) -0.292 (0.058) -0.126 (0.049) -0.067 (0.041) -0.103 (0.052) 0.001 (0.035)
4% 6 -0.333 (0.069) -0.339 (0.067) -0.127 (0.054) -0.067 (0.041) -0.130 (0.063) 0.001 (0.035)
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 29
Table 2. Rate of rejection with correct sign of the indirect effect (realized power; the higher
the better) for simulation design 2 with outliers in the setting with mediation (𝑎 = 𝑏 = 0.4).
Table 3. Bias and standard deviation (in parenthesis) for simulation design 2 with outliers in
Table 4. Rejection rate (realized size of the tests; the closer to 𝛼 = 0.05 the better) for
simulation design 2 with outliers in the setting with no mediation (𝑎 = 0.4, 𝑏 = 0).
Table 5. Bias and standard deviation (in parenthesis) for simulation design 3 with skew-t
error distributions in the setting with mediation (𝑎 = 𝑏 = 0.4). The parameters 𝜆 and 𝜈
kurtosis
Excess
Box-Cox Winsorized Median
𝒏 𝝀 𝝊 OLS bootstrap OLS Sobel bootstrap bootstrap bootstrap ROBMED
100 −∞ ∞ -0.995 0.869 0.000 (0.041) 0.000 (0.041) 0.075 (0.080) -0.002 (0.040) 0.002 (0.047) 0.002 (0.041)
-2.174 ∞ -0.5 0.347 0.000 (0.042) 0.000 (0.042) 0.052 (0.073) -0.001 (0.043) 0.000 (0.047) 0.001 (0.045)
0 ∞ 0 0 -0.001 (0.050) -0.001 (0.050) 0.002 (0.065) -0.001 (0.051) 0.000 (0.057) 0.000 (0.055)
2.174 ∞ 0.5 0.347 -0.001 (0.042) -0.001 (0.042) 0.038 (0.070) -0.001 (0.042) 0.000 (0.046) 0.000 (0.044)
∞ ∞ 0.995 0.869 -0.001 (0.040) -0.001 (0.040) 0.054 (0.073) -0.002 (0.040) 0.002 (0.046) 0.001 (0.040)
−∞ 5 -2.55 20.109 0.000 (0.048) 0.000 (0.048) 0.084 (0.089) -0.001 (0.043) 0.004 (0.044) 0.002 (0.038)
-2.174 5 -1.869 14.138 -0.001 (0.050) -0.001 (0.050) 0.056 (0.081) -0.001 (0.046) 0.002 (0.046) 0.001 (0.044)
0 5 0 6 -0.001 (0.058) -0.001 (0.058) 0.002 (0.080) -0.001 (0.056) 0.000 (0.056) 0.000 (0.055)
2.174 5 1.869 14.138 -0.001 (0.050) -0.001 (0.050) 0.007 (0.091) -0.001 (0.045) 0.001 (0.045) 0.000 (0.043)
∞ 5 2.55 20.109 -0.001 (0.048) -0.001 (0.048) 0.017 (0.098) -0.002 (0.041) 0.003 (0.043) 0.001 (0.037)
−∞ 2 - - -0.005 (0.135) -0.005 (0.133) 0.131 (0.111) 0.001 (0.050) 0.005 (0.044) 0.001 (0.034)
-2.174 2 - - -0.005 (0.136) -0.005 (0.134) 0.099 (0.108) 0.000 (0.055) 0.003 (0.046) 0.001 (0.041)
0 2 - - -0.003 (0.156) -0.003 (0.155) 0.107 (0.139) -0.001 (0.067) 0.000 (0.055) 0.001 (0.056)
2.174 2 - - 0.001 (0.157) 0.001 (0.155) 0.056 (0.163) 0.000 (0.053) 0.003 (0.046) 0.001 (0.042)
∞ 2 - - 0.001 (0.154) 0.001 (0.153) -0.006 (0.144) 0.000 (0.049) 0.005 (0.044) 0.002 (0.035)
250 −∞ ∞ -0.995 0.869 0.000 (0.024) 0.000 (0.024) 0.081 (0.051) -0.002 (0.024) 0.000 (0.029) 0.000 (0.024)
-2.174 ∞ -0.5 0.347 0.000 (0.025) 0.000 (0.025) 0.056 (0.048) 0.000 (0.025) 0.000 (0.029) 0.000 (0.026)
0 ∞ 0 0 0.000 (0.030) 0.000 (0.030) 0.002 (0.043) 0.000 (0.030) 0.000 (0.036) 0.000 (0.033)
2.174 ∞ 0.5 0.347 0.000 (0.025) 0.000 (0.025) 0.033 (0.047) 0.000 (0.025) 0.000 (0.029) 0.000 (0.027)
∞ ∞ 0.995 0.869 0.000 (0.024) 0.000 (0.024) 0.046 (0.051) -0.002 (0.023) 0.001 (0.029) 0.000 (0.024)
−∞ 5 -2.55 20.109 0.001 (0.030) 0.001 (0.030) 0.098 (0.060) -0.002 (0.025) 0.002 (0.027) 0.001 (0.022)
-2.174 5 -1.869 14.138 0.001 (0.031) 0.001 (0.031) 0.068 (0.058) -0.001 (0.027) 0.001 (0.028) 0.000 (0.025)
0 5 0 6 0.001 (0.036) 0.001 (0.036) 0.013 (0.064) 0.000 (0.034) 0.000 (0.035) 0.000 (0.033)
2.174 5 1.869 14.138 0.000 (0.030) 0.000 (0.030) 0.001 (0.077) 0.000 (0.027) 0.001 (0.028) 0.000 (0.026)
∞ 5 2.55 20.109 0.000 (0.028) 0.000 (0.028) 0.003 (0.083) -0.001 (0.025) 0.002 (0.027) 0.001 (0.022)
−∞ 2 - - 0.014 (0.290) 0.014 (0.294) 0.170 (0.081) 0.000 (0.029) 0.003 (0.027) 0.001 (0.020)
-2.174 2 - - 0.013 (0.267) 0.014 (0.271) 0.134 (0.081) 0.000 (0.032) 0.002 (0.029) 0.001 (0.025)
0 2 - - 0.007 (0.135) 0.007 (0.135) 0.165 (0.109) 0.000 (0.041) 0.001 (0.034) 0.000 (0.034)
2.174 2 - - 0.001 (0.068) 0.001 (0.068) 0.110 (0.143) 0.001 (0.033) 0.002 (0.028) 0.001 (0.024)
∞ 2 - - 0.000 (0.066) 0.000 (0.066) 0.015 (0.119) 0.001 (0.029) 0.003 (0.027) 0.001 (0.020)
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 33
Table 6. Rate of rejection with correct sign of the indirect effect (realized power; the higher
the better) for simulation design 3 with skew-t error distributions in the setting with
mediation (𝑎 = 𝑏 = 0.4). The parameters 𝜆 and 𝜈 control the skewness and kurtosis of the
skew-t distribution.
Table 7. Bias and standard deviation (in parenthesis) for simulation design 3 with skew-t
error distributions in the setting with no mediation (𝑎 = 0.4, 𝑏 = 0). The parameters 𝜆 and 𝜈
kurtosis
Excess
Box-Cox Winsorized Median
𝒏 𝝀 𝝊 OLS bootstrap OLS Sobel bootstrap bootstrap bootstrap ROBMED
100 −∞ ∞ -0.995 0.869 0.001 (0.042) 0.001 (0.042) 0.004 (0.068) -0.001 (0.042) 0.003 (0.048) 0.002 (0.042)
-2.174 ∞ -0.5 0.347 0.000 (0.042) 0.000 (0.042) 0.002 (0.061) 0.000 (0.042) 0.001 (0.046) 0.001 (0.044)
0 ∞ 0 0 0.000 (0.042) 0.000 (0.042) 0.001 (0.044) 0.000 (0.043) 0.000 (0.047) 0.001 (0.046)
2.174 ∞ 0.5 0.347 0.000 (0.042) 0.000 (0.042) 0.001 (0.055) 0.000 (0.042) 0.001 (0.045) 0.000 (0.044)
∞ ∞ 0.995 0.869 0.000 (0.041) 0.000 (0.041) 0.002 (0.058) -0.002 (0.041) 0.002 (0.046) 0.001 (0.041)
−∞ 5 -2.55 20.109 0.001 (0.045) 0.001 (0.045) 0.004 (0.064) -0.001 (0.040) 0.005 (0.040) 0.003 (0.035)
-2.174 5 -1.869 14.138 0.000 (0.044) 0.001 (0.044) 0.002 (0.060) 0.000 (0.041) 0.002 (0.039) 0.002 (0.038)
0 5 0 6 0.000 (0.044) 0.000 (0.044) 0.001 (0.051) 0.000 (0.042) 0.000 (0.040) 0.001 (0.040)
2.174 5 1.869 14.138 0.000 (0.043) 0.000 (0.043) 0.001 (0.052) 0.000 (0.040) 0.002 (0.038) 0.001 (0.037)
∞ 5 2.55 20.109 0.000 (0.043) 0.000 (0.043) 0.001 (0.042) -0.001 (0.039) 0.004 (0.038) 0.001 (0.033)
−∞ 2 - - 0.000 (0.081) 0.000 (0.075) 0.004 (0.067) 0.002 (0.038) 0.007 (0.028) 0.002 (0.022)
-2.174 2 - - 0.000 (0.080) 0.000 (0.075) 0.003 (0.066) 0.001 (0.039) 0.005 (0.028) 0.002 (0.025)
0 2 - - 0.000 (0.097) 0.001 (0.095) 0.001 (0.079) 0.001 (0.040) 0.001 (0.028) 0.001 (0.029)
2.174 2 - - 0.001 (0.080) 0.001 (0.079) 0.002 (0.077) 0.001 (0.036) 0.004 (0.027) 0.001 (0.025)
∞ 2 - - 0.001 (0.073) 0.001 (0.072) 0.000 (0.039) 0.001 (0.035) 0.007 (0.027) 0.002 (0.022)
250 −∞ ∞ -0.995 0.869 0.000 (0.025) 0.000 (0.025) 0.002 (0.043) -0.002 (0.025) 0.000 (0.029) -0.001 (0.025)
-2.174 ∞ -0.5 0.347 0.000 (0.025) 0.000 (0.025) 0.001 (0.038) -0.001 (0.025) -0.001 (0.027) -0.001 (0.026)
0 ∞ 0 0 0.000 (0.025) 0.000 (0.025) 0.000 (0.026) 0.000 (0.025) -0.001 (0.028) -0.001 (0.026)
2.174 ∞ 0.5 0.347 0.000 (0.025) 0.000 (0.025) 0.001 (0.031) -0.001 (0.025) 0.000 (0.028) 0.000 (0.026)
∞ ∞ 0.995 0.869 0.000 (0.025) 0.000 (0.025) 0.002 (0.032) -0.002 (0.025) 0.000 (0.029) 0.000 (0.025)
−∞ 5 -2.55 20.109 0.001 (0.028) 0.001 (0.028) 0.003 (0.041) -0.002 (0.024) 0.001 (0.023) 0.000 (0.019)
-2.174 5 -1.869 14.138 0.001 (0.028) 0.001 (0.028) 0.001 (0.038) -0.001 (0.024) 0.000 (0.023) -0.001 (0.021)
0 5 0 6 0.000 (0.026) 0.001 (0.026) 0.000 (0.034) 0.000 (0.024) -0.001 (0.023) -0.001 (0.023)
2.174 5 1.869 14.138 0.000 (0.026) 0.000 (0.026) 0.001 (0.032) -0.001 (0.024) 0.000 (0.023) 0.000 (0.022)
∞ 5 2.55 20.109 0.000 (0.026) 0.000 (0.026) 0.002 (0.021) -0.002 (0.024) 0.002 (0.023) 0.001 (0.019)
−∞ 2 - - 0.013 (0.337) 0.013 (0.342) 0.002 (0.045) 0.000 (0.021) 0.003 (0.014) 0.001 (0.011)
-2.174 2 - - 0.012 (0.306) 0.012 (0.312) 0.001 (0.043) 0.000 (0.022) 0.002 (0.014) 0.000 (0.013)
0 2 - - 0.005 (0.130) 0.005 (0.130) 0.001 (0.053) 0.000 (0.022) 0.000 (0.014) -0.001 (0.015)
2.174 2 - - 0.000 (0.040) 0.000 (0.038) 0.002 (0.054) 0.001 (0.022) 0.002 (0.014) 0.001 (0.012)
∞ 2 - - -0.001 (0.039) -0.001 (0.038) 0.002 (0.025) 0.001 (0.021) 0.003 (0.014) 0.001 (0.010)
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 35
Table 8. Rejection rate (realized size of the tests; the closer to 𝛼 = 0.05 the better) for
simulation design 3 with skew-t error distributions in the setting with no mediation (𝑎 = 0.4,
𝑏 = 0). The parameters 𝜆 and 𝜈 control the skewness and kurtosis of the skew-t distribution.
Table 9. Bias and standard deviation (in parenthesis) for simulation design 3 with non-normal
Table 10. Rate of rejection with correct sign of the indirect effect (realized power; the higher
the better) for simulation design 3 with non-normal errors via Fleishman’s method, in the
Table 11. Bias and standard deviation (in parenthesis) for simulation design 3 with non-
normal errors via Fleishman’s method, in the setting with no mediation (𝑎 = 0.4, 𝑏 = 0).
Table 12. Rejection rate (realized size of the tests; the closer to 𝛼 = 0.05 the better) for
simulation design 3 with non-normal errors via Fleishman’s method, in the setting with no
Model PROCESS
No Journal Year Reference OLS Outliers assumptions Bootstrap macro
Clarke, J. S., Cornelissen, J. P., & Healey, M. P. (2019). Actions speak louder than words: How figurative language and gesturing in
1 AMJ 2019 entrepreneurial pitches influences investment judgments. Academy of Management Journal, 62(2), 335-360. Yes No No Yes Yes
Lin, S. H., Scott, B. A., & Matta, F. K. (2019). The dark side of transformational leader behaviors for leaders themselves: A
2 AMJ 2019 conservation of resources perspective. Academy of Management Journal, 62(5), 1556-1582. No No No Yes No
Mitchell, M. S., Greenbaum, R. L., Vogel, R. M., Mawritz, M. B., & Keating, D. J. (2019). Can you handle the pressure? The effect
3 AMJ 2019 of performance pressure on stress appraisals, self-regulation, and behavior. Academy of Management Journal, 62(2), 531-552. No No No Yes No
Shin, J., & Grant, A. M. (2019). Bored by Interest: How Intrinsic Motivation in One Task Can Reduce Performance on Other Tasks.
4 AMJ 2019 Academy of Management Journal, 62(2), 415-436. Yes No No Yes No
Lu, S., Bartol, K. M., Venkataramani, V., Zheng, X., & Liu, X. (2019). Pitching novel ideas to the boss: The interactive effects of
employees’ idea enactment and influence tactics on creativity assessment and implementation. Academy of Management Journal,
5 AMJ 2019 62(2), 579-606. Yes No No Yes No
Shea, C. T., & Hawn, O. V. (2019). Microfoundations of corporate social responsibility and irresponsibility. Academy of
6 AMJ 2019 Management Journal, 62(5), 1609-1642. Yes No No Yes Yes
Porck, J. P., Matta, F. K., Hollenbeck, J. R., Oh, J. K., Lanaj, K., & Lee, S. M. (2019). Social identification in multiteam systems:
7 AMJ 2019 The role of depletion and task complexity. Academy of Management Journal, 62(4), 1137-1162. No No No Yes No
Sherf, E. N., Venkataramani, V., & Gajendran, R. S. (2019). Too busy to be fair? The effect of workload and rewards on managers’
8 AMJ 2019 justice rule adherence. Academy of Management Journal, 62(2), 469-502. No No No Yes No
Brands, R. A., & Mehra, A. (2019). Gender, brokerage, and performance: a construal approach. Academy of Management Journal,
9 AMJ 2019 62(1), 196-219. Yes No No Yes Yes
Venus, M., Stam, D., & Van Knippenberg, D. (2019). Visions of change as visions of continuity. Academy of Management Journal,
10 AMJ 2019 62(3), 667-690. Yes No No Yes Yes
Antino, M., Rico, R., & Thatcher, S. M. (2019). Structuring Reality Through the Faultlines Lens: The Effects of Structure, Fairness,
11 AMJ 2019 and Status Conflict on the Activated Faultlines–Performance Relationship. Academy of Management Journal, 62(5), 1444-1470. Yes No No Yes Yes
Kim, Y. J., & Toh, S. M. (2019). Stuck in the past? The influence of a leader’s past cultural experience on group culture and positive
12 AMJ 2019 and negative group deviance. Academy of Management Journal, 62(3), 944-969. Yes No No Yes Yes
Krause, R., Wu, Z., Bruton, G. D., & Carter, S. M. (2019). The coercive isomorphism ripple effect: An investigation of nonprofit
13 AMJ 2019 interlocks on corporate boards. Academy of Management Journal, 62(1), 283-308. Yes No No Yes No
Livne-Ofer, E., Coyle-Shapiro, J. A., & Pearce, J. L. (2019). Eyes Wide Open: Perceived Exploitation and Its Consequences.
14 AMJ 2019 Academy of Management Journal, 62(6), 1989-2018. No No No Yes No
Han, J. H., Kang, S., Oh, I. S., Kehoe, R. R., & Lepak, D. P. (2019). The Goldilocks Effect of Strategic Human Resource
Management? Optimizing the Benefits of a High-Performance Work System Through the Dual Alignment of Vertical and Horizontal
15 AMJ 2019 Fit. Academy of Management Journal, 62(5), 1388-1412. Yes No No Yes No
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 41
Model PROCESS
No Journal Year Reference OLS Outliers assumptions Bootstrap macro
Ehrhardt, K., & Ragins, B. R. (2019). Relational attachment at work: A complementary fit perspective on the role of relationships in
16 AMJ 2019 organizational life. Academy of Management Journal, 62(1), 248-282. Yes Yes No Yes No
Hussain, I., Shu, R., Tangirala, S., & Ekkirala, S. (2019). The voice bystander effect: How information redundancy inhibits employee
17 AMJ 2019 voice. Academy of Management Journal, 62(3), 828-849. No No No Yes No
Huang, T. Y., Souitaris, V., & Barsade, S. G. (2019). Which matters more? Group fear versus hope in entrepreneurial escalation of
18 SMJ 2019 commitment. Strategic Management Journal, 40(11), 1852-1881. Yes No No Yes Yes
Garg, S., John Li, Q., & Shaw, J. D. (2019). Entrepreneurial firms grow up: Board undervaluation, board evolution, and firm
19 SMJ 2019 performance in newly public firms. Strategic Management Journal, 40(11), 1882-1907. No No No No No
Petrenko, O. V., Aime, F., Recendes, T., & Chandler, J. A. (2019). The case for humble expectations: CEO humility and market
20 SMJ 2019 performance. Strategic Management Journal, 40(12), 1938-1964. Yes No No No No
Wang, L., Wu, B., Pechmann, C., & Wang, Y. (2019). The performance effects of creative imitation on original products: Evidence
21 SMJ 2019 from lab and field experiments. Strategic Management Journal. Yes No No Yes No
Li, J., Li, P., & Wang, B. (2019). The liability of opaqueness: State ownership and the likelihood of deal completion in international
22 SMJ 2019 acquisitions by Chinese firms. Strategic Management Journal, 40(2), 303-327. No No No Yes No
Westphal, J. D., & Zhu, D. H. (2019). Under the radar: How firms manage competitive uncertainty by appointing friends of other
23 SMJ 2019 chief executive officers to their boards. Strategic Management Journal, 40(1), 79-107. No No No Yes No
Hill, A. D., Recendes, T., & Ridge, J. W. (2019). Second‐order effects of CEO characteristics: How rivals' perceptions of CEOs as
24 SMJ 2019 submissive and provocative precipitate competitive attacks. Strategic Management Journal, 40(5), 809-835. No No No No No
Ng, T. W., & Yam, K. C. (2019). When and why does employee creativity fuel deviance? Key psychological mechanisms. Journal of
25 JAP 2019 Applied Psychology, 104(9), 1144. No No No No No
Moore, C., Mayer, D. M., Chiang, F. F., Crossley, C., Karlesky, M. J., & Birtch, T. A. (2019). Leaders matter morally: The role of
26 JAP 2019 ethical leadership in shaping employee moral cognition and misconduct. Journal of Applied Psychology, 104(1), 123. Yes No No Yes Yes
Evans, J. B., Slaughter, J. E., Ellis, A. P., & Rivin, J. M. (2019). Gender and the evaluation of humor at work. Journal of Applied
27 JAP 2019 Psychology. Yes No No Yes No
Lievens, F., Sackett, P. R., Dahlke, J. A., Oostrom, J. K., & De Soete, B. (2019). Constructed response formats and their effects on
28 JAP 2019 minority–majority differences and validity. Journal of Applied Psychology, 104(5), 715. Yes No No Yes Yes
Wang, L., Law, K. S., Zhang, M. J., Li, Y. N., & Liang, Y. (2019). It’s mine! Psychological ownership of one’s job explains positive
29 JAP 2019 and negative workplace outcomes of job engagement. Journal of Applied Psychology, 104(2), 229. No No No Yes No
Bindl, U. K., Unsworth, K. L., Gibson, C. B., & Stride, C. B. (2019). Job crafting revisited: Implications of an extended framework
30 JAP 2019 for active changes at work. Journal of Applied Psychology, 104(5), 605. Yes No No Yes No
Rosen, C. C., Simon, L. S., Gajendran, R. S., Johnson, R. E., Lee, H. W., & Lin, S. H. J. (2019). Boxed in by your inbox:
31 JAP 2019 Implications of daily e-mail demands for managers’ leadership behaviors. Journal of Applied Psychology, 104(1), 19. No No No No No
Owens, B. P., Yam, K. C., Bednar, J. S., Mao, J., & Hart, D. W. (2019). The impact of leader moral humility on follower moral self-
32 JAP 2019 efficacy and behavior. Journal of Applied Psychology, 104(1), 146. Yes No No Yes Yes
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 42
Model PROCESS
No Journal Year Reference OLS Outliers assumptions Bootstrap macro
Diefendorff, J. M., Gabriel, A. S., Nolan, M. T., & Yang, J. (2019). Emotion regulation in the context of customer mistreatment and
33 JAP 2019 felt affect: An event-based profile approach. Journal of Applied Psychology, 104(7), 965. No No No Yes No
Huang, Y. S. S., Greenbaum, R. L., Bonner, J. M., & Wang, C. S. (2019). Why sabotage customers who mistreat you? Activated
34 JAP 2019 hostility and subsequent devaluation of targets as a moral disengagement mechanism. Journal of Applied Psychology, 104(4), 495. No No No Yes No
Lee, H. W., Bradburn, J., Johnson, R. E., Lin, S. H. J., & Chang, C. H. D. (2019). The benefits of receiving gratitude for helpers: A
35 JAP 2019 daily investigation of proactive and reactive helping at work. Journal of Applied Psychology, 104(2), 197. No Yes No No No
Porter, C. M., Woo, S. E., Allen, D. G., & Keith, M. G. (2019). How do instrumental and expressive network positions relate to
36 JAP 2019 turnover? A meta-analytic investigation. Journal of Applied Psychology, 104(4), 511. No No No No No
Hernandez, M., Avery, D. R., Volpone, S. D., & Kaiser, C. R. (2019). Bargaining while Black: The role of race in salary negotiations.
37 JAP 2019 Journal of Applied Psychology, 104(4), 581. Yes No No No No
Koopmann, J., Johnson, R. E., Wang, M., Lanaj, K., Wang, G., & Shi, J. (2019). A self-regulation perspective on how and when
38 JAP 2019 regulatory focus differentially relates to citizenship behaviors. Journal of Applied Psychology, 104(5), 629. No No No Yes No
Taylor, S. G., Griffith, M. D., Vadera, A. K., Folger, R., & Letwin, C. R. (2019). Breaking the cycle of abusive supervision: How
39 JAP 2019 disidentification and moral identity help the trickle-down change course. Journal of Applied Psychology, 104(1), 164. No Yes No Yes No
Liao, Z., Liu, W., Li, X., & Song, Z. (2019). Give and take: An episodic perspective on leader-member exchange. Journal of Applied
40 JAP 2019 Psychology, 104(1), 34. No No No Yes No
Koopman, J., Scott, B. A., Matta, F. K., Conlon, D. E., & Dennerlein, T. (2019). Ethical leadership as a substitute for justice
41 JAP 2019 enactment: An information-processing perspective. Journal of Applied Psychology, 104(9), 1103. No No No Yes No
Lennard, A. C., Scott, B. A., & Johnson, R. E. (2019). Turning frowns (and smiles) upside down: A multilevel examination of surface
42 JAP 2019 acting positive and negative emotions on well-being. Journal of Applied Psychology, 104(9), 1164. No No No Yes No
Chen, G., Smith, T. A., Kirkman, B. L., Zhang, P., Lemoine, G. J., & Farh, J. L. (2019). Multiple team membership and
43 JAP 2019 empowerment spillover effects: Can empowerment processes cross team boundaries?. Journal of Applied Psychology, 104(3), 321. No Yes No Yes No
Carlson, D. S., Thompson, M. J., & Kacmar, K. M. (2019). Double crossed: The spillover and crossover effects of work demands on
44 JAP 2019 work outcomes through the family. Journal of Applied Psychology, 104(2), 214. No Yes Yes No No
Mayer, D. M., Ong, M., Sonenshein, S., & Ashford, S. J. (2019). The money or the morals? When moral language is more effective
45 JAP 2019 for selling social issues. Journal of Applied Psychology, 104(8), 1058. Yes Yes No Yes Yes
Gündemir, S., Carton, A. M., & Homan, A. C. (2019). The impact of organizational performance on the emergence of Asian
46 JAP 2019 American leaders. Journal of Applied Psychology, 104(1), 107. Yes No No Yes Yes
Sitzmann, T., Ployhart, R. E., & Kim, Y. (2019). A process model linking occupational strength to attitudes and behaviors: The
47 JAP 2019 explanatory role of occupational personality heterogeneity. Journal of Applied Psychology, 104(2), 247. No No No Yes No
Liu, Z., Riggio, R. E., Day, D. V., Zheng, C., Dai, S., & Bian, Y. (2019). Leader development begins at home: Overparenting harms
48 JAP 2019 adolescent leader emergence. Journal of Applied Psychology, 104(10), 1226. No Yes No Yes No
Zhou, Y., Zou, M., Woods, S. A., & Wu, C. H. (2019). The restorative effect of work after unemployment: An intraindividual
49 JAP 2019 analysis of subjective well-being recovery through reemployment. Journal of Applied Psychology, 104(9), 1195. Yes No No Yes No
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 43
Model PROCESS
No Journal Year Reference OLS Outliers assumptions Bootstrap macro
Booth-LeDoux, S. M., Matthews, R. A., & Wayne, J. H. (2019). Testing a resource-based spillover-crossover-spillover model:
50 JAP 2019 Transmission of social support in dual-earner couples. Journal of Applied Psychology. No No No Yes No
Lanaj, K., Foulk, T. A., & Erez, A. (2019). Energizing leaders via self-reflection: A within-person field experiment. Journal of
51 JAP 2019 Applied Psychology, 104(1), 1. No No No Yes No
Hulshof, I. L., Demerouti, E., & Le Blanc, P. M. (2019). Reemployment crafting: Proactively shaping one’s job search. Journal of
52 JAP 2019 Applied Psychology. No No No Yes No
Sessions, H., Nahrgang, J. D., Newton, D. W., & Chamberlin, M. (2019). I’m tired of listening: The effects of supervisor appraisals
53 JAP 2019 of group voice on supervisor emotional exhaustion and performance. Journal of Applied Psychology. Yes No No Yes Yes
Lin, K. J., Savani, K., & Ilies, R. (2019). Doing good, feeling good? The roles of helping motivation and citizenship pressure. Journal
54 JAP 2019 of Applied Psychology. Yes No No Yes Yes
Qin, X., Chen, C., Yam, K. C., Huang, M., & Ju, D. (2019). The double-edged sword of leader humility: Investigating when and why
55 JAP 2019 leader humility promotes versus inhibits subordinate deviance. Journal of Applied Psychology. Yes No No Yes No
Priesemuth, M., & Bigelow, B. (2019). It hurts me too!(or not?): Exploring the negative implications for abusive bosses. Journal of
56 JAP 2019 Applied Psychology. Yes No No Yes No
Cowen, A. P., & Montgomery, N. V. (2019). To be or not to be sorry? How CEO gender impacts the effectiveness of organizational
57 JAP 2019 apologies. Journal of Applied Psychology. Yes No No Yes Yes
McCarthy, J. E., & Levin, D. Z. (2019). Network residues: The enduring impact of intra-organizational dormant ties. Journal of
58 JAP 2019 Applied Psychology. Yes Yes No Yes No
Hall, E. V., Avery, D. R., McKay, P. F., Blot, J. F., & Edwards, M. (2019). Composition and compensation: The moderating effect of
individual and team performance on the relationship between Black team member representation and salary. Journal of Applied
59 JAP 2019 Psychology, 104(3), 448. No No No Yes No
Burmeister, A., Wang, M., & Hirschi, A. (2019). Understanding the motivational benefits of knowledge transfer for older and
60 JAP 2019 younger workers in age-diverse coworker dyads: An actor–partner interdependence model. Journal of applied psychology. No No No Yes No
Maltarich, M. A., Reilly, G., & DeRose, C. (2019). A theoretical assessment of dismissal rates and unit performance, with empirical
61 JAP 2019 evidence. Journal of Applied Psychology. No No No Yes No
Rapp, T. L., & Mathieu, J. E. (2019). Team and individual influences on members’ identification and performance per membership in
62 JAP 2019 multiple team membership arrangements. Journal of Applied Psychology, 104(3), 303. No No No Yes No
Parker, S. K., Andrei, D. M., & Van den Broeck, A. (2019). Poor work design begets poor work design: Capacity and willingness
63 JAP 2019 antecedents of individual work design behavior. Journal of Applied Psychology. Yes Yes No Yes Yes
Mohr, J. J., Markell, H. M., King, E. B., Jones, K. P., Peddie, C. I., & Kendra, M. S. (2019). Affective antecedents and consequences
64 JAP 2019 of revealing and concealing a lesbian, gay, or bisexual identity. Journal of Applied Psychology. No No Yes Yes No
Yu, K. Y. T. (2019). Influencing how one is seen by potential talent: Organizational impression management among recruiting firms.
65 JAP 2019 Journal of Applied Psychology. Yes No No Yes Yes
Hu, J., Zhang, Z., Jiang, K., & Chen, W. (2019). Getting ahead, getting along, and getting prosocial: Examining extraversion facets,
66 JAP 2019 peer reactions, and leadership emergence. Journal of Applied Psychology. No No No Yes No
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 44
Model PROCESS
No Journal Year Reference OLS Outliers assumptions Bootstrap macro
Ouyang, K., Cheng, B. H., Lam, W., & Parker, S. K. (2019). Enjoy your evening, be proactive tomorrow: How off-job experiences
67 JAP 2019 shape daily proactivity. Journal of Applied Psychology, 104(8), 1003. No No No Yes No
Zhang, Y., Zhang, Y., Ng, T. W., & Lam, S. S. (2019). Promotion-and prevention-focused coping: A meta-analytic examination of
68 JAP 2019 regulatory strategies in the work stress process. Journal of Applied Psychology, 104(10), 1296. No No No Yes No
Jacob, G. H., Frese, M., Krauss, S. I., & Friedrich, C. (2019). On the importance of a motivational agency variable: Being a formal
business in developing countries is only helpful for growth if business owners show a high degree of personal initiative. Journal of
69 JAP 2019 Applied Psychology, 104(9), 1181. Yes Yes No Yes Yes
Foulk, T. A., Lanaj, K., & Krishnan, S. (2019). The virtuous cycle of daily motivation: Effects of daily strivings on work behaviors,
70 JAP 2019 need satisfaction, and next-day strivings. Journal of Applied Psychology, 104(6), 755. No No No Yes No
Beck, J. W., Schmidt, A. M., & Natali, M. W. (2019). Efficient proximal resource allocation strategies predict distal team
71 JAP 2019 performance: Evidence from the National Hockey League. Journal of Applied Psychology. No No No Yes No
Org. Reiche, B. S., & Neeley, T. B. (2019). Head, Heart, or Hands: How Do Employees Respond to a Radical Global Language Change
72 Sci. 2019 over Time?. Organization Science, 30(6), 1252-1269. No No No Yes No
Org. Chatman, J. A., Greer, L. L., Sherman, E., & Doerr, B. (2019). Blurred lines: How the collectivism norm operates through perceived
73 Sci. 2019 group diversity to boost or harm group performance in Himalayan mountain climbing. Organization Science, 30(2), 235-259. Yes No No Yes Yes
Org. DesJardine, M., & Bansal, P. (2019). One step forward, two steps back: How negative external evaluations can shorten organizational
74 Sci. 2019 time horizons. Organization Science, 30(4), 761-780. No No No No No
Org. Radoynovska, N., & King, B. G. (2019). To Whom Are You True? Audience Perceptions of Authenticity in Nascent Crowdfunding
75 Sci. 2019 Ventures. Organization Science, 30(4), 781-802. Yes No No Yes Yes
Org. Baker, B., Derfler-Rozin, R., Pitesa, M., & Johnson, M. (2019). Stock Market Responses to Unethical Behavior in Organizations: An
76 Sci. 2019 Organizational Context Model. Organization Science, 30(2), 319-336. No No No Yes No
Org. Carson Marr, J., Pettit, N., & Thau, S. (2019). After the Fall: How Perceived Self-Control Protects the Legitimacy of Higher-Ranking
77 Sci. 2019 Employees after Status Loss. Organization Science, 30(6), 1165-1188. Yes No No Yes Yes
Org. Sherf, E. N., Tangirala, S., & Venkataramani, V. (2019). Why Managers Do Not Seek Voice from Employees: The Importance of
78 Sci. 2019 Managers’ Personal Control and Long-Term Orientation. Organization Science, 30(3), 447-466. Yes Yes No Yes Yes
Org. Wilhelm, H., Richter, A. W., & Semrau, T. (2019). Employee Learning from Failure: A Team-as-Resource Perspective. Organization
79 Sci. 2019 Science, 30(4), 694-714. No No No Yes No
Org. Lee, S. (2019). Learning-by-moving: can reconfiguring spatial proximity between organizational members promote individual-level
80 Sci. 2019 exploration?. Organization Science, 30(3), 467-488. Yes No No No No
Org. Moore, C. B., Payne, G. T., Filatotchev, I., & Zajac, E. J. (2019). The Cost of Status: When Social and Economic Interests Collide.
81 Sci. 2019 Organization Science, 30(5), 869-884. No No No No No
Org. Anderson, T., & Bidwell, M. (2019). Outside insiders: Understanding the role of contracting in the careers of managerial workers.
82 Sci. 2019 Organization Science, 30(5), 1000-1029. Yes No No Yes No
Galperin, R. V., Hahl, O., Sterling, A. D., & Guo, J. (2019). Too good to hire? Capability and inferences about commitment in labor
83 ASQ 2019 markets. Administrative Science Quarterly, 0001839219840022. Yes No No Yes Yes
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 45
Model PROCESS
No Journal Year Reference OLS Outliers assumptions Bootstrap macro
DeCelles, K. A., Sonenshein, S., & King, B. G. (2019). Examining Anger’s Immobilizing Effect on Institutional Insiders’ Action
84 ASQ 2019 Intentions in Social Movements. Administrative Science Quarterly, 0001839219879646. Yes No No Yes Yes
Uribe, J., Sytch, M., & Kim, Y. H. (2019). When Friends Become Foes: Collaboration as a Catalyst for Conflict. Administrative
85 ASQ 2019 Science Quarterly, 0001839219877507. Yes No No No No
Inoue, C. (2019). Election Cycles and Organizations: How Politics Shapes the Performance of State-owned Enterprises over Time.
86 ASQ 2019 Administrative Science Quarterly, 0001839219869913. Yes No Yes No No
Running head: ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 46
Figures
0.0
−0.1
−0.2
Bias
−0.3
−0.4
−0.5
1.00
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a and b
Figure 1. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.2, … , 0.8), sample size 𝑛 = 50. The top row
shows the average bias of the indirect effect and includes a horizontal reference line at 0 for
no bias. The bottom row displays the rate of how often the methods reject the null hypothesis
and the corresponding estimate of ab has the correct sign (a measure of realized power of the
tests in the presence of outliers; the higher this rate the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 47
0.0
−0.2
Bias
−0.4
−0.6
1.00
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a and b
Figure 2. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.2, … , 0.8), and sample size 𝑛 = 100. The top
row shows the average bias of the indirect effect and includes a horizontal reference line at 0
for no bias. The bottom row displays the rate of how often the methods reject the null
hypothesis and the corresponding estimate of ab has the correct sign (a measure of realized
power of the tests in the presence of outliers; the higher this rate the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 48
0.0
−0.2
Bias
−0.4
−0.6
1.00
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a and b
Figure 3. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.2, … , 0.8), and sample size 𝑛 = 250. The top
row shows the average bias of the indirect effect and includes a horizontal reference line at 0
for no bias. The bottom row displays the rate of how often the methods reject the null
hypothesis and the corresponding estimate of ab has the correct sign (a measure of realized
power of the tests in the presence of outliers; the higher this rate the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 49
0.00
Bias
−0.25
−0.50
1.00
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a and b
Figure 4. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.2, … , 0.8), and sample size 𝑛 = 500. The top
row shows the average bias of the indirect effect and includes a horizontal reference line at 0
for no bias. The bottom row displays the rate of how often the methods reject the null
hypothesis and the corresponding estimate of ab has the correct sign (a measure of realized
power of the tests in the presence of outliers; the higher this rate the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 50
0.00
Bias
−0.25
−0.50
1.00
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a and b
Figure 5. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.2, … , 0.8), and sample size 𝑛 = 1000. The top
row shows the average bias of the indirect effect and includes a horizontal reference line at 0
for no bias. The bottom row displays the rate of how often the methods reject the null
hypothesis and the corresponding estimate of ab has the correct sign (a measure of realized
power of the tests in the presence of outliers; the higher this rate the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 51
0.0
Relative bias
−0.1
−0.2
−0.3
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a (b = 0)
Figure 6. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with no mediation (𝑎 = 𝑐 = 0.2, … , 0.8, 𝑏 = 0), and sample size 𝑛 = 50. The
top row shows the average relative bias of the indirect effect with respect to the effect size of
𝑎, and includes a horizontal reference line at 0 for no bias. The bottom row displays the
rejection rate of the corresponding tests (i.e., the realized size), and a horizontal line is drawn
for the nominal size 𝛼 = 0.05 (the closer to this line the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 52
0.0
−0.1
Relative bias
−0.2
−0.3
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a (b = 0)
Figure 7. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with no mediation (𝑎 = 𝑐 = 0.2, … , 0.8, 𝑏 = 0), and sample size 𝑛 = 100. The
top row shows the average relative bias of the indirect effect with respect to the effect size of
𝑎, and includes a horizontal reference line at 0 for no bias. The bottom row displays the
rejection rate of the corresponding tests (i.e., the realized size), and a horizontal line is drawn
for the nominal size 𝛼 = 0.05 (the closer to this line the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 53
−0.1
Relative bias
−0.2
−0.3
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a (b = 0)
Figure 8. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with no mediation (𝑎 = 𝑐 = 0.2, … , 0.8, 𝑏 = 0), and sample size 𝑛 = 250. The
top row shows the average relative bias of the indirect effect with respect to the effect size of
𝑎, and includes a horizontal reference line at 0 for no bias. The bottom row displays the
rejection rate of the corresponding tests (i.e., the realized size), and a horizontal line is drawn
for the nominal size 𝛼 = 0.05 (the closer to this line the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 54
−0.1
Relative bias
−0.2
−0.3
−0.4
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a (b = 0)
Figure 9. Results from 1000 simulation runs for simulation design 1 with varying effect sizes
in the setting with no mediation (𝑎 = 𝑐 = 0.2, … , 0.8, 𝑏 = 0), and sample size 𝑛 = 500. The
top row shows the average relative bias of the indirect effect with respect to the effect size of
𝑎, and includes a horizontal reference line at 0 for no bias. The bottom row displays the
rejection rate of the corresponding tests (i.e., the realized size), and a horizontal line is drawn
for the nominal size 𝛼 = 0.05 (the closer to this line the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 55
−0.1
Relative bias
−0.2
−0.3
−0.4
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8
Effect size of a (b = 0)
Figure 10. Results from 1000 simulation runs for simulation design 1 with varying effect
sizes in the setting with no mediation (𝑎 = 𝑐 = 0.2, … , 0.8, 𝑏 = 0), and sample size 𝑛 =
1000. The top row shows the average relative bias of the indirect effect with respect to the
effect size of 𝑎, and includes a horizontal reference line at 0 for no bias. The bottom row
displays the rejection rate of the corresponding tests (i.e., the realized size), and a horizontal
line is drawn for the nominal size 𝛼 = 0.05 (the closer to this line the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 56
−0.2
Bias
−0.4
−0.6
1.00
0.50
0.25
0.00
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
Sample size
Figure 11. Results from 1000 simulation runs for simulation design 1 for the setting with
outliers, varying sample size 𝑛 = 50, 100, 250, 500, 1000, and varying effect sizes in the
setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.2, … , 0.8). The top row shows the average bias of the
indirect effect and includes a horizontal reference line at 0 for no bias. The bottom row
displays the rate of how often the methods reject the null hypothesis and the corresponding
estimate of ab has the correct sign (a measure of realized power of the tests in the presence of
−0.1
Relative bias
−0.2
−0.3
−0.4
1.00
0.75
Rejection rate
0.50
0.25
0.00
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
50
100
250
500
1000
Sample size
Figure 12. Results from 1000 simulation runs for simulation design 1 for the setting with
outliers, varying sample size 𝑛 = 50, 100, 250, 500, 1000, and varying effect sizes in the
setting with no mediation (𝑎 = 𝑐 = 0.2, … , 0.8, 𝑏 = 0). The top row shows the average
relative bias of the indirect effect with respect to the effect size of 𝑎, and includes a
horizontal reference line at 0 for no bias. The bottom row displays the rejection rate of the
corresponding tests (i.e., the realized size), and a horizontal line is drawn for the nominal size
n = 100 n = 250
Indirect effect ab
0.1
0.0
0.75
0.50
0.25
0.00
0 2 4 6 0 2 4 6
Shift of outliers
Figure 13. Results from 1000 simulation runs for simulation design 2 with varying distance
of outliers (outlier probability 2%) and the setting with mediation (𝑎 = 0.4, 𝑏 = 0.4). The top
row shows the average estimates of the indirect effect and includes a horizontal reference line
for the true indirect effect 𝑎𝑏 = 0.16. The bottom row displays the rate of how often the
methods reject the null hypothesis and the corresponding estimate of ab has the correct sign
(a measure of realized power of the tests in the presence of outliers; the higher this rate the
better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 59
n = 100 n = 250
0.00
Indirect effect ab
−0.05
−0.10
−0.15
1.00
0.75
Rejection rate
0.50
0.25
0.00
0 2 4 6 0 2 4 6
Shift of outliers
Figure 14. Results from 1000 simulation runs for simulation design 2 with varying distance
of outliers (outlier probability 2%) and the setting with no mediation (𝑎 = 0.4, 𝑏 = 0). The
top row shows the average estimates of the indirect effect and includes a horizontal reference
line for the true indirect effect 𝑎𝑏 = 0. The bottom row displays the rejection rate of the
corresponding tests (i.e., the realized size), and a horizontal line is drawn for the nominal size
n = 100 n = 250
0.3
Indirect effect ab
0.2
0.1
0.0
0.75
0.50
0.25
0.00
OLS bootstrap
OLS Sobel
Box−Cox bootstrap
Winsorized bootstap
Median bootstrap
ROBMED
OLS bootstrap
OLS Sobel
Box−Cox bootstrap
Winsorized bootstap
Median bootstrap
ROBMED
Figure 15. Results from 1000 simulation runs for simulation design 2 with centered log-
normal distributions and the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.4). The top row shows
the average estimates of the indirect effect and includes a horizontal reference line for the
true indirect effect 𝑎𝑏 = 0.16. The bottom row displays the rate of how often the methods
reject the null hypothesis and the corresponding estimate of ab has the correct sign (a
measure of realized power of the tests in the presence of outliers; the higher this rate the
better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 61
n = 100 n = 250
0.05
Indirect effect ab
0.00
−0.05
−0.10
1.00
0.75
Rejection rate
0.50
0.25
0.00
OLS bootstrap
OLS Sobel
Box−Cox bootstrap
Winsorized bootstap
Median bootstrap
ROBMED
OLS bootstrap
OLS Sobel
Box−Cox bootstrap
Winsorized bootstap
Median bootstrap
ROBMED
Figure 16. Results from 1000 simulation runs for simulation design 2 with centered log-
normal distributions and the setting with mediation (𝑎 = 𝑐 = 0.4, 𝑏 = 0). The top row shows
the average estimates of the indirect effect and includes a horizontal reference line for the
true indirect effect 𝑎𝑏 = 0. The bottom row displays the rejection rate of the corresponding
tests (i.e., the realized size), and a horizontal line is drawn for the nominal size 𝛼 = 0.05 (the
n = 100 n = 250
Indirect effect ab
0.00
−0.05
−0.10
0.6
0.4
0.2
0.0
0.0% 1.0% 2.0% 3.0% 4.0% 0.0% 1.0% 2.0% 3.0% 4.0%
Percentage of outliers
Figure 17. Results from 1000 simulation runs for simulation design 4 with varying
percentage of outliers and the setting with mediation (𝑎 = 0.2, 𝑏 = 0.2). The top row shows
the average estimates of the indirect effect and includes a horizontal reference line for the
true indirect effect 𝑎𝑏 = 0.04. The bottom row displays the rate of how often the methods
reject the null hypothesis and the corresponding estimate of ab has the correct sign (a
measure of realized power of the tests in the presence of outliers; the higher this rate the
better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 63
n = 100 n = 250
0.00
−0.03
Indirect effect ab
−0.06
−0.09
−0.12
0.8
0.6
Rejection rate
0.4
0.2
0.0
0.0% 1.0% 2.0% 3.0% 4.0% 0.0% 1.0% 2.0% 3.0% 4.0%
Percentage of outliers
Figure 18. Results from 1000 simulation runs for simulation design 4 with varying
percentage of outliers and the setting with no mediation (𝑎 = 0.2, 𝑏 = 0). The top row shows
the average estimates of the indirect effect and includes a horizontal reference line for the
true indirect effect 𝑎𝑏 = 0. The bottom row displays the rejection rate of the corresponding
tests (i.e., the realized size), and a horizontal line is drawn for the nominal size 𝛼 = 0.05 (the
n = 100 n = 250
0.05
0.00
Indirect effect ab
−0.05
−0.10
−0.15
0.6
0.4
0.2
0.0
0 5 10 15 0 5 10 15
Shift of outliers
Figure 19. Results from 1000 simulation runs for simulation design 4 with varying distance
of outliers and the setting with mediation (𝑎 = 0.2, 𝑏 = 0.2). The top row shows the average
estimates of the indirect effect and includes a horizontal reference line for the true indirect
effect 𝑎𝑏 = 0.04. The bottom row displays the rate of how often the methods reject the null
hypothesis and the corresponding estimate of ab has the correct sign (a measure of realized
power of the tests in the presence of outliers; the higher this rate the better).
ROBUST MEDIATION ANALYSIS: ONLINE APPENDIX 65
n = 100 n = 250
0.00
Indirect effect ab
−0.05
−0.10
−0.15
0.8
0.6
Rejection rate
0.4
0.2
0.0
0 5 10 15 0 5 10 15
Shift of outliers
Figure 20. Results from 1000 simulation runs for simulation design 4 with varying distance
of outliers and the setting with no mediation (𝑎 = 0.2, 𝑏 = 0). The top row shows the
average estimates of the indirect effect and includes a horizontal reference line for the true
indirect effect 𝑎𝑏 = 0. The bottom row displays the rejection rate of the corresponding tests
(i.e., the realized size), and a horizontal line is drawn for the nominal size 𝛼 = 0.05 (the
1.0
Indirect effect ab
0.5
0.0
−0.5
0.75
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of a and b
Figure 21. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample size 𝑛 =
50. The top row contains box plots of the estimates of the indirect effect, including horizontal
reference lines for the true indirect effect 𝑎𝑏. Points outside the whiskers are not displayed
for better readability. The bottom row displays the rate of how often the methods reject the
null hypothesis and the corresponding estimate of ab has the correct sign (a measure of
realized power of the tests; the higher this rate the better). The columns correspond to the
Indirect effect ab
0.8
0.4
0.0
0.75
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of a and b
Figure 22. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample size 𝑛 =
100. The top row contains box plots of the estimates of the indirect effect, including
horizontal reference lines for the true indirect effect 𝑎𝑏. Points outside the whiskers are not
displayed for better readability. The bottom row displays the rate of how often the methods
reject the null hypothesis and the corresponding estimate of ab has the correct sign (a
measure of realized power of the tests; the higher this rate the better). The columns
Indirect effect ab
0.8
0.4
0.0
0.75
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of a and b
Figure 23. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample size 𝑛 =
200. The top row contains box plots of the estimates of the indirect effect, including
horizontal reference lines for the true indirect effect 𝑎𝑏. Points outside the whiskers are not
displayed for better readability. The bottom row displays the rate of how often the methods
reject the null hypothesis and the corresponding estimate of ab has the correct sign (a
measure of realized power of the tests; the higher this rate the better). The columns
0.9
Indirect effect ab
0.6
0.3
0.0
0.75
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of a and b
Figure 24. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with mediation (𝑎 = 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample size 𝑛 =
500. The top row contains box plots of the estimates of the indirect effect, including
horizontal reference lines for the true indirect effect 𝑎𝑏. Points outside the whiskers are not
displayed for better readability. The bottom row displays the rate of how often the methods
reject the null hypothesis and the corresponding estimate of ab has the correct sign (a
measure of realized power of the tests; the higher this rate the better). The columns
0.4
Indirect effect ab
0.0
−0.4
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of b (a = 0)
Figure 25. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with no mediation (𝑎 = 0, 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample
size 𝑛 = 50. The top row contains box plots of the estimates of the indirect effect, including a
horizontal reference line for the true indirect effect 𝑎𝑏 = 0. Points outside the whiskers are
not displayed for better readability. The bottom row displays the rejection rate of the
corresponding tests (i.e., the realized size), and a horizontal line is drawn for the nominal size
𝛼 = 0.05 (the closer to this line the better). The columns correspond to the three considered
0.25
Indirect effect ab
0.00
−0.25
−0.50
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of b (a = 0)
Figure 26. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with no mediation (𝑎 = 0, 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample
size 𝑛 = 100. The top row contains box plots of the estimates of the indirect effect, including
a horizontal reference line for the true indirect effect 𝑎𝑏 = 0. Points outside the whiskers are
not displayed for better readability. The bottom row displays the rejection rate of the
corresponding tests (i.e., the realized size), and a horizontal line is drawn for the nominal size
𝛼 = 0.05 (the closer to this line the better). The columns correspond to the three considered
0.2
Indirect effect ab
0.0
−0.2
−0.4
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of b (a = 0)
Figure 27. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with no mediation (𝑎 = 0, 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample
size 𝑛 = 200. The top row contains box plots of the estimates of the indirect effect, including
a horizontal reference line for the true indirect effect 𝑎𝑏 = 0. Points outside the whiskers are
not displayed for better readability. The bottom row displays the rejection rate of the
corresponding tests (i.e., the realized size), and a horizontal line is drawn for the nominal size
𝛼 = 0.05 (the closer to this line the better). The columns correspond to the three considered
0.2
Indirect effect ab
0.1
0.0
−0.1
−0.2
1.00
0.75
Rejection rate
0.50
0.25
0.00
0.14 0.39 0.59 0.14 0.39 0.59 0.14 0.39 0.59
Effect size of b (a = 0)
Figure 28. Results from 1000 simulation runs for simulation design 5 with different error
distributions, the setting with no mediation (𝑎 = 0, 𝑏 = 𝑐 = 0.14, 0.39, 0.59), and sample
size 𝑛 = 500. The top row contains box plots of the estimates of the indirect effect, including
a horizontal reference line for the true indirect effect 𝑎𝑏 = 0. Points outside the whiskers are
not displayed for better readability. The bottom row displays the rejection rate of the
corresponding tests (i.e., the realized size), and a horizontal line is drawn for the nominal size
𝛼 = 0.05 (the closer to this line the better). The columns correspond to the three considered