Methods: A simulation study compared 5 tests of mediation contrasts: Wald confidence intervals (CI), percentile bootstrap CI, bias-corrected CI, Likelihood-based CI, and a test based on dummy latent variables (DLV). Comparisons across groups and across different mediators were examined and the simulations varied sample size (N=50, 100, 250, 500), and path coefficient (0, .14, .39. .59, corresponding to previous simulations of mediated effects). Models with two paths (a single mediator) and three paths (two mediators) were examined. Tests were evaluated on Type I error, power, and confidence interval coverage.
Results: Preliminary results suggest that the percentile bootstrap and likelihood based CIs may be optimal. Both the bias-corrected bootstrap and the DLV method in particular have inaccurate (too high) Type I error for contrasts of effects with a true difference of zero when one effect has a medium or large effect size. Power to detect nonzero but small differences was greatest with these two methods however. As the difference between mediated effects increases, all five tests perform with comparable power. All findings for two path effects were magnified for three path effects (two mediator models).
Conclusions: Both the percentile bootstrap and likelihood CIs offer the best balance of power and Type I error. The bias-corrected bootstrap and DLV method offer greater power but also have too high Type I error in some situations. The Wald CI test should be avoided as there are superior methods available.