Dealing with missing covariates in epidemiologic studies: A comparison between multiple imputation and a full Bayesian approach

Erler, Nicole; Rizopoulos, Dimitris; van Rosmalen, Joost; Jaddoe, Vincent; Franco, Oscar; Lesaffre, Emmanuel

doi:10.1002/sim.6944

N.S. Erler (Nicole), D. Rizopoulos (Dimitris), J.M. van Rosmalen (Joost), V.W.V. Jaddoe (Vincent), O.H. Franco (Oscar) and E.M.E.H. Lesaffre (Emmanuel)

2016-04-04

Statistics in Medicine

Incomplete data are generally a challenge to the analysis of most large studies. The current gold standard to account for missing data is multiple imputation, and more specifically multiple imputation with chained equations (MICE). Numerous studies have been conducted to illustrate the performance of MICE for missing covariate data. The results show that the method works well in various situations. However, less is known about its performance in more complex models, specifically when the outcome is multivariate as in longitudinal studies. In current practice, the multivariate nature of the longitudinal outcome is often neglected in the imputation procedure, or only the baseline outcome is used to impute missing covariates. In this work, we evaluate the performance of MICE using different strategies to include a longitudinal outcome into the imputation models and compare it with a fully Bayesian approach that jointly imputes missing values and estimates the parameters of the longitudinal model. Results from simulation and a real data example show that MICE requires the analyst to correctly specify which components of the longitudinal process need to be included in the imputation models in order to obtain unbiased results. The full Bayesian approach, on the other hand, does not require the analyst to explicitly specify how the longitudinal outcome enters the imputation models. It performed well under different scenarios.

Additional Metadata
Keywords	Bayesian, Epidemiology, MICE, Missing covariate values, Multiple imputation
Persistent URL	doi.org/10.1002/sim.6944, hdl.handle.net/1765/87330
Journal	Statistics in Medicine
Organisation	Erasmus MC: University Medical Center Rotterdam
Citation	Erler, N., Rizopoulos, D., van Rosmalen, J., Jaddoe, V., Franco, O., & Lesaffre, E. (2016). Dealing with missing covariates in epidemiologic studies: A comparison between multiple imputation and a full Bayesian approach. Statistics in Medicine. doi:10.1002/sim.6944
