Hierarchical Bayesian formulations for selecting variables in regression models

Rockova, Veronika; Lesaffre, Emmanuel; Luime, Jolanda; Löwenberg, Bob

doi:10.1002/sim.4439

V. Rockova (Veronika), E.M.E.H. Lesaffre (Emmanuel), J.J. Luime (Jolanda) and B. Löwenberg (Bob)

2012-05-01

Hierarchical Bayesian formulations for selecting variables in regression models

Statistics in Medicine , Volume 31 - Issue 11-12 p. 1221- 1237

The objective of finding a parsimonious representation of the observed data by a statistical model that is also capable of accurate prediction is commonplace in all domains of statistical applications. The parsimony of the solutions obtained by variable selection is usually counterbalanced by a limited prediction capacity. On the other hand, methodologies that assure high prediction accuracy usually lead to models that are neither simple nor easily interpretable. Regularization methodologies have proven to be useful in addressing both prediction and variable selection problems. The Bayesian approach to regularization constitutes a particularly attractive alternative as it is suitable for high-dimensional modeling, offers valid standard errors, and enables simultaneous estimation of regression coefficients and complexity parameters via computationally efficient MCMC techniques. Bayesian regularization falls within the versatile framework of Bayesian hierarchical models, which encompasses a variety of other approaches suited for variable selection such as spike and slab models and the MC 3 approach. In this article, we review these Bayesian developments and evaluate their variable selection performance in a simulation study for the classical small p large n setting. The majority of the existing Bayesian methodology for variable selection deals only with classical linear regression. Here, we present two applications in the contexts of binary and survival regression, where the Bayesian approach was applied to select markers prognostically relevant for the development of rheumatoid arthritis and for overall survival in acute myeloid leukemia patients.

Additional Metadata
Keywords	Bayesian regularization, MC 3, Probit regression, Spike and slab, Weibull regression
Persistent URL	doi.org/10.1002/sim.4439, hdl.handle.net/1765/66517
Journal	Statistics in Medicine
Organisation	Rheumatology
Citation APA APA Style APA-ALL Style AAA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Rockova, V., Lesaffre, E., Luime, J.& Löwenberg, B. (2012). Hierarchical Bayesian formulations for selecting variables in regression models. Statistics in Medicine, 31(11-12), 1221–1237.https://doi.org/10.1002/sim.4439

Hierarchical Bayesian formulations for selecting variables in regression models

Publication

Publication

About

Hierarchical Bayesian formulations for selecting variables in regression models

Publication

Publication

Workflow

Workflow

Add Content