Incorporating grouping information in bayesian variable selection with applications in genomics

Rockova, Veronika; Lesaffre, Emmanuel

doi:10.1214/13-BA846

V. Rockova (Veronika) and E.M.E.H. Lesaffre (Emmanuel)

2014

Incorporating grouping information in bayesian variable selection with applications in genomics

Bayesian Analysis , Volume 9 - Issue 1 p. 221- 258

In many applications it is of interest to determine a limited number of important explanatory factors (representing groups of potentially overlapping predictors) rather than original predictor variables. The often imposed require-ment that the clustered predictors should enter the model simultaneously may be limiting as not all the variables within a group need to be associated with the out-come. Within-group sparsity is often desirable as well. Here we propose a Bayesian variable selection method, which uses the grouping information as a means of in-troducing more equal competition to enter the model within the groups rather than as a source of strict regularization constraints. This is achieved within the context of Bayesian LASSO (least absolute shrinkage and selection operator) by allowing each regression coefficient to be penalized differentially and by considering an additional regression layer to relate individual penalty parameters to a group identification matrix. The proposed hierarchical model therefore enables inference simultaneously on two levels: (1) the regression layer for the continuous outcome in relation to the predictors and (2) the regression layer for the penalty param-eters in relation to the grouping information. Both situations with overlapping and non-overlapping groups are applicable. The method does not assume within-group homogeneity across the regression coefficients, which is implicit in many structured penalized likelihood approaches. The smoothness here is enforced at the penalty level rather than within the regression coefficients. To enhance the potential of the proposed method we develop two rapid computational procedures based on the expectation maximization (EM) algorithm, which offer substantial time savings in applications where the high-dimensionality renders Markov chain Monte Carlo (MCMC) approaches less practical. We demonstrate the usefulness of our method in predicting time to death in glioblastoma patients using pathways of genes.

Additional Metadata
Keywords	Bayesian LASSO, Bayesian shrinkage estimation, EM algorithm, Minorization-maximization
Persistent URL	doi.org/10.1214/13-BA846, hdl.handle.net/1765/64425
Journal	Bayesian Analysis
Organisation	Department of Biostatistics
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Rockova, V., & Lesaffre, E. (2014). Incorporating grouping information in bayesian variable selection with applications in genomics. Bayesian Analysis, 9(1), 221–258. doi:10.1214/13-BA846

Incorporating grouping information in bayesian variable selection with applications in genomics

Publication

Publication

About

Incorporating grouping information in bayesian variable selection with applications in genomics

Publication

Publication

Workflow

Workflow

Add Content