Aggregating published prediction models with individual participant data: A comparison of different approaches

Debray, Thomas P.A.; Koffijberg, Hendrik; Vergouwe, Yvonne; Moons, Karel; Steyerberg, Ewout

doi:10.1002/sim.5412

T.P. Debray (Thomas P.A.), H. Koffijberg (Hendrik), Y. Vergouwe (Yvonne), K.G.M. Moons (Karel) and E.W. Steyerberg (Ewout)

2012-10-15

Aggregating published prediction models with individual participant data: A comparison of different approaches

Statistics in Medicine , Volume 31 - Issue 23 p. 2697- 2712

During the recent decades, interest in prediction models has substantially increased, but approaches to synthesize evidence from previously developed models have failed to keep pace. This causes researchers to ignore potentially useful past evidence when developing a novel prediction model with individual participant data (IPD) from their population of interest. We aimed to evaluate approaches to aggregate previously published prediction models with new data. We consider the situation that models are reported in the literature with predictors similar to those available in an IPD dataset. We adopt a two-stage method and explore three approaches to calculate a synthesis model, hereby relying on the principles of multivariate meta-analysis. The former approach employs a naive pooling strategy, whereas the latter accounts for within-study and between-study covariance. These approaches are applied to a collection of 15 datasets of patients with traumatic brain injury, and to five previously published models for predicting deep venous thrombosis. Here, we illustrated how the generally unrealistic assumption of consistency in the availability of evidence across included studies can be relaxed. Results from the case studies demonstrate that aggregation yields prediction models with an improved discrimination and calibration in a vast majority of scenarios, and result in equivalent performance (compared with the standard approach) in a small minority of situations. The proposed aggregation approaches are particularly useful when few participant data are at hand. Assessing the degree of heterogeneity between IPD and literature findings remains crucial to determine the optimal approach in aggregating previous evidence into new prediction models.

Additional Metadata
Keywords	Bayesian inference, Logistic regression, Meta-analysis, Multivariable, Prediction models, Prediction research
Persistent URL	doi.org/10.1002/sim.5412, hdl.handle.net/1765/37395
Journal	Statistics in Medicine
Organisation	Erasmus MC: University Medical Center Rotterdam
Citation APA APA Style APA-ALL Style AAA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Debray, T. P. A., Koffijberg, H., Vergouwe, Y., Moons, K.& Steyerberg, E. (2012). Aggregating published prediction models with individual participant data: A comparison of different approaches. Statistics in Medicine, 31(23), 2697–2712.https://doi.org/10.1002/sim.5412

Aggregating published prediction models with individual participant data: A comparison of different approaches

Publication

Publication

About

Aggregating published prediction models with individual participant data: A comparison of different approaches

Publication

Publication

Workflow

Workflow

Add Content