A calibration hierarchy for risk models was defined: From utopia to empirical data

Van Calster, Ben; Nieboer, Daan; Vergouwe, Yvonne; De Cock, Bavo; Pencina, Michael; Steyerberg, Ewout

doi:10.1016/j.jclinepi.2015.12.005


2016-06-01


Journal of Clinical Epidemiology, Volume 74, p. 167–176

Objective: Calibrated risk models are vital for valid decision support. We define four levels of calibration and describe implications for model development and external validation of predictions.

Study Design and Setting: We present results based on simulated data sets.

Results: A common definition of calibration is "having an event rate of R% among patients with a predicted risk of R%," which we refer to as "moderate calibration." Weaker forms of calibration only require the average predicted risk (mean calibration) or the average prediction effects (weak calibration) to be correct. "Strong calibration" requires that the event rate equals the predicted risk for every covariate pattern. This implies that the model is fully correct for the validation setting. We argue that this is unrealistic: the model type may be incorrect, the linear predictor is only asymptotically unbiased, and all nonlinear and interaction effects should be correctly modeled. In addition, we prove that moderate calibration guarantees nonharmful decision making. Finally, results indicate that a flexible assessment of calibration in small validation data sets is problematic.

Conclusion: Strong calibration is desirable for individualized decision support but unrealistic and counterproductive, because it stimulates the development of overly complex models. Model development and external validation should focus on moderate calibration.
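The three weaker levels named in the abstract can be illustrated on simulated data. The following is a minimal sketch, not the authors' code: it assumes that mean calibration is checked by comparing the observed event rate with the average predicted risk, that weak calibration is summarized by a calibration intercept (target 0) and calibration slope (target 1) from a logistic recalibration model, and that moderate calibration is approximated by observed event rates within quantile bins of predicted risk. The binning is a crude stand-in for a flexible (for example, loess-based) calibration curve, and the function names `calibration_summary` and `_fit_logistic` are hypothetical.

```python
import numpy as np

def _logit(p):
    p = np.clip(p, 1e-12, 1 - 1e-12)  # guard against log(0)
    return np.log(p / (1 - p))

def _fit_logistic(X, y, offset, n_iter=25):
    """Newton-Raphson logistic regression of y on X with a fixed offset."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = 1 / (1 + np.exp(-(X @ beta + offset)))
        grad = X.T @ (y - mu)                        # score vector
        hess = X.T @ (X * (mu * (1 - mu))[:, None])  # information matrix
        beta = beta + np.linalg.solve(hess, grad)
    return beta

def calibration_summary(y, p, n_bins=10):
    """Summaries for mean, weak, and (binned) moderate calibration."""
    y = np.asarray(y, dtype=float)
    p = np.asarray(p, dtype=float)
    lp = _logit(p)                                   # linear predictor
    ones = np.ones((len(y), 1))
    # Weak calibration, part 1: intercept with the linear predictor as offset.
    intercept = _fit_logistic(ones, y, offset=lp)[0]
    # Weak calibration, part 2: slope on the linear predictor.
    slope = _fit_logistic(np.column_stack([ones, lp]), y,
                          offset=np.zeros(len(y)))[1]
    # Moderate calibration (assumed proxy): observed rate per risk bin.
    groups = np.array_split(np.argsort(p), n_bins)
    curve = [(p[g].mean(), y[g].mean()) for g in groups]
    return {
        "mean_predicted": p.mean(),
        "mean_observed": y.mean(),
        "intercept": intercept,
        "slope": slope,
        "curve": curve,  # list of (mean predicted risk, observed event rate)
    }

# Simulated validation set in which the predictions equal the true risks.
rng = np.random.default_rng(0)
lp_true = rng.normal(-1.0, 1.0, size=20000)
p_true = 1 / (1 + np.exp(-lp_true))
y_sim = rng.binomial(1, p_true)

good = calibration_summary(y_sim, p_true)
# Hypothetical overfitted model: predictions that are too extreme.
overfit = calibration_summary(y_sim, 1 / (1 + np.exp(-2 * lp_true)))
print("calibrated slope:", round(good["slope"], 2))
print("overfitted slope:", round(overfit["slope"], 2))
```

In this sketch a slope well below 1 flags predictions that are too extreme, which is the typical signature of overfitting. Strong calibration is deliberately left out, since it would require checking every covariate pattern, which the abstract argues is unrealistic.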

Additional Metadata
Keywords	Calibration, Decision curve analysis, External validation, Loess, Overfitting, Risk prediction models
Persistent URL	doi.org/10.1016/j.jclinepi.2015.12.005, hdl.handle.net/1765/87511
Journal	Journal of Clinical Epidemiology
Organisation	Department of Public Health
Citation	Van Calster, B., Nieboer, D., Vergouwe, Y., De Cock, B., Pencina, M., & Steyerberg, E. (2016). A calibration hierarchy for risk models was defined: From utopia to empirical data. Journal of Clinical Epidemiology, 74, 167–176. doi:10.1016/j.jclinepi.2015.12.005
