Substantial effective sample sizes were required for external validation studies of predictive logistic regression models

Vergouwe, Yvonne; Steyerberg, Ewout; Eijkemans, René; Habbema, Dik

doi:10.1016/j.jclinepi.2004.06.017

Y. Vergouwe (Yvonne), E.W. Steyerberg (Ewout), M.J.C. Eijkemans (René) and J.D.F. Habbema (Dik)

2005-05-01

Substantial effective sample sizes were required for external validation studies of predictive logistic regression models

Journal of Clinical Epidemiology , Volume 58 - Issue 5 p. 475- 483

Background and Objectives: The performance of a prediction model is usually worse in external validation data compared to the development data. We aimed to determine at which effective sample sizes (i.e., number of events) relevant differences in model performance can be detected with adequate power. Methods: We used a logistic regression model to predict the probability that residual masses of patients treated for metastatic testicular cancer contained only benign tissue. We performed standard power calculations and Monte Carlo simulations to estimate the numbers of events that are required to detect several types of model invalidity with 80% power at the 5% significance level. Results: A validation sample with 111 events was required to detect that a model predicted too high probabilities, when predictions were on average 1.5 times too high on the odds scale. A decrease in discriminative ability of the model, indicated by a decrease in the c-statistic from 0.83 to 0.73, required 81 to 106 events, depending on the specific scenario. Conclusion: We suggest a minimum of 100 events and 100 nonevents for external validation samples. Specific hypotheses may, however, require substantially higher effective sample sizes to obtain adequate power.

Additional Metadata
Keywords	External validation, Performance, Prediction models, Sample size, Simulations
Persistent URL	doi.org/10.1016/j.jclinepi.2004.06.017, hdl.handle.net/1765/61773
Journal	Journal of Clinical Epidemiology
Organisation	Erasmus MC: University Medical Center Rotterdam
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Vergouwe, Y., Steyerberg, E., Eijkemans, R., & Habbema, D. (2005). Substantial effective sample sizes were required for external validation studies of predictive logistic regression models. Journal of Clinical Epidemiology, 58(5), 475–483. doi:10.1016/j.jclinepi.2004.06.017

Substantial effective sample sizes were required for external validation studies of predictive logistic regression models

Publication

Publication

About

Substantial effective sample sizes were required for external validation studies of predictive logistic regression models

Publication

Publication

Workflow

Workflow

Add Content