Validation in prediction research: the waste by data splitting

Steyerberg, Ewout

doi:10.1016/j.jclinepi.2018.07.010

Accurate prediction of medical outcomes is important for diagnosis and prognosis. The standard requirement in major medical journals is nowadays that validity outside the development sample needs to be shown. Is such data splitting an example of a waste of resources? In large samples, interest should shift to assessment of heterogeneity in model performance across settings. In small samples, cross-validation and bootstrapping are more efficient approaches. In conclusion, random data splitting should be abolished for validation of prediction models.

Additional Metadata
Persistent URL	doi.org/10.1016/j.jclinepi.2018.07.010, hdl.handle.net/1765/109868
Journal	Journal of Clinical Epidemiology
Organisation	Department of Public Health
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Steyerberg, E. (2018). Validation in prediction research: the waste by data splitting. Journal of Clinical Epidemiology. doi:10.1016/j.jclinepi.2018.07.010

Free Full Text ( Author Manuscript , 126kb )

Validation in prediction research: the waste by data splitting

Publication

Publication

About

Validation in prediction research: the waste by data splitting

Publication

Publication

Workflow

Workflow

Add Content