Bagging has been found to be successful in increasing the predictive performance of unstable classifiers. Bagging draws bootstrap samples from the training sample, applies the classifier to each bootstrap sample, and then averages over all obtained classification rules. The idea of trimmed bagging is to exclude the bootstrapped classification rules that yield the highest error rates, as estimated by the out-of-bag error rate, and to aggregate over the remaining ones. In this note we explore the potential benefits of trimmed bagging. On the basis of numerical experiments, we conclude that trimmed bagging performs comparably to standard bagging when applied to unstable classifiers as decision trees, but yields better results when applied to more stable base classifiers, like support vector machines.

Aggregation, Bagging, Decision trees, Error rate, Support vector machine, Trimmed means
dx.doi.org/10.1016/j.csda.2007.06.012, hdl.handle.net/1765/66205
Computational Statistics & Data Analysis
Erasmus Research Institute of Management

Croux, C, Joossens, K, & Lemmens, A. (2007). Trimmed bagging. Computational Statistics & Data Analysis, 52(1), 362–368. doi:10.1016/j.csda.2007.06.012