Being able to accurately predict what a customer will purchase next is of paramount importance to successful online retailing. In practice, customer purchase history data is readily available to make such predictions, sometimes complemented with customer characteristics. Given the large assortments maintained by online retail- ers, scalability of the prediction method is just as important as its accuracy. We study two classes of models that use such data to predict what a customer will buy next: A novel approach that uses latent Dirichlet allocation (LDA), and mixtures of Dirichlet-Multinomials (MDM). A key benefit of a model-based approach is the potential to accommodate observed customer heterogeneity through the inclusion of predictor variables. We show that LDA can be extended in this direction while retaining its scalability. We apply the models to purchase data from an online re- tailer and contrast their predictive performance with that of a collaborative filter and a discrete choice model. Both LDA and MDM outperform the other meth- ods. Moreover, LDA attains performance similar to that of MDM while being far more scalable, rendering it a promising approach to purchase prediction in large assortments.

, , , , ,
hdl.handle.net/1765/51434
ERIM Report Series Research in Management
ERIM report series research in management Erasmus Research Institute of Management
Erasmus Research Institute of Management

Jacobs, B., Donkers, B., & Fok, D. (2016). Model-based Purchase Predictions for Large Assortments (No. ERS-2014-007-MKT). ERIM report series research in management Erasmus Research Institute of Management. Retrieved from http://hdl.handle.net/1765/51434