Operations research and management science are often confronted with sequential decision making problems with large state spaces. Standard methods that are used for solving such complex problems are associated with some difficulties. As we discuss in this article, these methods are plagued by the so-called curse of dimensionality and the curse of modelling. In this article, we discuss reinforcement learning, a machine learning technique for solving sequential decision making problems with large state spaces. We describe how reinforcement learning can be combined with a function approximation method to avoid both the curse of dimensionality and the curse of modelling. To illustrate the usefulness of this approach, we apply it to a problem with a huge state space-learning to play the game of Othello. We describe experiments in which reinforcement learning agents learn to play the game of Othello without the use of any knowledge provided by human experts. It turns out that the reinforcement learning agents learn to play the game of Othello better than players that use basic strategies.

, , , , , , ,
doi.org/10.1016/j.cor.2006.10.004, hdl.handle.net/1765/76386
Computers & Operations Research
Erasmus Research Institute of Management

van Eck, N. J., & van Wezel, M. (2008). Application of reinforcement learning to the game of Othello. Computers & Operations Research, 35(6), 1999–2017. doi:10.1016/j.cor.2006.10.004