Q-learning agents in a Cournot oligopoly model
Journal of Economic Dynamics and Control , Volume 32 - Issue 10 p. 3275- 3293
Q-learning is a reinforcement learning model from the field of artificial intelligence. We study the use of Q-learning for modeling the learning behavior of firms in repeated Cournot oligopoly games. Based on computer simulations, we show that Q-learning firms generally learn to collude with each other, although full collusion usually does not emerge. We also present some analytical results. These results provide insight into the underlying mechanism that causes collusive behavior to emerge. Q-learning is one of the few learning models available that can explain the emergence of collusive behavior in settings in which there is no punishment mechanism and no possibility for explicit communication between firms.
|Collusion, Cournot oligopoly, Q-learning, Reinforcement learning|
|Econometric Institute Reprint Series|
|Journal of Economic Dynamics and Control|
|Organisation||Erasmus Research Institute of Management|
Waltman, L, & Kaymak, U. (2008). Q-learning agents in a Cournot oligopoly model. Journal of Economic Dynamics and Control, 32(10), 3275–3293. doi:10.1016/j.jedc.2008.01.003