http://hdl.handle.net/1765/7323
series: ERS-2006-006-LIS

A Theoretical Analysis of Cooperative Behavior in Multi-Agent Q-learning


Research Paper
This publication is part of collection
Related Files
asset icon
(ERS 2006 006 LIS.pdf, 0.3MB)

A number of experimental studies have investigated whether cooperative behavior may emerge in multi-agent Q-learning. In some studies cooperative behavior did emerge, in others it did not. This report provides a theoretical analysis of this issue. The analysis focuses on multi-agent Q-learning in iterated prisoner’s dilemmas. It is shown that under certain assumptions cooperative behavior may emerge when multi-agent Q-learning is applied in an iterated prisoner’s dilemma. An important consequence of the analysis is that multi-agent Q-learning may result in non-Nash behavior. It is found experimentally that the theoretical results derived in this report are quite robust to violations of the underlying assumptions.



Keywords


Classifications using Journal of Economic Literature (JEL) Classification System
Automatically Extracted Terms
  • agent
  • strategy
  • q-learning
  • behavior
  • value
  • multi-agent q-learning
  • iterated prisoner
  • prisoner
  • assumption
  • action
  • multi-agent
  • dilemma
  • boltzmann strategy
  • state
  • analysis
  • result
  • payoff
  • defect
  • experiment
  • probability