9853

Expected Motion in 2x2 Symmetric Games Played by Reinforcement Learners

The figure shows the expected motion of a system where two players using the Bush–Mosteller reinforcement learning algorithm play a symmetric game. (for temptation) is the payoff a defector gets when the other player cooperates; (for reward) is the payoff obtained by both players when they both cooperate; both players obtain a payoff of (for punishment) when they both defect; and finally, (for sucker) is the payoff a cooperator gets when the other player defects. Parameter denotes both players' aspiration threshold, and is their learning rate. Noise is the probability that a player undertakes the opposite action she or he intended. The arrows represent the expected motion at various states of the system. The background is colored using the norm of the expected motion.

THINGS TO TRY

SNAPSHOTS

  • [Snapshot]
  • [Snapshot]
  • [Snapshot]
  • [Snapshot]

DETAILS

Reinforcement learners tend to repeat actions that led to satisfactory outcomes in the past, and avoid choices that resulted in unsatisfactory experiences. This behavior is one of the most widespread adaptation mechanisms in nature. This Demonstration shows the expected motion of a system where two players using the Bush–Mosteller reinforcement learning algorithm play a symmetric game. Mathematical analyses conducted by the contributors of this Demonstration show that the expected motion displayed in the figure is especially relevant to characterize the transient dynamics of the system, particularly with small learning rates, but, on the other hand, this expected motion can be misleading when studying the asymptotic behavior of the model. Further information at http://luis.izquierdo.name and http://segis.name.
Snapshot 1: stag hunt game (; ; ; ) played by two players with aspiration threshold equal to 0 and learning rate equal to 0.5; the noise is equal to 0.5, which means that players are making their decisions at random
Snapshot 2: prisoner's dilemma game (; ; ; ) played by two players with aspiration threshold equal to 2 and learning rate equal to 0.5
Snapshot 3: a game (; ; ; ) played by two players with aspiration threshold equal to 0 and learning rate equal to 0.5
Snapshot 4: a coordination game (; ; ; ) played by two players with aspiration threshold equal to 0 and learning rate equal to 0.5
    • Share:

Embed Interactive Demonstration New!

Just copy and paste this snippet of JavaScript code into your website or blog to put the live Demonstration on your site. More details »

Files require Wolfram CDF Player or Mathematica.









 
RELATED RESOURCES
Mathematica »
The #1 tool for creating Demonstrations
and anything technical.
Wolfram|Alpha »
Explore anything with the first
computational knowledge engine.
MathWorld »
The web's most extensive
mathematics resource.
Course Assistant Apps »
An app for every course—
right in the palm of your hand.
Wolfram Blog »
Read our views on math,
science, and technology.
Computable Document Format »
The format that makes Demonstrations
(and any information) easy to share and
interact with.
STEM Initiative »
Programs & resources for
educators, schools & students.
Computerbasedmath.org »
Join the initiative for modernizing
math education.
Step-by-step Solutions »
Walk through homework problems one step at a time, with hints to help along the way.
Wolfram Problem Generator »
Unlimited random practice problems and answers with built-in Step-by-step solutions. Practice online or make a printable study sheet.
Wolfram Language »
Knowledge-based programming for everyone.
Powered by Wolfram Mathematica © 2014 Wolfram Demonstrations Project & Contributors  |  Terms of Use  |  Privacy Policy  |  RSS Give us your feedback
Note: To run this Demonstration you need Mathematica 7+ or the free Mathematica Player 7EX
Download or upgrade to Mathematica Player 7EX
I already have Mathematica Player or Mathematica 7+