Observational and reinforcement pattern learning: an exploratory study

Hanaki, N.; Kirman, A.; Pezanis-Christou, P.

doi:10.1016/j.euroecorev.2018.01.009

Observational and reinforcement pattern learning: an exploratory study

dc.contributor.author	Hanaki, N.
dc.contributor.author	Kirman, A.
dc.contributor.author	Pezanis-Christou, P.
dc.date.issued	2018
dc.description.abstract	Understanding how individuals learn in an unknown environment is an important problem in economics. We model and examine experimentally behavior in a very simple multi-armed bandit framework in which participants do not know the inter-temporal payoff structure. We propose a baseline reinforcement learning model that allows for pattern-recognition and change in the strategy space. We also analyse three augmented versions that accommodate observational learning from the actions and/or payoffs of another player. The models successfully reproduce the distributional properties of observed discovery times and total payoffs. Our study further shows that when one of the pair discovers the hidden pattern, observing another’s actions and/or payoffs improves discovery time compared to the baseline case.
dc.description.statementofresponsibility	Nobuyuki Hanaki, Alan Kirman, Paul Pezanis-Christou
dc.identifier.citation	European Economic Review, 2018; 104:1-21
dc.identifier.doi	10.1016/j.euroecorev.2018.01.009
dc.identifier.issn	0014-2921
dc.identifier.issn	1873-572X
dc.identifier.orcid	Pezanis-Christou, P. [0000-0001-6521-4139]
dc.identifier.uri	https://hdl.handle.net/2440/132302
dc.language.iso	en
dc.publisher	Elsevier BV
dc.relation.grant	http://purl.org/au-research/grants/arc/DP140102949
dc.rights	© 2018 Published by Elsevier B.V.
dc.source.uri	https://doi.org/10.1016/j.euroecorev.2018.01.009
dc.subject	Multi-armed bandit; reinforcement learning; payoff patterns; observational learning
dc.title	Observational and reinforcement pattern learning: an exploratory study
dc.type	Journal article
pubs.publication-status	Published

Collections

Economics publications

Observational and reinforcement pattern learning: an exploratory study

Files

Collections