Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/109431
Full metadata record
dc.contributor.author: Nguyen, D.
dc.contributor.author: White, L.
dc.contributor.author: Nguyen, H.
dc.contributor.editor: Kang, B.
dc.contributor.editor: Bai, Q.
dc.date.issued: 2016
dc.identifier.citation: Lecture Notes in Artificial Intelligence, 2016 / Kang, B., Bai, Q. (eds.), vol. 9992 LNAI, pp. 29-41
dc.identifier.isbn: 9783319501260
dc.identifier.issn: 0302-9743
dc.identifier.issn: 1611-3349
dc.identifier.uri: http://hdl.handle.net/2440/109431
dc.description: LNAI 9992
dc.description.abstract: We propose a novel adaptive reinforcement learning (RL) procedure for multi-agent non-cooperative repeated games. Most existing regret-based algorithms use only positive regrets in updating their learning rules. In this paper, we adopt both positive and negative regrets in reinforcement learning to improve its convergence behaviour. We prove theoretically that the empirical distribution of the joint play converges to the set of correlated equilibria. Simulation results demonstrate that our proposed procedure outperforms the standard regret-based RL approach and a well-known state-of-the-art RL scheme in the literature in terms of both computational requirements and system fairness. Further experiments demonstrate that the performance of our solution is robust to variations in the total number of agents in the system, and that it achieves markedly better fairness than other relevant methods, especially in large-scale multi-agent systems.
dc.description.statementofresponsibility: Duong D. Nguyen, Langford B. White, and Hung X. Nguyen
dc.language.iso: en
dc.publisher: Springer
dc.relation.ispartofseries: Lecture Notes in Computer Science
dc.rights: Springer International Publishing AG 2016
dc.source.uri: http://dx.doi.org/10.1007/978-3-319-50127-7_3
dc.subject: Multiagent systems; Reinforcement learning; Game theory; Correlated equilibrium; No regret
dc.title: Adaptive multiagent reinforcement learning with non-positive regret
dc.type: Conference paper
dc.contributor.conference: 29th Australasian Joint Conference on Artificial Intelligence (AI) (5-8 Dec 2016: Hobart, Tas.)
dc.identifier.doi: 10.1007/978-3-319-50127-7_3
dc.relation.grant: http://purl.org/au-research/grants/arc/LP100200493
pubs.publication-status: Published
dc.identifier.orcid: Nguyen, D. [0000-0003-1048-5825]
dc.identifier.orcid: White, L. [0000-0001-6660-0517]
dc.identifier.orcid: Nguyen, H. [0000-0003-1028-920X]
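The abstract contrasts the proposed procedure with standard regret-based learning, which discards non-positive regrets. For background, here is a minimal sketch of that baseline: external-regret matching for a two-player repeated game. The example game, function names, and parameters are illustrative, not taken from the paper; the paper's own procedure additionally exploits the negative regrets that this baseline throws away.

```python
import random

def regret_matching_action(regrets):
    """Sample an action with probability proportional to the positive
    part of its cumulative regret; play uniformly if none is positive."""
    pos = [max(r, 0.0) for r in regrets]
    total = sum(pos)
    if total == 0.0:
        return random.randrange(len(regrets))
    x = random.uniform(0.0, total)
    acc = 0.0
    for action, p in enumerate(pos):
        acc += p
        if x <= acc:
            return action
    return len(regrets) - 1

def play_repeated_game(payoff, rounds=5000, seed=1):
    """Run external-regret matching for both players of a 2-player game.
    payoff[i][a][b] is player i's payoff when the joint action is (a, b).
    Returns the cumulative regrets and the empirical joint-play counts."""
    random.seed(seed)
    n = len(payoff[0])  # actions per player (square game assumed)
    regrets = [[0.0] * n for _ in range(2)]
    counts = {}
    for _ in range(rounds):
        a = regret_matching_action(regrets[0])
        b = regret_matching_action(regrets[1])
        counts[(a, b)] = counts.get((a, b), 0) + 1
        for alt in range(n):
            # regret of not having played `alt` instead, this round
            regrets[0][alt] += payoff[0][alt][b] - payoff[0][a][b]
            regrets[1][alt] += payoff[1][a][alt] - payoff[1][a][b]
    return regrets, counts

if __name__ == "__main__":
    # Illustrative 2x2 coordination game: payoff 1 for matching actions.
    coord = [[1.0, 0.0], [0.0, 1.0]]
    regrets, counts = play_repeated_game([coord, coord])
    # Average external regret vanishes as the number of rounds grows.
    print("worst average regret:", max(max(r) for r in regrets) / 5000)
```

Note that this external-regret variant only guarantees convergence of the empirical joint play to the Hannan set (coarse correlated equilibria); the conditional-regret variant of Hart and Mas-Colell, which the abstract's correlated-equilibrium result parallels, conditions each regret on the action actually played.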
Appears in Collections: Aurora harvest 3; Electrical and Electronic Engineering publications

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.