Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation

Kazemi Moghaddam, M.; Wu, Q.; Abbasnejad, E.; Shi, J.

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/135896

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kazemi Moghaddam, M.	-
dc.contributor.author	Wu, Q.	-
dc.contributor.author	Abbasnejad, E.	-
dc.contributor.author	Shi, J.	-
dc.date.issued	2021	-
dc.identifier.citation	Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV 2021), 2021, pp.3732-3741	-
dc.identifier.isbn	9780738142661	-
dc.identifier.issn	2472-6737	-
dc.identifier.uri	https://hdl.handle.net/2440/135896	-
dc.description.abstract	We humans can impeccably search for a target object, given its name only, even in an unseen environment. We argue that this ability is largely due to three main reasons: the incorporation of prior knowledge (or experience), its adaptation to the new environment using the observed visual cues and, most importantly, optimistic searching without giving up early. This is currently missing in the state-of-the-art visual navigation methods based on Reinforcement Learning (RL). In this paper, we propose to use externally learned prior knowledge of the relative object locations and integrate it into our model by constructing a neural graph. In order to efficiently incorporate the graph without increasing the state-space complexity, we propose a Graph-based Value Estimation (GVE) module. GVE provides a more accurate baseline for estimating the Advantage function in actor-critic RL algorithms. This results in reduced value estimation error and, consequently, convergence to a more optimal policy. Through empirical studies, we show that our agent, dubbed the optimistic agent, has a more realistic estimate of the state value during a navigation episode, which leads to a higher success rate. Our extensive ablation studies show the efficacy of our simple method, which achieves state-of-the-art results measured by the conventional visual navigation metrics, e.g. Success Rate (SR) and Success weighted by Path Length (SPL), in the AI2THOR environment.	-
dc.description.statementofresponsibility	Mahdi Kazemi Moghaddam, Qi Wu, Ehsan Abbasnejad and Javen Shi	-
dc.language.iso	en	-
dc.publisher	IEEE	-
dc.relation.ispartofseries	IEEE Winter Conference on Applications of Computer Vision	-
dc.rights	©2021 IEEE	-
dc.source.uri	https://ieeexplore.ieee.org/xpl/conhome/9423008/proceeding	-
dc.subject	cs.CV	-
dc.title	Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation	-
dc.type	Conference paper	-
dc.contributor.conference	IEEE Winter Conference on Applications of Computer Vision (WACV) (5 Jan 2021 - 9 Jan 2021 : virtual online)	-
dc.identifier.doi	10.1109/WACV48630.2021.00378	-
dc.publisher.place	online	-
pubs.publication-status	Published	-
dc.identifier.orcid	Kazemi Moghaddam, M. [0000-0001-6544-1120]	-
dc.identifier.orcid	Wu, Q. [0000-0003-3631-256X]	-
dc.identifier.orcid	Shi, J. [0000-0002-9126-2107]	-
Appears in Collections:	Aurora harvest 8 Computer Science publications
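
The abstract states that GVE supplies a more accurate baseline for estimating the Advantage function in actor-critic RL. As a rough illustration of what a value baseline does, the sketch below computes discounted returns and subtracts a critic's value estimates from them. It is a generic textbook-style example and not the paper's GVE module: the function names, the reward sequence, and the value estimates are all hypothetical, and the graph component is not modelled.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float,
                       bootstrap: float = 0.0) -> List[float]:
    """Compute R_t = r_t + gamma * R_{t+1}, seeded with a bootstrap value."""
    returns: List[float] = []
    running = bootstrap
    # Walk the episode backwards so each return reuses the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def advantages(rewards: List[float], values: List[float], gamma: float,
               bootstrap: float = 0.0) -> List[float]:
    """Compute A_t = R_t - V(s_t), using the value estimates as the baseline."""
    returns = discounted_returns(rewards, gamma, bootstrap)
    return [ret - value for ret, value in zip(returns, values)]


# Hypothetical three-step episode: the target is reached on the last step.
rewards = [0.0, 0.0, 1.0]
gamma = 0.5

# A critic that underestimates every state (assumed values, for illustration).
pessimistic_values = [0.0, 0.0, 0.0]
# A critic whose estimates match the realised returns exactly.
accurate_values = [0.25, 0.5, 1.0]

print(advantages(rewards, pessimistic_values, gamma))
print(advantages(rewards, accurate_values, gamma))
```

In this toy episode the advantages under the accurate critic are all zero, while the underestimating critic leaves the full return as residual error. That residual is the kind of value estimation error the abstract says a better baseline reduces.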

Files in This Item:

There are no files associated with this item.

Adelaide Research & Scholarship