Multi-modality Data Analysis Using Deep Reinforcement Learning

Wang, Hu

Multi-modality Data Analysis Using Deep Reinforcement Learning

dc.contributor.advisor	Shen, Chunhua
dc.contributor.advisor	Wu, Qi
dc.contributor.author	Wang, Hu
dc.contributor.school	School of Computer Science	en
dc.date.issued	2021
dc.description.abstract	Deep Reinforcement Learning (DRL) is a set of algorithms to deal with sequential decision problems. It mimics human learning behaviours through trial-and-error. Deep Reinforcement Learning has been developing steadily after decades of research and development. It has a wide range of applications that can deal with multi-modality data, such as robot navigation, traffic signal control, self-driving vehicles and etc. Recently, many impressive results have been achieved. However, it is not trivial to design a good state space, action space and reward function for reinforcement learning models to tackle a specific task. In this thesis, we propose a series of novel techniques to cope with challenging multi-modality data based on the idea of deep reinforcement learning. We first proposed a DRL solution to alleviate the traffic congestion problem in urban areas. To achieve this, we introduce a DRL model with an edge-weighted graph convolutional encoder and unified structure decoder with states, actions and reward functions settings. Additionally, we also propose a new dataset and settings, including synthetic and real traffic data in more complex scenarios. Behaviour cloning techniques directly copy the demonstrations from experts and it will inevitably incur worse performance on unseen environments due to error accumulation. Reinforcement learning based methods have better generalisation ability, but designing suitable reward functions towards a specific task is expensive. We thus propose a generic soft expert reward learning strategy to distil the expert’s behaviour directly and successfully apply it on Vision-and-Language Navigation. The experimental results show the effectiveness of the model. Finally, inspired by the exploration and expert’s behaviour distillation of deep reinforcement learning, we propose a generic random network prediction strategy on anomaly detection and clustering tasks in an unsupervised learning manner. We propose to learn important features without using any labelled data by training neural networks to predict data distances in a randomly projected space.	en
dc.description.dissertation	Thesis (Ph.D.) -- University of Adelaide, School of Computer Science, 2021	en
dc.identifier.uri	https://hdl.handle.net/2440/134156
dc.language.iso	en	en
dc.provenance	This electronic version is made publicly available by the University of Adelaide in accordance with its open access policy for student theses. Copyright in this thesis remains with the author. This thesis may incorporate third party material which has been used by the author pursuant to Fair Dealing exceptions. If you are the owner of any included third party copyright material you wish to be removed from this electronic version, please complete the take down form located at: http://www.adelaide.edu.au/legals	en
dc.subject	Deep learning	en
dc.subject	reinforcement learning	en
dc.subject	multi-modality	en
dc.title	Multi-modality Data Analysis Using Deep Reinforcement Learning	en
dc.type	Thesis	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Wang2021_PhD.pdf
Size:: 4.61 MB
Format:: Adobe Portable Document Format
Description:: Thesis

Download

Collections

Research Theses