Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/84846
Type: Thesis
Title: Hypergraph modeling for saliency detection and beyond.
Author: Li, Yao
Issue Date: 2014
School/Discipline: School of Computer Science
Abstract: Salient object detection aims to locate objects that capture human attention within images. Previous approaches often pose this as a problem of image contrast analysis. In this work, we model an image as a hypergraph that utilizes a set of hyperedges to capture the contextual properties of image pixels or regions. As a result, the problem of salient object detection becomes one of finding salient vertices and hyperedges in the hypergraph. The main advantage of hypergraph modeling is that it takes into account each pixel’s (or region’s) affinity with its neighborhood as well as its separation from image background. Furthermore, we propose an alternative approach based on center-versus-surround contextual contrast analysis, which performs salient object detection by optimizing a cost-sensitive support vector machine (SVM) objective function. Experimental results on four challenging datasets demonstrate the effectiveness of the proposed approaches against the state-of-the-art approaches to salient object detection. In addition to a novel method for salient object detection, we tackle scene text detection, a challenging research problem in the both vision and document analysis community, from the saliency detection prospective. Motivated by the need to consider the widely varying forms of natural text, we propose a bottom-up approach to the problem which reflects the ‘characterness’ of an image region. In this sense our approach mirrors the move from saliency detection methods to measures of ‘objectness’. In order to measure the characterness we develop three novel cues that are tailored for character detection, and a Bayesian method for their integration. Because text is made up of sets of characters, we then design a Markov random field (MRF) model so as to exploit the inherent dependencies between characters. We experimentally demonstrate the effectiveness of our characterness cues as well as the advantage of Bayesian multi-cue integration. The proposed text detector outperforms state-of-the-art methods on a few benchmark scene text detection datasets. We also show that our measurement of ‘characterness’ is superior than state-of-the-art saliency detection models when applied to the same task.
Advisor: Shen, Chunhua
van den Hengel, Anton John
Dissertation Note: Thesis (M.Eng.Sc.) -- University of Adelaide, School of Computer Science, 2014
Provenance: This electronic version is made publicly available by the University of Adelaide in accordance with its open access policy for student theses. Copyright in this thesis remains with the author. This thesis may incorporate third party material which has been used by the author pursuant to Fair Dealing exceptions. If you are the owner of any included third party copyright material you wish to be removed from this electronic version, please complete the take down form located at: http://www.adelaide.edu.au/legals
Appears in Collections:Research Theses

Files in This Item:
File Description SizeFormat 
01front.pdf77.28 kBAdobe PDFView/Open
02whole.pdf2.21 MBAdobe PDFView/Open
Permissions
  Restricted Access
Library staff access only561.06 kBAdobe PDFView/Open
Restricted
  Restricted Access
Library staff access only10.88 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.