Please use this identifier to cite or link to this item:
https://hdl.handle.net/2440/84846
Type: | Thesis |
Title: | Hypergraph modeling for saliency detection and beyond. |
Author: | Li, Yao |
Issue Date: | 2014 |
School/Discipline: | School of Computer Science |
Abstract: | Salient object detection aims to locate objects that capture human attention within images. Previous approaches often pose this as a problem of image contrast analysis. In this work, we model an image as a hypergraph that utilizes a set of hyperedges to capture the contextual properties of image pixels or regions. As a result, the problem of salient object detection becomes one of finding salient vertices and hyperedges in the hypergraph. The main advantage of hypergraph modeling is that it takes into account each pixel’s (or region’s) affinity with its neighborhood as well as its separation from image background. Furthermore, we propose an alternative approach based on center-versus-surround contextual contrast analysis, which performs salient object detection by optimizing a cost-sensitive support vector machine (SVM) objective function. Experimental results on four challenging datasets demonstrate the effectiveness of the proposed approaches against the state-of-the-art approaches to salient object detection. In addition to a novel method for salient object detection, we tackle scene text detection, a challenging research problem in the both vision and document analysis community, from the saliency detection prospective. Motivated by the need to consider the widely varying forms of natural text, we propose a bottom-up approach to the problem which reflects the ‘characterness’ of an image region. In this sense our approach mirrors the move from saliency detection methods to measures of ‘objectness’. In order to measure the characterness we develop three novel cues that are tailored for character detection, and a Bayesian method for their integration. Because text is made up of sets of characters, we then design a Markov random field (MRF) model so as to exploit the inherent dependencies between characters. We experimentally demonstrate the effectiveness of our characterness cues as well as the advantage of Bayesian multi-cue integration. The proposed text detector outperforms state-of-the-art methods on a few benchmark scene text detection datasets. We also show that our measurement of ‘characterness’ is superior than state-of-the-art saliency detection models when applied to the same task. |
Advisor: | Shen, Chunhua van den Hengel, Anton John |
Dissertation Note: | Thesis (M.Eng.Sc.) -- University of Adelaide, School of Computer Science, 2014 |
Provenance: | This electronic version is made publicly available by the University of Adelaide in accordance with its open access policy for student theses. Copyright in this thesis remains with the author. This thesis may incorporate third party material which has been used by the author pursuant to Fair Dealing exceptions. If you are the owner of any included third party copyright material you wish to be removed from this electronic version, please complete the take down form located at: http://www.adelaide.edu.au/legals |
Appears in Collections: | Research Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01front.pdf | 77.28 kB | Adobe PDF | View/Open | |
02whole.pdf | 2.21 MB | Adobe PDF | View/Open | |
Permissions Restricted Access | Library staff access only | 561.06 kB | Adobe PDF | View/Open |
Restricted Restricted Access | Library staff access only | 10.88 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.