Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/105003
Citations
Scopus Web of Science® Altmetric
?
?
Type: Journal article
Title: Compositional model based Fisher vector coding for image classification
Author: Liu, L.
Wang, P.
Shen, C.
Wang, L.
Van Den Hengel, A.
Wang, C.
Shen, H.T.
Citation: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017; 39(12):2335-2348
Publisher: IEEE
Issue Date: 2017
ISSN: 0162-8828
2160-9292
Statement of
Responsibility: 
Lingqiao Liu, Peng Wang, Chunhua Shen, Lei Wang, Anton van den Hengel, Chao Wang, Heng Tao Shen
Abstract: Deriving from the gradient vector of a generative model of local features, Fisher vector coding (FVC) has been identified as an effective coding method for image classification. Most, if not all, FVC implementations employ the Gaussian mixture model (GMM) as the generative model for local features. However, the representative power of a GMM can be limited because it essentially assumes that local features can be characterized by a fixed number of feature prototypes, and the number of prototypes is usually small in FVC. To alleviate this limitation, in this work, we break the convention which assumes that a local feature is drawn from one of a few Gaussian distributions. Instead, we adopt a compositional mechanism which assumes that a local feature is drawn from a Gaussian distribution whose mean vector is composed as a linear combination of multiple key components, and the combination weight is a latent random variable. In doing so we greatly enhance the representative power of the generative model underlying FVC. To implement our idea, we design two particular generative models following this compositional approach. In our first model, the mean vector is sampled from the subspace spanned by a set of bases and the combination weight is drawn from a Laplace distribution. In our second model, we further assume that a local feature is composed of a discriminative part and a residual part. As a result, a local feature is generated by the linear combination of discriminative part bases and residual part bases. The decomposition of the discriminative and residual parts is achieved via the guidance of a pre-trained supervised coding method. By calculating the gradient vector of the proposed models, we derive two new Fisher vector coding strategies. The first is termed Sparse Coding-based Fisher Vector Coding (SCFVC) and can be used as the substitute of traditional GMM based FVC. The second is termed Hybrid Sparse Coding-based Fisher vector coding (HSCFVC) since it combines the merits of both pre-trained supervised coding methods and FVC. Using pre-trained Convolutional Neural Network (CNN) activations as local features, we experimentally demonstrate that the proposed methods are superior to traditional GMM based FVC and achieve state-of-the-art performance in various image classification tasks.
Rights: © 2017 IEEE.
DOI: 10.1109/TPAMI.2017.2651061
Grant ID: http://purl.org/au-research/grants/arc/FT120100969
Published version: http://dx.doi.org/10.1109/tpami.2017.2651061
Appears in Collections:Aurora harvest 3
Electrical and Electronic Engineering publications

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.