Mining Mid-level Visual Patterns with Deep CNN Activations

Files

RA_hdl_104453.pdf (7.13 MB)
  (Restricted Access)

Date

2017

Authors

Li, Y.
Liu, L.
Shen, C.
van den Hengel, A.

Type

Journal article

Citation

International Journal of Computer Vision, 2017; 121(3):344-364

Statement of Responsibility

Yao Li, Lingqiao Liu, Chunhua Shen, Anton van den Hengel

Abstract

The purpose of mid-level visual element discovery is to find clusters of image patches that are representative of, and discriminate between, the contents of the relevant images. Here we propose a pattern-mining approach to the problem of identifying mid-level elements within images, motivated by the observation that such techniques have been both effective and efficient in achieving similar goals when applied to other data types. We show that Convolutional Neural Network (CNN) activations extracted from image patches typically possess two appealing properties that enable seamless integration with pattern mining techniques. The marriage between CNN activations and a pattern mining technique leads to fast and effective discovery of representative and discriminative patterns from a huge number of image patches, from which mid-level elements are retrieved. Given the patterns and retrieved mid-level visual elements, we propose two methods to generate image feature representations. The first encoding method uses the patterns as codewords in a dictionary, in a manner similar to the Bag-of-Visual-Words model; we thus label this a Bag-of-Patterns representation. The second relies on mid-level visual elements to construct a Bag-of-Elements representation. We evaluate the two encoding methods on object and scene classification tasks, and demonstrate that our approach outperforms or matches the state of the art on these tasks.
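
As an illustration only, the Bag-of-Patterns idea in the abstract can be sketched as a histogram that counts, for each mined pattern, how many patches of an image contain it. The abstract does not say how activations are turned into items for pattern mining, so the top-k binarization below is an assumption, as are the names to_transaction and bag_of_patterns, the value of k, and the toy activations and patterns. This is a minimal sketch under those assumptions, not the authors' implementation.

```python
def to_transaction(activation, k=2):
    """Turn one patch's activation vector into a set of item indices.

    Assumption: keep the indices of the k largest activations.
    The abstract does not specify this step.
    """
    order = sorted(range(len(activation)), key=lambda i: activation[i])
    return frozenset(order[-k:])


def bag_of_patterns(patch_activations, patterns, k=2):
    """Histogram over patterns: entry j counts the patches whose
    transaction contains every item of pattern j."""
    histogram = [0] * len(patterns)
    for activation in patch_activations:
        items = to_transaction(activation, k)
        for j, pattern in enumerate(patterns):
            if pattern <= items:  # pattern is a subset of the patch's items
                histogram[j] += 1
    return histogram


# Toy data (hypothetical): three patches with 4-dimensional activations.
patches = [
    [0.9, 0.0, 0.8, 0.1],
    [0.7, 0.0, 0.6, 0.0],
    [0.0, 0.5, 0.0, 0.9],
]
# Hypothetical mined patterns, each a set of activation dimensions.
patterns = [frozenset({0, 2}), frozenset({1, 3}), frozenset({0, 3})]

histogram = bag_of_patterns(patches, patterns)
print(histogram)
```

The resulting histogram plays the role that a codeword histogram plays in the Bag-of-Visual-Words model, with mined patterns standing in for codewords.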

Rights

© Springer Science+Business Media New York 2016
