Supervised hashing using graph cuts and boosted decision trees

Files

RA_hdl_97945.pdf (1.01 MB)
  (Restricted Access)

Date

2015

Authors

Lin, G.
Shen, C.
van den Hengel, A.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Journal article

Citation

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015; 37(11):2317-2331

Statement of Responsibility

Guosheng Lin, Chunhua Shen, and Anton van den Hengel

Conference Name

Abstract

To build large-scale query-by-example image retrieval systems, embedding image features into a binary Hamming space provides great benefits. Supervised hashing aims to map the original features to compact binary codes that are able to preserve label based similarity in the binary Hamming space. Most existing approaches apply a single form of hash function, and an optimization process which is typically deeply coupled to this specific form. This tight coupling restricts the flexibility of those methods, and can result in complex optimization problems that are difficult to solve. In this work we proffer a flexible yet simple framework that is able to accommodate different types of loss functions and hash functions. The proposed framework allows a number of existing approaches to hashing to be placed in context, and simplifies the development of new problem-specific hashing methods. Our framework decomposes the hashing learning problem into two steps: binary code (hash bit) learning and hash function learning. The first step can typically be formulated as binary quadratic problems, and the second step can be accomplished by training a standard binary classifier. For solving large-scale binary code inference, we show how it is possible to ensure that the binary quadratic problems are submodular such that efficient graph cut methods may be used. To achieve efficiency as well as efficacy on large-scale high-dimensional data, we propose to use boosted decision trees as the hash functions, which are nonlinear, highly descriptive, and are very fast to train and evaluate. Experiments demonstrate that the proposed method significantly outperforms most state-of-the-art methods, especially on high-dimensional data.

School/Discipline

Dissertation Note

Provenance

Description

Date of Publication : 18 February 2015

Access Status

Rights

© 2015 IEEE

License

Grant ID

Call number

Persistent link to this record