PWP3D: Real-time segmentation and tracking of 3D objects

Files

RA_hdl_84216.pdf (3.01 MB)
  (Restricted Access)

Date

2012

Authors

Prisacariu, V.
Reid, I.

Type

Journal article

Citation

International Journal of Computer Vision, 2012; 98(3):335-354

Statement of Responsibility

Victor A. Prisacariu, Ian D. Reid

Abstract

We formulate a probabilistic framework for simultaneous region-based 2D segmentation and 2D to 3D pose tracking, using a known 3D model. Given such a model, we aim to maximise the discrimination between statistical foreground and background appearance models, via direct optimisation of the 3D pose parameters. The foreground region is delineated by the zero-level-set of a signed distance embedding function, and we define an energy over this region and its immediate background surroundings based on pixel-wise posterior membership probabilities (as opposed to likelihoods). We derive the differentials of this energy with respect to the pose parameters of the 3D object, meaning we can conduct a search for the correct pose using standard gradient-based non-linear minimisation techniques. We propose novel enhancements at the pixel level based on temporal consistency and improved online appearance model adaptation. Furthermore, straightforward extensions of our method lead to multi-camera and multi-object tracking as part of the same framework. The parallel nature of much of the processing in our algorithm means it is amenable to GPU acceleration, and we give details of our real-time implementation, which we use to generate experimental results on both real and artificial video sequences, with a number of 3D models. These experiments demonstrate the benefit of using pixel-wise posteriors rather than likelihoods, and showcase the qualities, such as robustness to occlusions and motion blur (and also some failure modes), of our tracker. © Springer Science+Business Media, LLC 2011.
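The abstract does not give the energy itself, so the following is only a minimal sketch of what a pixel-wise posterior energy over a level-set embedding can look like. It assumes the commonly used form in which each pixel contributes the negative log of a mixture of foreground and background posteriors, weighted by a smoothed Heaviside of the signed distance. Everything concrete here is an assumption made for illustration: the 1D "image", the scalar shift standing in for the 3D pose, the sign convention (positive inside the object), the likelihood values, and all function names. A coarse search over candidate shifts stands in for the gradient-based minimisation the abstract describes.

```python
import math


def smoothed_heaviside(phi, width=1.0):
    """Smooth step in (0, 1); close to 1 where phi > 0 (assumed foreground side)."""
    return 0.5 + math.atan(phi / width) / math.pi


def pixelwise_posterior_energy(phi, lik_fg, lik_bg):
    """Sum over pixels of -log(H * P_f + (1 - H) * P_b).

    phi    : signed distance per pixel (positive inside, an assumption)
    lik_fg : per-pixel appearance likelihood under the foreground model
    lik_bg : per-pixel appearance likelihood under the background model
    P_f and P_b are posteriors: likelihoods normalised by area-weighted evidence.
    """
    h = [smoothed_heaviside(p) for p in phi]
    eta_f = sum(h)            # soft foreground area
    eta_b = len(h) - eta_f    # soft background area
    energy = 0.0
    for h_i, p_f, p_b in zip(h, lik_fg, lik_bg):
        norm = eta_f * p_f + eta_b * p_b
        post_f = p_f / norm
        post_b = p_b / norm
        energy -= math.log(h_i * post_f + (1.0 - h_i) * post_b)
    return energy


# Toy 1D scene (hypothetical): 10 pixels, object is an interval of half-width 2.
N_PIXELS = 10
HALF_WIDTH = 2.0
TRUE_CENTRE = 4.5


def interval_signed_distance(centre):
    """Signed distance to an interval centred at `centre`, positive inside."""
    return [HALF_WIDTH - abs(x - centre) for x in range(N_PIXELS)]


# Synthetic likelihoods: pixels truly inside the object look like foreground.
true_phi = interval_signed_distance(TRUE_CENTRE)
lik_fg = [0.9 if p > 0 else 0.1 for p in true_phi]
lik_bg = [1.0 - p for p in lik_fg]


def energy_for_shift(centre):
    """Energy as a function of the scalar 'pose' (the interval centre)."""
    return pixelwise_posterior_energy(interval_signed_distance(centre), lik_fg, lik_bg)


# Coarse search over candidate centres; a stand-in for gradient-based optimisation.
candidates = [c + 0.5 for c in range(1, 8)]  # 1.5, 2.5, ..., 7.5
best_shift = min(candidates, key=energy_for_shift)
print("best centre:", best_shift)
```

On this toy data the energy is lowest when the hypothesised interval coincides with the region whose pixels look like foreground, which is the behaviour the abstract relies on when it optimises pose directly against the appearance models.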

Rights

© Springer Science+Business Media, LLC 2012
