Bayesian integration of audio and visual information for multi-target tracking using a CB-member filter

Files

RA_hdl_70434.pdf (270.75 KB)
  (Restricted Access)

Date

2011

Authors

Hoseinnezhad, R.
Vo, B.
Vo, B.
Suter, D.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Conference paper

Citation

2011 IEEE International Conference on Acoustics, Speech, and Signal Processing : Proceedings, 2011: pp.2300-2303

Statement of Responsibility

Reza Hoseinnezhad, Ba-Ngu Vo, Ba-Tuong Vo, David Suter

Conference Name

IEEE International Conference on Acoustics, Speech and Signal Processing (36th : 2011 : Prague, Czech Republic)

Abstract

A new method is presented for integration of audio and visual information in multiple target tracking applications. The proposed approach uses a Bayesian filtering formulation and exploits multi-Bernoulli random finite set approximations. The work presented in this paper is the first principled Bayesian estimation approach to solve the sensor fusion problems that involve intermittent sensory data (e.g. audio data for a person who occasionally speaks.) We have examined our method with case studies from the SPEVI database. The results show nearly perfect tracking of people not only when they are silent but also when they are not visible to the camera (but speaking).

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

Copyright ©2011 by the Institute of Electrical and Electronics Engineers, Inc. All rights reserved.

License

Grant ID

Call number

Persistent link to this record