Automatic adaptive speech separation using beamformer-output-ratio for voice activity classification

Tran, T.N.; Cowley, W.; Pollok, A.

doi:10.1016/j.sigpro.2015.01.015

Date

2015

Authors

Tran, T.N.

Cowley, W.

Pollok, A.

Type

Journal article

Citation

Signal Processing, 2015; 113:259-272

DOI

10.1016/j.sigpro.2015.01.015

Abstract

This paper focuses on the practical challenge of adaptation control for speech separation systems. Adaptive beamforming methods, such as minimum variance distortionless response (MVDR), can effectively extract the desired speech signal from interference and noise. However, to avoid the signal cancellation problem, the beamformer adaptation is halted when the desired speaker is active. An automated scheme for this adaptation requires classifying speakers' voice activity status, which remains a challenge for multispeaker environments. In this paper, we propose a novel approach to identify voice activities for two speakers based on a new metric, called the beamformer-output-ratio (BOR). Statistical properties of the BOR are studied and used to develop a hypothesis-based method for voice activity classification. The method is further refined using an algorithm that detects incorrect beamformer adaptation by analysing changes in the output power of a blind adapting MVDR beamformer. Based on the new methods, we construct an automatic adaptive beamforming system to simultaneously separate speech for two speakers. The speech separation module of the system uses MVDR beamformers whose adaptation is guided by the voice activity classification. In some cases, our methods can lead to a 20% reduction in voice activity classification error and an 8 dB improvement in the output SINR. The results are verified on both synthesised signals and realistic recordings.

Published Version

https://doi.org/10.1016/j.sigpro.2015.01.015

Persistent link to this record

https://hdl.handle.net/11541.2/119576
