Loughborough University
Leicestershire, UK
LE11 3TU
+44 (0)1509 263171

Loughborough University Institutional Repository

Please use this identifier to cite or link to this item: https://dspace.lboro.ac.uk/2134/5352

Title: Multimodal blind source separation for moving sources
Authors: Naqvi, Syed M.
Zhang, Yonggang
Chambers, Jonathon
Keywords: BSS
Multimodal signal processing
Particle filtering
3-D tracking
Beamforming
FastICA
Issue Date: 2009
Publisher: © IEEE
Citation: NAQVI, S.M., ZHANG, Y. and CHAMBERS, J.A., 2009. Multimodal blind source separation for moving sources. IN: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009), Taipei, 19-24 April, pp. 125-128.
Abstract: A novel multimodal approach is proposed to solve the problem of blind source separation (BSS) of moving sources. The challenge of BSS for moving sources is that the mixing filters are time varying, thus the unmixing filters should also be time varying, which are difficult to track in real time. In the proposed approach, the visual modality is utilized to facilitate the separation for both stationary and moving sources. The movement of the sources is detected by a 3-D tracker based on particle filtering. The full BSS solution is formed by integrating a frequency domain blind source separation algorithm and beamforming: if the sources are identified as stationary, a frequency domain BSS algorithm is implemented with an initialization derived from the visual information. Once the sources are moving, a beamforming algorithm is used to perform real time speech enhancement and provide separation of the sources. Experimental results show that by utilizing the visual modality, the proposed algorithm can not only improve the performance of the BSS algorithm and mitigate the permutation problem for stationary sources, but also provide a good BSS performance for moving sources in a low reverberant environment.
Description: This is a conference paper [© IEEE]. It is also available at: http://ieeexplore.ieee.org/. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Version: Published
DOI: 10.1109/ICASSP.2009.4959536
URI: https://dspace.lboro.ac.uk/2134/5352
ISBN: 9781424423538
ISSN: 1520-6149
Appears in Collections: Conference Papers and Contributions (Electronic, Electrical and Systems Engineering)

Files associated with this item:

File: naqvi.pdf
Size: 391.26 kB
Format: Adobe PDF

 

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.