On the effect of SNR and superdirective beamforming in speaker diarisation in meetings

E. Zwyssig, S. Renals, M. Lincoln

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper examines the effect of sensor performance on speaker diarisation in meetings and investigates the use of more advanced beamforming techniques, beyond the typically employed delay-sum beamformer, for mitigating the effects of poorer sensor performance. We present superdirective beamforming and investigate how different time difference of arrival (TDOA) smoothing and beamforming techniques influence the performance of state-of-the-art diarisation systems. We produced and transcribed a new corpus of meetings recorded in the instrumented meeting room using a high-SNR analogue microphone array and a newly developed low-SNR digital MEMS microphone array (DMMA.2). This research demonstrates that TDOA smoothing has a significant effect on the diarisation error rate and that simple noise reduction and beamforming schemes suffice to overcome audio signal degradation due to the lower SNR of modern MEMS microphones.
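For readers unfamiliar with the baseline the abstract refers to, the following is a minimal sketch of TDOA estimation followed by delay-sum beamforming. It is not the authors' system: the function names `estimate_tdoa` and `delay_and_sum` are hypothetical, delays are restricted to whole samples, and neither the TDOA smoothing nor the superdirective beamformer studied in the paper is shown.

```python
def estimate_tdoa(ref, sig, max_lag):
    """Return the integer lag (in samples) at which sig best matches ref.

    Assumption: a brute-force cross-correlation search over whole-sample
    lags. A positive result means sig is a delayed copy of ref.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n, r in enumerate(ref):
            m = n + lag
            if 0 <= m < len(sig):
                score += r * sig[m]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_and_sum(channels, delays):
    """Advance each channel by its delay (in samples) and average them.

    Samples shifted in from beyond the end of a channel are treated as zero.
    """
    length = len(channels[0])
    out = [0.0] * length
    for ch, d in zip(channels, delays):
        for n in range(length):
            m = n + d
            if 0 <= m < length:
                out[n] += ch[m]
    return [v / len(channels) for v in out]
```

In this sketch, one microphone would be chosen as the reference, `estimate_tdoa` would be run against each remaining channel, and the resulting lags would be passed to `delay_and_sum` so that the aligned channels add coherently while uncorrelated noise is averaged down.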
Original language: English
Title of host publication: Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Pages: 4177-4180
Number of pages: 4
Publication status: Published - 2012
Event: ICASSP 2012 - Kyoto, Japan
Duration: 26 Mar 2012 → 30 Mar 2012

Conference

Conference: ICASSP 2012
Country: Japan
City: Kyoto
Period: 26/03/12 → 30/03/12
