Edinburgh Research Explorer

On the effect of SNR and superdirective beamforming in speaker diarisation in meetings

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Related Edinburgh Organisations

Open Access permissions

Open

Documents

Original languageEnglish
Title of host publicationAcoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Pages4177-4180
Number of pages4
DOIs
Publication statusPublished - 2012
EventICASSP 2012 - Kyoto, Japan
Duration: 26 Mar 201230 Mar 2012

Conference

ConferenceICASSP 2012
CountryJapan
CityKyoto
Period26/03/1230/03/12

Abstract

This paper examines the effect of sensor performance on speaker diarisation in meetings and investigates the use of more advanced beamforming techniques, beyond the typically employed delay-sum beamformer, for mitigating the effects of poorer sensor performance. We present super-directive beamforming and investigate how different time difference of arrival (TDOA) smoothing and beamforming techniques influence the performance of state-of-the-art diarisation systems. We produced and transcribed a new corpus of meetings recorded in the instrumented meeting room using a high SNR analogue and a newly developed low SNR digital MEMS microphone array (DMMA.2). This research demonstrates that TDOA smoothing has a significant effect on the diarisation error rate and that simple noise reduction and beamforming schemes suffice to overcome audio signal degradation due to the lower SNR of modern MEMS microphones.

Event

ICASSP 2012

26/03/1230/03/12

Kyoto, Japan

Event: Conference

Download statistics

No data available

ID: 14332348