Edinburgh Research Explorer

Recognition of overlapping speech using digital MEMS microphone arrays

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Original language: English
Title of host publication: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing
Pages: 7068-7072
Number of pages: 5
Publication status: Published - 21 Oct 2013
Event: 38th IEEE International Conference on Acoustics, Speech, and Signal Processing - Vancouver, Canada
Duration: 26 May 2013 to 31 May 2013
https://www2.securecms.com/ICASSP2013/default.asp

Conference

Conference: 38th IEEE International Conference on Acoustics, Speech, and Signal Processing
Abbreviated title: ICASSP 2013
Country: Canada
City: Vancouver
Period: 26/05/13 to 31/05/13

Abstract

This paper presents a new corpus of single and overlapping speech recorded using digital MEMS and analogue microphone arrays, together with results from speech separation and recognition experiments on this data. The corpus is a reproduction of the multi-channel Wall Street Journal audio-visual corpus (MC-WSJAV) and contains speech recorded in both a meeting room and an anechoic chamber, using two different microphone types and two different array geometries. The speech separation and recognition experiments were performed using SRP-PHAT-based speaker localisation, superdirective beamforming and multiple post-processing schemes, such as residual echo suppression and binary masking. Our simple, cMLLR-based recognition system matches the performance of state-of-the-art ASR systems on the single-speaker task and outperforms them on overlapping speech. The corpus will be made publicly available via the LDC in spring 2013.

    Research areas

  • ASR, MEMS microphones, microphone array, speech separation, WSJ

Event

38th IEEE International Conference on Acoustics, Speech, and Signal Processing

26/05/13 to 31/05/13

Vancouver, Canada

Event: Conference

ID: 112494786