Abstract
The paper is about the automatic structuring of multiparty meetings using audio information. We have used a corpus of 53 meetings, recorded using a microphone array and lapel microphones for each participant. The task was to segment meetings into a sequence of meeting actions, or phases. We have adopted a statistical approach using dynamic Bayesian networks (DBNs). Two DBN architectures were investigated: a two-level hidden Markov model (HMM) in which the acoustic observations were concatenated; and a multistream DBN in which two separate observation sequences were modelled. We have also explored the use of counter variables to constrain the number of action transitions. Experimental results indicate that the DBN architectures are an improvement over a simple baseline HMM, with the multistream DBN with counter constraints producing an action error rate of 6%.
Original language | English |
---|---|
Title of host publication | Acoustics, Speech, and Signal Processing, 2004 |
Subtitle of host publication | Proceedings. (ICASSP '04). IEEE International Conference on (Volume:5 ) |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 629-632 |
DOIs | |
Publication status | Published - 2004 |
Event | 2004 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE ICASSP) - Fairmount Queen Elizabeth Hotel, Montreal, Quebec, Canada Duration: 17 May 2004 → 21 May 2004 |
Conference
Conference | 2004 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE ICASSP) |
---|---|
Country/Territory | Canada |
City | Montreal, Quebec |
Period | 17/05/04 → 21/05/04 |