Abstract
We investigate approaches to accessing information from the streams of audio data that result from multi-channel recordings of meetings. The methods investigated use word-level transcriptions, and information derived from models of speaker activity and speaker turn patterns. Our experiments include spoken document retrieval for meetings, automatic structuring of meetings based on self-similarity matrices of speaker turn patterns and a simple model of speaker activity. Meeting recordings are rich in both lexical and non-lexical information; our results illustrate some novel kinds of analysis made possible by a transcribed corpus of natural meetings.
Original language | English |
---|---|
Title of host publication | 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings of the |
Subtitle of host publication | ICASSP'03 |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 744-747 |
Volume | 4 |
ISBN (Print) | 0-7803-7663-3 |
DOIs | |
Publication status | Published - 2003 |
Event | 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing - Hong Kong Exhibition and Convention Centre, Hong Kong, Hong Kong Duration: 6 Apr 2003 → 10 Apr 2003 |
Conference
Conference | 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Country/Territory | Hong Kong |
City | Hong Kong |
Period | 6/04/03 → 10/04/03 |