Abstract
This paper develops statistical models of prosodic features for generating linguistic meta-data for spoken language. In particular, we address the automatic punctuation of the output of a broadcast news speech recogniser. We present a statistical finite state model that combines prosodic, linguistic and punctuation class features. Experimental results are presented using the Hub-4 Broadcast News corpus, and in light of these results we discuss what would constitute a suitable method of evaluating this task.
Original language | English |
---|---|
Title of host publication | Proceedings of the ITRW on Prosody in Speech Recognition and Understanding |
Subtitle of host publication | Prosody 2001 |
Publisher | ISCA |
Number of pages | 6 |
Publication status | Published - 2001 |
Event | ITRW on Prosody in Speech Recognition and Understanding (Prosody 2001) - Molly Pitcher Inn, Red Bank, NJ, United States. Duration: 22 Oct 2001 → 24 Oct 2001 |
Workshop
Workshop | ITRW on Prosody in Speech Recognition and Understanding (Prosody 2001) |
---|---|
Country/Territory | United States |
City | Red Bank, NJ |
Period | 22/10/01 → 24/10/01 |