Projects per year
Abstract
This paper introduces a method for automatic alignment of speech data with unsynchronised, imperfect transcripts, for a domain where no initial acoustic models are available. Using grapheme-based acoustic models, word skip networks and orthographic speech transcripts, we are able to harvest 55% of the speech with a 93% utterance-level accuracy and 99% word accuracy for the produced transcriptions. The work is based on the assumption that there is a high degree of correspondence between the speech and text, and that a full transcription of all of the speech is not required. The method is language independent and the only prior knowledge and resources required are the speech and text transcripts, and a few minor user interventions.
Original language | English |
---|---|
Title of host publication | Spoken Language Technology Workshop (SLT), 2012 IEEE |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 286-290 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-4673-5125-6 |
ISBN (Print) | 978-1-4673-5124-9 |
DOIs | |
Publication status | Published - 2012 |
Keywords / Materials (for Non-textual outputs)
- acoustic signal processing
- natural language processing
- speech synthesis
- text analysis
- word processing
- automatic speech data alignment
- automatic text data alignment
- grapheme-based acoustic models
- language independent method
- orthographic speech transcripts
- text transcription
- unsynchronised-imperfect transcripts
- utterance-level accuracy
- word accuracy
- word skip networks
- Acoustics
- Data models
- Error analysis
- Hidden Markov models
- Speech
- Speech recognition
- Training
- grapheme-based models
- imperfect transcripts
- speech alignment
- word networks
Fingerprint
Dive into the research topics of 'A grapheme-based method for automatic alignment of speech and text data'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Simple4All: Speech synthesis that improves through adaptive learning
King, S. (Principal Investigator) & Renals, S. (Co-investigator)
1/11/11 → 31/10/14
Project: Research
File -
HELP4MOOD:A computational distributed system to support the treatment of patients with major depression
Matheson, C. (Principal Investigator) & Wolters, M. (Co-Investigator (External))
1/01/11 → 30/06/14
Project: Research
Activities
- 1 Invited talk
-
EACL 2014 keynote: Speech synthesis needs YOU!
Simon King (Speaker)
29 Apr 2014Activity: Academic talk or presentation types › Invited talk
File