Abstract / Description of output
We compare a series of time compression methods applied to normal and
clear speech. First we evaluate a linear (uniform) method applied to
these styles as well as to naturally-produced fast speech. We found, in
line with the literature, that unprocessed fast speech was less
intelligible than linearly compressed normal speech. Fast speech was
also less intelligible than compressed clear speech, but at the highest
rate (three times faster than normal) the advantage of clear over fast
speech was lost. To test whether this was due to shorter speech duration,
we evaluated, in our second experiment, a range of methods that compress
speech and silence at different rates. We found that even when the
overall duration of speech and silence is kept the same across styles,
compressed normal speech is still more intelligible than compressed
clear speech. Compressing silence twice as much as speech improved
results further for normal speech at very little additional
computational cost.
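
As an illustration of the arithmetic behind compressing speech and silence at different rates, the sketch below computes per-segment compression factors that reach a chosen overall rate while silence is compressed a fixed multiple of the speech factor. It is not taken from the paper: the function name `compression_factors`, its parameters, and the reading of "twice as much" as a silence factor equal to twice the speech factor are all assumptions made for this example.

```python
def compression_factors(speech_s, silence_s, overall_rate, silence_ratio=2.0):
    """Return (speech_factor, silence_factor) for non-uniform time compression.

    Hypothetical helper, not the authors' implementation.
    speech_s, silence_s : total speech and silence durations in seconds
    overall_rate        : desired overall speed-up (2.0 halves the total duration)
    silence_ratio       : assumed meaning of "k times as much":
                          silence_factor = silence_ratio * speech_factor
    """
    if speech_s <= 0 or silence_s < 0:
        raise ValueError("speech_s must be positive and silence_s non-negative")
    if overall_rate <= 0 or silence_ratio <= 0:
        raise ValueError("overall_rate and silence_ratio must be positive")

    # Target duration of the whole utterance after compression.
    target_s = (speech_s + silence_s) / overall_rate

    # Solve speech_s / a + silence_s / (silence_ratio * a) = target_s for a.
    speech_factor = (speech_s + silence_s / silence_ratio) / target_s
    silence_factor = silence_ratio * speech_factor
    return speech_factor, silence_factor
```

Under these assumptions, an utterance with 8 s of speech and 2 s of silence compressed to an overall rate of 2 gives a speech factor of 1.8 and a silence factor of 3.6, so speech is compressed less than the uniform factor of 2 while the total duration is still halved.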
Original language | English |
---|---|
Publisher | arXiv.org |
Number of pages | 5 |
Publication status | Published - 1 Jan 2019 |
Keywords / Materials (for Non-textual outputs)
- Electrical Engineering and Systems Science - Audio and Speech Processing
Projects
- 1 Finished
- Synthesis of Fast Speech/Speech Synthesis of Auditive:Lecture Books (SALB)
  Yamagishi, J.
  1/02/13 → 31/03/14
  Project: Research
Datasets
- Alba speech corpus
  Valentini Botinhao, C. (Creator) & Yamagishi, J. (Creator), Edinburgh DataShare, 6 Mar 2019
  DOI: 10.7488/ds/2506
  Dataset