Edinburgh Research Explorer

The listening talker: A review of human and algorithmic context-induced modifications of speech

Research output: Contribution to journalLiterature review

  • Martin Cooke
  • Simon King
  • Maeva Garnier
  • Vincent Aubanel

Related Edinburgh Organisations

Documents

  • Download as Adobe PDF

    Rights statement: © Cooke, M., King, S., Garnier, M., & Aubanel, V. (2014). The listening talker: A review of human and algorithmic context-induced modifications of speech. Computer Speech & Language, 28(2), 543-571. 10.1016/j.csl.2013.08.003

    Submitted manuscript, 402 KB, PDF-document

Original languageEnglish
Pages (from-to)543-571
Number of pages29
JournalComputer Speech and Language
Volume28
Issue number2
DOIs
StatePublished - Mar 2014

Abstract

Speech output technology is finding widespread application, including in scenarios where intelligibility might be compromised at least for some listeners by adverse conditions. Unlike most current algorithms, talkers continually adapt their speech patterns as a response to the immediate context of spoken communication, where the type of interlocutor and the environment are the dominant situational factors influencing speech production. Observations of talker behaviour can motivate the design of more robust speech output algorithms. Starting with a listener-oriented categorisation of possible goals for speech modification, this review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise. The review concludes by tabulating 46 speech modifications, many of which have yet to be perceptually or algorithmically evaluated. Consequently, the review provides a roadmap for future work in improving the robustness of speech output. (C) 2013 Elsevier Ltd. All rights reserved.

    Research areas

  • Speech production, Modification algorithms, INFANT-DIRECTED SPEECH, ACOUSTIC-PHONETIC CHARACTERISTICS, FLATTENED FUNDAMENTAL-FREQUENCY, HEARING-IMPAIRED LISTENERS, HIGH NOISE-LEVELS, HARD-OF-HEARING, CLEAR SPEECH, CONVERSATIONAL SPEECH, MOTHERS SPEECH, WORD RECOGNITION

Download statistics

No data available

ID: 14407786