Edinburgh Research Explorer

Measuring the cognitive load of synthetic speech using a dual task paradigm

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Original languageEnglish
Title of host publication19th Annual Conference of the International Speech Communication, INTERSPEECH 2018
EditorsCC Sekhar, P Rao, PK Ghosh, HA Murthy, B Yegnanarayana, S Umesh, P Alku, SRM Prasanna, S Narayanan
PublisherInternational Speech Communication Association
Pages2843-2847
Number of pages5
DOIs
Publication statusPublished - 6 Sep 2018

Publication series

Name
ISSN (Print)1990-9772

Abstract

We present a methodology for measuring the cognitive load (listening effort) of synthetic speech using a dual task paradigm. Cognitive load is calculated from changes in a listener's performance on a secondary task (e.g., reaction time to decide if a visually-displayed digit is odd or even). Previous related studies have only found significant differences between the best and worst quality systems but failed to separate the systems that lie in between. A paradigm that is sensitive enough to detect differences between state-of-the-art, high quality speech synthesizers would be very useful for advancing the state of the art. In our work, four speech synthesis systems from a previous Blizzard Challenge, and the corresponding natural speech, were compared. Our results show that reaction times slow down as speech quality reduces, as we expected: lower quality speech imposes a greater cognitive load, taking resources away from the secondary task. However, natural speech did not have the fastest reaction times. This intriguing result might indicate that, as speech synthesizers attain near-perfect intelligibility, this paradigm is measuring something like the listener's level of sustained attention and not listening effort. © 2018 International Speech Communication Association. All rights reserved.

    Research areas

  • cognitive load, dual task paradigm, speech synthesis, speech communication, storms, cognitive loads, dual-tasks, quality systems, speech synthesis system, speech synthesizer, state of the art, sustained attention, synthetic speech, speech intelligibility

ID: 76505414