Artificial Personality and Disfluency

Mirjam Wester, Matthew Aylett, Marcus Tomalin, Rasmus Dall

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


The focus of this paper is artificial voices with different personalities. Previous studies have shown links between an individual's use of disfluencies in their speech and their perceived personality. Here, filled pauses (uh and um) and discourse markers (like, you know, I mean) have been included in synthetic speech as a way of creating artificial voices with different personalities. We discuss the automatic insertion of filled pauses and discourse markers (i.e., fillers) into otherwise fluent texts. The automatic system is compared to a ground truth of human "acted" filler insertion. Perceived personality (as defined by the Big Five personality dimensions) of the synthetic speech is assessed by means of a standardised questionnaire. Synthesis without fillers is compared to synthesis with either spontaneous or synthetic fillers. Our findings explore how the inclusion of disfluencies influences the way in which subjects rate the perceived personality of an artificial voice.
Original language: English
Title of host publication: INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association
Place of Publication: Dresden
Publisher: International Speech Communication Association
Number of pages: 5
Publication status: Published - 10 Sep 2015

