Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization

Aidan Pine, Dan Wells, Nathan Thanyehténhas Brinklow, Patrick Littell, Korin Richmond

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract / Description of output

This paper describes the motivation and development of speech synthesis systems for the purposes of language revitalization. By building speech synthesis systems for three Indigenous languages spoken in Canada, Kanien’kéha, Gitksan & SENĆOŦEN, we re-evaluate the question of how much data is required to build low-resource speech synthesis systems featuring state-of-the-art neural models. For example, preliminary results with English data show that a FastSpeech2 model trained with 1 hour of training data can produce speech with comparable naturalness to a Tacotron2 model trained with 10 hours of data. Finally, we motivate future research in evaluation and classroom integration in the field of speech synthesis for language revitalization.
Original languageEnglish
Title of host publicationProceedings of the 60th Annual Meeting of the Association for Computational Linguistics Volume 1: Long Papers
EditorsSmaranda Muresan, Preslav Nakov, Aline Villavicencio
PublisherACL Anthology
Pages14
Number of pages7346
Volume1
ISBN (Electronic)978-1-955917-21-6
Publication statusPublished - 16 May 2022
Event60th Annual Meeting of the Association for Computational Linguistics - The Convention Centre Dublin, Dublin, Ireland
Duration: 22 May 202227 May 2022
https://www.2022.aclweb.org

Conference

Conference60th Annual Meeting of the Association for Computational Linguistics
Abbreviated titleACL 2022
Country/TerritoryIreland
CityDublin
Period22/05/2227/05/22
Internet address

Fingerprint

Dive into the research topics of 'Requirements and Motivations of Low-Resource Speech Synthesis for Language Revitalization'. Together they form a unique fingerprint.

Cite this