Abstract / Description of output
This paper describes the motivation and development of speech synthesis systems for the purposes of language revitalization. By building speech synthesis systems for three Indigenous languages spoken in Canada, Kanien’kéha, Gitksan & SENĆOŦEN, we re-evaluate the question of how much data is required to build low-resource speech synthesis systems featuring state-of-the-art neural models. For example, preliminary results with English data show that a FastSpeech2 model trained with 1 hour of training data can produce speech with comparable naturalness to a Tacotron2 model trained with 10 hours of data. Finally, we motivate future research in evaluation and classroom integration in the field of speech synthesis for language revitalization.
Original language | English |
---|---|
Title of host publication | Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics Volume 1: Long Papers |
Editors | Smaranda Muresan, Preslav Nakov, Aline Villavicencio |
Publisher | ACL Anthology |
Pages | 14 |
Number of pages | 7346 |
Volume | 1 |
ISBN (Electronic) | 978-1-955917-21-6 |
Publication status | Published - 16 May 2022 |
Event | 60th Annual Meeting of the Association for Computational Linguistics - The Convention Centre Dublin, Dublin, Ireland Duration: 22 May 2022 → 27 May 2022 https://www.2022.aclweb.org |
Conference
Conference | 60th Annual Meeting of the Association for Computational Linguistics |
---|---|
Abbreviated title | ACL 2022 |
Country/Territory | Ireland |
City | Dublin |
Period | 22/05/22 → 27/05/22 |
Internet address |