TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

Adriana Stan, Oliver Watts, Yoshitaka Mamiya, Mircea Giurgiu, Robert A. J. Clark, Junichi Yamagishi, Simon King

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Simple4All Tundra (version 1.0) is the first release of a standardised multilingual corpus designed for text-to-speech research with imperfect or found data. The corpus consists of approximately 60 hours of speech data from audiobooks in 14 languages, as well as utterance-level alignments obtained with a lightly-supervised process. Future versions of the corpus will include finer-grained alignment and prosodic annotation, all of which will be made freely available. This paper gives a general outline of the data collected so far, as well as a detailed description of how this has been done, emphasizing the minimal language-specific knowledge and manual intervention used to compile the corpus. To demonstrate its potential use, textto-speech systems have been built for all languages using unsupervised or lightly supervised methods, also briefly presented in the paper.
Original languageEnglish
Title of host publicationINTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association
Subtitle of host publicationLyon, France, August 25-29, 2013
PublisherISCA-INST SPEECH COMMUNICATION ASSOC
Pages2331-2335
Number of pages5
Publication statusPublished - 2013

Fingerprint Dive into the research topics of 'TUNDRA: a multilingual corpus of found data for TTS research created with light supervision'. Together they form a unique fingerprint.

Cite this