Edinburgh Research Explorer

Alba speech corpus

Dataset

Publisher: Edinburgh DataShare
Date made available: 6 Mar 2019

Abstract

Single-speaker read speech corpus of a Scottish-accented female native English speaker (Alba). The corpus was recorded in four speaking styles: plain (normal read speech, around 4 hours of recordings), fast (speaking as fast as possible, around 20 mins), clear_c (computer-directed speech, around 28 mins) and clear_h (speech directed at hearing-impaired listeners, around 20 mins). Audio files are sampled at 48 kHz, segmented into sentences and saved in an uncompressed format (.wav). The underlying sentences are stored in separate text files (.txt).
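
As an illustration of how the sentence-level audio and text files might be read together, the following is a minimal Python sketch using only the standard library. It assumes that each .wav file has a transcript with the same file stem in the same directory and that the text files are UTF-8 encoded; neither detail is stated in this record, so the layout may need adjusting after download. The function name `pair_utterances` is hypothetical.

```python
import wave
from pathlib import Path

EXPECTED_RATE_HZ = 48_000  # sampling rate stated in the abstract


def pair_utterances(root):
    """Pair each .wav file under `root` with the .txt file sharing its stem.

    ASSUMPTION: transcripts sit next to the audio with the same stem and are
    UTF-8 encoded; the actual corpus layout may differ.

    Returns a list of (wav_path, transcript, duration_seconds) tuples.
    Files without a transcript or with an unexpected sampling rate are skipped.
    """
    pairs = []
    for wav_path in sorted(Path(root).rglob("*.wav")):
        txt_path = wav_path.with_suffix(".txt")
        if not txt_path.exists():
            continue  # no matching transcript under the assumed layout
        with wave.open(str(wav_path), "rb") as wav:
            rate = wav.getframerate()
            frames = wav.getnframes()
        if rate != EXPECTED_RATE_HZ:
            continue  # not at the 48 kHz rate described in the abstract
        transcript = txt_path.read_text(encoding="utf-8").strip()
        pairs.append((wav_path, transcript, frames / rate))
    return pairs
```

Summing the returned durations for one speaking style gives a rough check against the approximate recording times quoted above.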

Data Citation

Valentini-Botinhao, Cassia; Yamagishi, Junichi. (2019). Alba speech corpus, [dataset]. University of Edinburgh. https://doi.org/10.7488/ds/2506.

ID: 82005068