The Voice Conversion Challenge 2016

  • Tomoki Toda (Creator)
  • Ling-Hui Chen (Creator)
  • Daisuke Saito (Creator)
  • Fernando Villavicencio (Creator)
  • Mirjam Wester (Creator)
  • Zhizheng Wu (Creator)
  • Junichi Yamagishi (Creator)



The Voice Conversion Challenge (VCC) 2016, one of the special sessions at Interspeech 2016, deals with speaker identity conversion, referred as Voice Conversion (VC). The task of the challenge was speaker conversion, i.e., to transform the voice identity of a source speaker into that of a target speaker while preserving the linguistic content. Using a common dataset consisting of 162 utterances for training and 54 utterances for evaluation from each of 5 source and 5 target speakers, 17 groups working in VC around the world developed their own VC systems for every combination of the source and target speakers, i.e., 25 systems in total, and generated voice samples converted by the developed systems. The objective of the VCC was to compare various VC techniques on identical training and evaluation speech data. The samples were evaluated in terms of target speaker similarity and naturalness by 200 listeners in a controlled environment. This dataset consists of the participants' VC submissions and the listening test results for naturalness and similarity.

Data Citation

Toda, Tomoki; Chen, Ling-Hui; Saito, Daisuke; Villavicencio, Fernando; Wester, Mirjam; Wu, Zhizheng; Yamagishi, Junichi. (2016). The Voice Conversion Challenge 2016, 2016 [dataset]. University of Edinburgh. School of Informatics. Centre for Speech Technology Research.
Date made available23 Jun 2016
PublisherEdinburgh DataShare
Date of data production2016

Cite this