Projects per year
Abstract
The depth of the neural network is a vital factor that affects its performance. Recently a new architecture called highway network was proposed. This network facilitates the training process of a very deep neural network by using gate units to control a information highway over the conventional hidden layer. For the speech synthesis task, we investigate the performance of highway networks with up to 40 hidden layers. The results suggest that a highway network with 14 non-linear transformation layers is the best choice on our speech corpus and this highway network achieves better performance than a feed-forward network with 14 hidden layers. On the basis of these results, we further investigate a multi-stream highway network where separate highway networks are used to predict different kinds of acoustic features such as the spectral and F0 features. Results
of the experiments suggest that the multi-stream highway network can achieve better objective results than the single network that predicts all the acoustic features. Analysis on the output of highway gate units also supports the assumption for the multi-stream network that different hidden representation may be necessary to predict spectral and F0 features.
of the experiments suggest that the multi-stream highway network can achieve better objective results than the single network that predicts all the acoustic features. Analysis on the output of highway gate units also supports the assumption for the multi-stream network that different hidden representation may be necessary to predict spectral and F0 features.
Original language | English |
---|---|
Title of host publication | 9th ISCA Speech Synthesis Workshop |
Pages | 166-171 |
Number of pages | 6 |
DOIs | |
Publication status | Published - 15 Sept 2016 |
Event | 9th ISCA Speech Synthesis Workshop - Sunnyvale, United States Duration: 13 Sept 2016 → 15 Sept 2016 http://ssw9.talp.cat/ |
Publication series
Name | |
---|---|
ISSN (Print) | 1234-5678 |
Conference
Conference | 9th ISCA Speech Synthesis Workshop |
---|---|
Abbreviated title | ISCA 2016 |
Country/Territory | United States |
City | Sunnyvale |
Period | 13/09/16 → 15/09/16 |
Internet address |
Fingerprint
Dive into the research topics of 'Investigating Very Deep Highway Networks for Parametric Speech Synthesis'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Deep architectures for statistical speech synthesis
Yamagishi, J.
UK industry, commerce and public corporations
4/09/12 → 3/03/16
Project: Research
-