Edinburgh Research Explorer

Centre for Speech Technology Research

Organisational unit: Research Centre

  1. 2020
  2. A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis

    Wang, X., Takaki, S., Yamagishi, J., King, S. & Tokuda, K., 1 Jan 2020, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28, p. 157-170 13 p.

    Research output: Contribution to journal › Article

  3. 2019
  4. Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments

    Yasuda, Y., Wang, X. & Yamagishi, J., 22 Sep 2019, Proceedings of the 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 1-6 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  5. Measuring the contribution to cognitive load of each predicted vocoder speech parameter in DNN-based speech synthesis

    Govender, A., Valentini-Botinhao, C. & King, S., 22 Sep 2019, Proceedings of the 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 121-126 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  6. Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis

    Wang, X. & Yamagishi, J., 22 Sep 2019, Proceedings of the 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 1-6 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  7. Where do the improvements come from in sequence-to-sequence neural TTS?

    Watts, O., Henter, G., Fong, J. & Valentini-Botinhao, C., 22 Sep 2019, 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 217-222 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  8. ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection

    Todisco, M., Wang, X., Vestman, V., Sahidullah, M., Delgado, H., Nautsch, A., Yamagishi, J., Evans, N., Kinnunen, T. & Aik Lee, K., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1008-1012 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  9. Detecting Topic-Oriented Speaker Stance in Conversational Speech

    Lai, C., Alex, B., Moore, J. D., Tian, L., Hori, T. & Francesca, G., 19 Sep 2019, Proceedings of Interspeech 2019. International Speech Communication Association, p. 46-50 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  10. Direct F0 Estimation with Neural-Network-based Regression

    Xu, S. & Shimodaira, H., 19 Sep 2019, Proc. Interspeech 2019. International Speech Communication Association, p. 1995-1999 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  11. Evaluating Near End Listening Enhancement Algorithms in Realistic Environments

    Chermaz, C., Valentini-Botinhao, C., Schepker, H. & King, S., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1373-1377 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  12. GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram

    Juvela, L., Bollepalli, B., Yamagishi, J. & Alku, P., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 694-698 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
