Edinburgh Research Explorer

Centre for Speech Technology Research

Organisational unit: Research Centre

  1. 2020
  2. Integrating lexical and prosodic features for automatic paragraph segmentation

    Lai, C., Farrús, M. & Moore, J., 11 May 2020, In : Speech Communication. 121, p. 44-57

    Research output: Contribution to journalArticle

  3. Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0

    Hodari, Z., Lai, C. & King, S., 1 Mar 2020, (Accepted/In press) Proceedings of Speech Prosody 2020.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  4. Acoustic model adaptation from raw waveforms with Sincnet

    Fainberg, J., Klejch, O., Loweimi, E., Bell, P. & Renals, S., 20 Feb 2020, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Institute of Electrical and Electronics Engineers (IEEE), p. 897-904 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  5. Bootstrapping Non-Parallel Voice Conversion From Speaker-Adaptive Text-to-Speech

    Luong, H-T. & Yamagishi, J., 20 Feb 2020, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Institute of Electrical and Electronics Engineers (IEEE), p. 200-207 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  6. Embeddings for DNN speaker adaptive training

    Równicka, J., Bell, P. & Renals, S., 20 Feb 2020, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Institute of Electrical and Electronics Engineers (IEEE), p. 479-486 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  7. Speaker adaptive training using model agnostic meta-learning

    Klejch, O., Fainberg, J., Bell, P. & Renals, S., 20 Feb 2020, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Institute of Electrical and Electronics Engineers (IEEE), p. 881-888 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  8. The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech

    Ali, A., Shon, S., Samih, Y., Mubarak, H., Abdelali, A., Glass, J., Renals, S. & Choukri, K., 20 Feb 2020, 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Institute of Electrical and Electronics Engineers (IEEE), p. 1026-1033 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  9. European Language Grid: An Overview

    Rehm, G., Berger, M., Elsholz, E., Hegele, S., Kintzel, F., Marheinecke, K., Piperidis, S., Deligiannis, M., Galanis, D., Gkirtzou, K., Labropoulou, P., Bontcheva, K., Jones, D., Roberts, I., Hajic, J., Hamrlová, J., Kačena, L., Choukri, K., Arranz, V., Vasiļjevs, A. & 16 others, Anvari, O., Lagzdiņš, A., Meļņika, J., Backfried, G., Dikici, E., Janosik, M., Prinz, K., Prinz, C., Stampler, S., Thomas-Aniola, D., Manuel Gomez-Perez, J., Garcia Silva, A., Berrío, C., Germann, U., Renals, S. & Klejch, O., 11 Feb 2020, (Accepted/In press) Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). 15 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  10. Channel Adversarial Training for Speaker Verification and Diarization

    Luu, C., Bell, P. & Renals, S., 24 Jan 2020, (Accepted/In press) Proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  11. Cross Lingual Transfer Learning for Zero-Resource Domain Adaptation

    Abad Gareta, A., Bell, P., Carmantini, A. & Renals, S., 24 Jan 2020, (Accepted/In press) Proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  12. Learning Noise Invariant Features Through Transfer Learning for Robust End-to-End Speech Recognition

    Zhang, S., Do, C-T., Doddipatla, R. & Renals, S., 24 Jan 2020, (Accepted/In press) 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing. Institute of Electrical and Electronics Engineers (IEEE), 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  13. Multi-Scale Octave Convolutions for Robust Speech Recognition

    Równicka, J., Bell, P. & Renals, S., 24 Jan 2020, (Accepted/In press) Proceedings of the 45th International Conference on Acoustics, Speech, and Signal Processing. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  14. A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis

    Wang, X., Takaki, S., Yamagishi, J., King, S. & Tokuda, K., 1 Jan 2020, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28, p. 157-170 13 p.

    Research output: Contribution to journalArticle

  15. Neural Source-Filter Waveform Models for Statistical Parametric Speech Synthesis

    Wang, X., Takaki, S. & Yamagishi, J., 2020, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing . 28, p. 402-415 14 p.

    Research output: Contribution to journalArticle

  16. 2019
  17. Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments

    Yasuda, Y., Wang, X. & Yamagishi, J., 22 Sep 2019, Proceedings of the 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 1-6 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  18. Measuring the contribution to cognitive load of each predicted vocoder speech parameter in DNN-based speech synthesis

    Govender, A., Valentini-Botinhao, C. & King, S., 22 Sep 2019, Proceedings of the 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 121-126 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  19. Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis

    Wang, X. & Yamagishi, J., 22 Sep 2019, Proceedings of the 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 1-6 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  20. Where do the improvements come from in sequence-to-sequence neural TTS?

    Watts, O., Henter, G., Fong, J. & Valentini-Botinhao, C., 22 Sep 2019, 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 217-222 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  21. ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection

    Todisco, M., Wang, X., Vestman, V., Sahidullah, M., Delgado, H., Nautsch, A., Yamagishi, J., Evans, N., Kinnunen, T. & Aik Lee, K., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1008-1012 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  22. Detecting Topic-Oriented Speaker Stance in Conversational Speech

    Lai, C., Alex, B., Moore, J. D., Tian, L., Hori, T. & Francesca, G., 19 Sep 2019, Proceedings of Interspeech 2019. International Speech Communication Association, p. 46-50 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  23. Direct F0 Estimation with Neural-Network-based Regression

    Xu, S. & Shimodaira, H., 19 Sep 2019, Proc. Interspeech 2019. International Speech Communication Association, p. 1995-1999 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  24. Evaluating Near End Listening Enhancement Algorithms in Realistic Environments

    Chermaz, C., Valentini Botinhao, C., Schepker, H. & King, S., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1373-1377 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  25. GELP: GAN-Excited Liner Prediction for Speech Synthesis from Mel-Spectrogram

    Juvela, L., Bollepalli, B., Yamagishi, J. & Alku, P., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 694-698 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  26. Improving speech synthesis with discourse relations

    Aubin, A., Cervone, A., Watts, O. & King, S., 19 Sep 2019, Interspeech 2019. ISCA, Vol. 2019-September. p. 4470-4474 (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  27. Lattice-based lightly-supervised acoustic model training

    Fainberg, J., Klejch, O., Renals, S. & Bell, P., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1596-1600 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  28. On Learning Interpretable CNNs with Parametric Modulated Kernel-based Filters

    Loweimi, E., Bell, P. & Renals, S., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 3480-3484 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  29. Synchronising audio and ultrasound by learning cross-modal embeddings

    Eshky, A., Ribeiro, M., Richmond, K. & Renals, S., 19 Sep 2019, INTERSPEECH 2019: Proceedings of the 20th Annual Conference of the International Speech Communication Association (ISCA). Graz, Austria: International Speech Communication Association, p. 4100-4104 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  30. Trainable Dynamic Subsampling for End-to-End Speech Recognition

    Zhang, S., Loweimi, E., Xu, Y., Bell, P. & Renals, S., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1413-1417 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  31. Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora

    Luong, H-T., Wang, X., Yamagishi, J. & Nishizawa, N., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1303-1307 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  32. Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions

    Ribeiro, M., Eshky, A., Richmond, K. & Renals, S., 19 Sep 2019, INTERSPEECH 2019: Proceedings of the 20th Annual Conference of the International Speech Communication Association (ISCA). Graz, Austria: International Speech Communication Association, p. 16-20 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  33. Untranscribed web audio for low resource speech recognition

    Carmantini, A., Bell, P. & Renals, S., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 226-230 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  34. The prosody of presupposition projection in naturally-occurring utterances

    Mahler, T., de Marneffe, M-C. & Lai, C., 7 Sep 2019. 2 p.

    Research output: Contribution to conferencePoster

  35. Modern speech synthesis for phonetic sciences: a discussion and an evaluation

    Malisz, Z., Eje Henter, G., Valentini Botinhao, C., Watts, O., Beskow, J. & Gustafson, J., 31 Aug 2019, Proceedings of the 19th International Congress of Phonetic Sciences ICPhS 2019. Calhoun, S., Escudero, P., Tabain, M. & Warren, P. (eds.). Canberra, Australia: Australasian Speech Science and Technology Association Inc.: Australian Speech Science & Technology Association Inc, p. 487-491 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  36. Normal-to-Lombard Adaptation of Speech Synthesis Using Long Short-Term Memory Recurrent Neural Networks

    Bollepalli, B., Juvela, L., Airaksinen, M., Valentini Botinhao, C. & Alku, P., 1 Jul 2019, In : Speech Communication. 110, p. 64-75 21 p.

    Research output: Contribution to journalArticle

  37. Multi-task Learning For Detecting and Segmenting Manipulated Facial Images and Videos

    H. Nguyen, H., Fang, F., Yamagishi, J. & Echizen, I., 15 Jun 2019, (Accepted/In press) The Tenth IEEE International Conference on Biometrics: Theory, Applications, and Systems (BTAS 2019). 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  38. Spatio-temporal generative adversarial network for gait anonymization

    Tieu, N. D. T., Nguyen, H. H., Nguyen-Son, H. Q., Yamagishi, J. & Echizen, I., 1 Jun 2019, In : Journal of Information Security and Applications. 46, p. 307-319 13 p.

    Research output: Contribution to journalArticle

  39. Audiovisual Speaker Conversion: Jointly and Simultaneously Transforming Facial Expression and Acoustic Characteristics

    Fang, F., Wang, X., Yamagishi, J. & Echizen, I., 17 May 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 6795-6799 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  40. Capsule-forensics: Using Capsule Networks to Detect Forged Images and Videos

    Nguyen, H. H., Yamagishi, J. & Echizen, I., 17 May 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 2307-2311 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  41. Cycle-consistent Adversarial Networks for Non-parallel Vocal Effort Based Speaking Style Conversion

    Seshadri, S., Juvela, L., Yamagishi, J., Rasanen, O. & Alku, P., 17 May 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 6835-6839 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  42. Investigation of Enhanced Tacotron Text-to-speech Synthesis Systems with Self-attention for Pitch Accent Language

    Yasuda, Y., Wang, X., Takaki, S. & Yamagishi, J., 17 May 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 6905-6909 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  43. STFT Spectral Loss for Training a Neural Speech Waveform Model

    Takaki, S., Nakashika, T., Wang, X. & Yamagishi, J., 17 May 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 7065-7069 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  44. Waveform Generation for Text-to-speech Synthesis Using Pitch-synchronous Multi-scale Generative Adversarial Networks

    Juvela, L., Bollepalli, B., Yamagishi, J. & Alku, P., 17 May 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 6915-6919 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  45. "Why is the Doctor a Man?" Reactions of Older Adults to a Virtual Training Doctor

    Constantin, A., Lai, C., Farrow, E., Alex, B., Pel-Littel, R., Nap, H. H. & Jeuring, J., 2 May 2019, Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems. Glasgow, Scotland UK: ACM, 6 p. LBW1719. (CHI EA '19).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  46. Attentive filtering networks for audio replay attack detection

    Lai, C-I., Abad, A., Richmond, K., Yamagishi, J., Dehak, N. & King, S., 17 Apr 2019, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing. p. 6316-6320

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  47. Dynamic Evaluation of Transformer Language Models

    Krause, B., Mbabazi, E., Murray, I. & Renals, S., 17 Apr 2019, 6 p.

    Research output: Working paper

  48. On the Usefulness of Statistical Normalisation of Bottleneck Features for Speech Recognition

    Loweimi, E., Bell, P. & Renals, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 3862-3866 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  49. Speaker-Independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech

    Ribeiro, M. S., Eshky, A., Richmond, K. & Renals, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 1328-1332 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  50. Speech Waveform Reconstruction using Convolutional Neural Networks with Noise and Periodic Inputs

    Watts, O., Valentini Botinhao, C. & King, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 7045-7049 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  51. Windowed Attention Mechanisms for Speech Recognition

    Zhang, S., Loweimi, E., Bell, P. & Renals, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 7100-7104 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  52. Unsupervised Speaker Adaptation for DNN-based Speech Synthesis using Input Codes

    Takaki, S., Nishimura, Y. & Yamagishi, J., 7 Mar 2019, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018. Honolulu, Hawaii, USA: Institute of Electrical and Electronics Engineers (IEEE), p. 649-658 10 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  53. Recognizing Induced Emotions of Movie Audiences From Multimodal Information

    Muszynski, M., Tian, L., Lai, C., Moore, J., Kostoulas, T., Lombardo, P., Pun, T. & Chanel, G., 27 Feb 2019, In : IEEE Transactions on Affective Computing. 17 p.

    Research output: Contribution to journalArticle

  54. Analyzing deep CNN-based utterance embeddings for acoustic model adaptation

    Równicka, J., Bell, P. & Renals, S., 14 Feb 2019, 2018 IEEE Spoken Language Technology Workshop (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 235-241 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  55. Exemplar-based speech waveform generation for text-to-speech

    Valentini Botinhao, C., Watts, O., Espic Calderón, F. & King, S., 14 Feb 2019, 2018 IEEE Workshop on Spoken Language Technology (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 332-338 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  56. Scaling and Bias Codes for Modeling Speaker-Adaptive DNN-based Speech Synthesis Systems

    Luong, H-T. & Yamagishi, J., 14 Feb 2019, IEEE 2018 Workshop on spoken language technology (SLT 2018). Athens, Greece: Institute of Electrical and Electronics Engineers (IEEE), p. 610-617 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  57. Complex-Valued Restricted Boltzmann Machine for Speaker-Dependent Speech Parameterization from Complex Spectra

    Nakashika, T., Takaki, S. & Yamagishi, J., Feb 2019, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. p. 244-254 11 p.

    Research output: Contribution to journalArticle

  58. Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems

    Fang, F., Yamagishi, J., Echizen, I., Sahidullah, M. & Kinnunen, T., 31 Jan 2019, IEEE International Workshop on Information Forensics and Security (WIFS) 2018. Hong Kong: Institute of Electrical and Electronics Engineers (IEEE), 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  59. Comunicaci ́on enriquecida a lo largo de la vida

    Cooke, M., King, S., Hazan, V., Stylianou, Y., Janse, E., Baskent, D., Hohmann, V., Winneke, A. & Hernaez, I., 2019, In : Procesamiento del Lenguaje Natural. 63, p. 175-178

    Research output: Contribution to journalArticle

  60. 2018
  61. Identifying Computer-Translated Paragraphs using Coherence Features

    Nguyen-Son, H-Q., T. Tieu, N-D., H. Nguyen, H., Yamagishi, J. & Echizen, I., 3 Dec 2018, Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation (PACLIC 32). Hung Hom, Kowloon Hong Kong: Association for Computational Linguistics (ACL), 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  62. Multimodal Analysis of Group Attitudes Towards Meeting Management

    Murray, G. & Lai, C., 16 Oct 2018, Group Interaction Frontiers in Technology (GIFT'18). Boulder, CO, USA: ACM, p. 4:1-4:6 6 p. 4

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  63. Predicting Group Satisfaction in Meeting Discussions

    Lai, C. & Murray, G., 16 Oct 2018, Workshop on Modeling Cognitive Processes from Multimodal Data (MCPMD'18). ACM, 8 p. 1

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  64. Transformation on Computer-Generated Facial Image to Avoid Detection by Spoofing Detector

    H. Nguyen, H., T. Tieu, N-D., Nguyen-Son, H-Q., Yamagishi, J. & Echizen, I., 11 Oct 2018, IEEE International Conference on Multimedia and Expo (ICME) 2018. San Diego, USA: Institute of Electrical and Electronics Engineers (IEEE), 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  65. Group Interaction Frontiers in Technology

    Murray, G., Hung, H., Keyton, J., Lai, C., Lehmann-Willenbrock, N. & Oertel, C., 2 Oct 2018, Proceedings of the 20th ACM International Conference on Multimodal Interaction. Boulder, CO, USA: ACM, p. 660-662 3 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  66. Dynamic Evaluation of Neural Sequence Models

    Krause, B., Mbabazi, E., Murray, I. & Renals, S., 1 Oct 2018, Proceedings of the 35th International Conference on Machine Learning. Dy, J. & Krause, A. (eds.). Stockholmsmässan, Stockholm Sweden: PMLR, Vol. 80. p. 2766-2775 10 p. (Proceedings of Machine Learning Research).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  67. Capturing the sounds of an urban greenspace

    Klein, E., Chapple, S., Fainberg, J., Magill, C., Parker, M., Raab, C. & Silvertown, J., 20 Sep 2018, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences: 3rd International Conference on Smart Data and Smart Cities. Delft, The Netherlands: Copernicus Publications, Vol. XLII-4/W11. p. 19-26 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  68. A Comparison of Recent Waveform Generation and Acoustic Modeling Methods for Neural-Network-Based Speech Synthesis

    Wang, X., Lorenzo-Trueba, J., Takaki, S., Juvela, L. & Yamagishi, J., 13 Sep 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Calgary, AB, Canada. Calgary, Alberta, Canada: Institute of Electrical and Electronics Engineers (IEEE), p. 4804-4808 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  69. Cyborg Speech: Deep Multilingual Speech Synthesis For Generating Segmental Foreign Accent With Natural Prosody

    Eje Henter, G., Lorenzo-Trueba, J., Wang, X., Kondo, M. & Yamagishi, J., 13 Sep 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Calgary, AB, Canada. Calgary, Alberta, Canada: Institute of Electrical and Electronics Engineers (IEEE), p. 4799-4803 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  70. High-Quality Nonparallel Voice Conversion Based On Cycle-Consistent Adversarial Network

    Fang, F., Yamagishi, J., Echizen, I. & Lorenzo-Trueba, J., 13 Sep 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Calgary, AB, Canada. Calgary, Alberta, Canada: Institute of Electrical and Electronics Engineers (IEEE), p. 5279-5283 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  71. Speech Waveform Synthesis From MFCC Sequences With Generative Adversarial Networks

    Juvela, L., Bollepalli, B., Wang, X., Kameoka, H., Airaksinen, M., Yamagishi, J. & Alku, P., 13 Sep 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Calgary, AB, Canada. Calgary, Alberta, Canada: Institute of Electrical and Electronics Engineers (IEEE), p. 5679-5683 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  72. Exemplar-based Speech Waveform Generation

    Watts, O., Valentini Botinhao, C., Espic calderón, F. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India: ISCA, p. 2022-2026 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  73. Impact of different speech types on listening effort

    Symantiraki, O., Cooke, M. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2267-2271

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  74. Learning interpretable control dimensions for speech synthesis by using external data

    Hodari, Z., Watts, O., Ronanki, S. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India: ISCA, p. 32-36 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  75. Measuring the cognitive load of synthetic speech using a dual task paradigm

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  76. Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation

    Luong, H-T. & Yamagishi, J., 6 Sep 2018, Proc. Interspeech 2018. Hyderabad, India: ISCA, p. 2494-2498 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  77. Using pupillometry to measure the cognitive load of synthetic speech

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  78. UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions

    Eshky, A., Ribeiro, M. S., Cleland, J., Richmond, K., Roxburgh, Z., Scobbie, J. & Wrench, A., 5 Sep 2018, INTERSPEECH 2018: Proceedings of the 19th Annual Conference of the International Speech Communication Association (ISCA). Hyderabad, India: ISCA, p. 1888-1892 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  79. Learning to Adapt: a Meta-learning Approach for Speaker Adaptation

    Klejch, O., Fainberg, J. & Bell, P., Sep 2018, Proc. of Interspeech 2018. Hyderabad, India: ISCA, p. 867-871 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  80. Modular Convolutional Neural Network for Discriminating between Computer-Generated Images and Photographic Images

    H. Nguyen, H., T. Tieu, N-D., Nguyen-Son, H-Q., Nozick, V., Yamagishi, J. & Echizen, I., 27 Aug 2018, 13th International Conference on Availability, Reliability and Security (ARES 2018). Hamburg, Germany: ACM, p. 1:1-1:10 10 p. 1

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  81. Speech Enhancement of Noisy and Reverberant Speech for Text-to-Speech

    Valentini Botinhao, C. & Yamagishi, J., Aug 2018, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26, 8, p. 1420-1433 14 p.

    Research output: Contribution to journalArticle

  82. Polarity and Intensity: the Two Aspects of Sentiment Analysis

    Tian, L., Lai, C. & Moore, J., Jul 2018, Proceedings of Grand Challenge and Workshop on Human Multimodal Language (Challenge-HML) . Melbourne, Australia: Association for Computational Linguistics (ACL), p. 40-47 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  83. Word Error Rate Estimation for Speech Recognition: e-WER

    Ali, A. & Renals, S., Jul 2018, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Melbourne, Australia : Association for Computational Linguistics (ACL), p. 20-24 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  84. A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment

    Kinnunen, T., Lorenzo-Trueba, J., Yamagishi, J., Toda, T., Saito, D., Villavicencio, F. & Ling, Z., 29 Jun 2018, Speaker Odyssey 2018: The Speaker and Language Recognition Workshop. Les Sables d'Olonne, France: ISCA, p. 187-194 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  85. ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements

    Delgado, H., Todisco, M., Sahidullah, M., Evans, N., Kinnunen, T., Aik Lee, K. & Yamagishi, J., 29 Jun 2018, Speaker Odyssey 2018: The Speaker and Language Recognition Workshop. Les Sables d’Olonne, France: ISCA, p. 296-303 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  86. Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama’s voice using GAN, WaveNet and low-quality found data

    Lorenzo-Trueba, J., Fang, F., Wang, X., Echizen, I., Yamagishi, J. & Kinnunen, T., 29 Jun 2018, Speaker Odyssey 2018: The Speaker and Language Recognition Workshop. Les Sables d’Olonne, France: ISCA, p. 240-247 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  87. The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods

    Lorenzo-Trueba, J., Yamagishi, J., Toda, T., Saito, D., Villavicencio, F., Kinnunen, T. & Ling, Z., 29 Jun 2018, Speaker Odyssey 2018: The Speaker and Language Recognition Workshop. Les Sables d’Olonne, France: ISCA, p. 195-202 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  88. t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification

    Kinnunen, T., Aik Lee, K., Delgado, H., Evans, N., Todisco, M., Sahidullah, M., Yamagishi, J. & A. Reynolds, D., 29 Jun 2018, Speaker Odyssey 2018: The Speaker and Language Recognition Workshop. Les Sables d'Olonne, France: ISCA, p. 312-319 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  89. Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion

    Todisco, M., Delgado, H., Aik Lee, K., Sahidullah, M., Evans, N., Kinnunen, T. & Yamagishi, J., 4 Jun 2018, (Accepted/In press) Interspeech 2018. Hyderabad, India, 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  90. Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects

    Luong, H-T., Wang, X., Yamagishi, J. & Nishizawa, N., 4 Jun 2018, (Accepted/In press) Interspeech 2018. Hyderabad, India, 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  91. Speaker-independent raw waveform model for glottal excitation

    Juvela, L., Tsiaras, V., Bollepalli, B., Airaksinen, M., Yamagishi, J. & Alku, P., 4 Jun 2018, (Accepted/In press) Interspeech 2018. Hyderabad, India, 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  92. A Comparison Between STRAIGHT, Glottal, an Sinusoidal Vocoding in Statistical Parametric Speech Synthesis

    Airaksinen, M., Juvela, L., Bollepalli, B., Yamagishi, J. & Alku, P., 11 May 2018, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26, 9, p. 1658-1670 13 p.

    Research output: Contribution to journalArticle

  93. Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis

    Lorenzo-Trueba, J., Eje Henter, G., Takaki, S., Yamagishi, J., Morino, Y. & Ochiai, Y., May 2018, In : Speech Communication. 99, p. 135-143 9 p.

    Research output: Contribution to journalArticle

  94. Autoregressive neural F0 model for statistical parametric speech synthesis

    Wang, X., Takaki, S. & Yamagishi, J., 19 Apr 2018, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 26, 8, p. 1406-1419 14 p.

    Research output: Contribution to journalArticle

  95. Dual-modality Talking-metrics: 3D Visual-Audio Integrated Behaviometric Cues from Speakers

    Zhang, J., Richmond, K. & Fisher, R., 10 Apr 2018, (Accepted/In press) The 24th International Conference on Pattern Recognition. 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  96. Exploring the Use of Group Delay for Generalised VTS Based Noise Compensation

    Loweimi, E., Barker, J. & Hain, T., 1 Apr 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 4824-4828 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  97. A multilinear tongue model derived from speech related MRI data of the human vocal tract

    Hewer, A., Wuhrer, S., Steiner, I. & Richmond, K., 21 Feb 2018, In : Computer Speech and Language. 51, p. 68-92

    Research output: Contribution to journalArticle

  98. Identifying Computer-Generated Text Using Statistical Analysis

    Nguyen-Son, H-Q., T. Tieu, N-D., H. Nguyen, H., Yamagishi, J. & Echizen, I., 8 Feb 2018, 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference. Kuala Lumpur, Malaysia: Institute of Electrical and Electronics Engineers (IEEE), p. 1504-1511 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  99. Recognizing Induced Emotions of Movie Audiences: Are Induced and Perceived Emotions the Same?

    Tian, L., Muszynski, MI., Lai, C., Moore, J., Kostoulas, T., Lombardo, P., Pun, T. & Chanel, G., 1 Feb 2018, Seventh International Conference on Affective Computing and Intelligent Interaction (ACII2017). Institute of Electrical and Electronics Engineers (IEEE), p. 28-35 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  100. An Approach for Gait Anonymization Using Deep Learning

    T. Tieu, N-D., H. Nguyen, H., Nguyen-Son, H-Q., Yamagishi, J. & Echizen, I., 25 Jan 2018, 9th IEEE International Workshop on Information Forensics and Security (WIFS) 2017. Rennes, France: Institute of Electrical and Electronics Engineers (IEEE), 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  101. Distinguishing Computer Graphics from Natural Images Using Convolution Neural Networks

    Rahmouni, N., Nozick, V., Yamagishi, J. & Echizen, I., 25 Jan 2018, 9th IEEE International Workshop on Information Forensics and Security (WIFS) 2017. Institute of Electrical and Electronics Engineers (IEEE), Vol. Rennes, France. 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  102. Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features

    Tsunoo, E., Klejch, O., Bell, P. & Renals, S., 25 Jan 2018, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Institute of Electrical and Electronics Engineers (IEEE), p. 525-532 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  103. Simplifying very deep convolutional neural network architectures for robust speech recognition

    Rownicka, J., Renals, S. & Bell, P., 25 Jan 2018, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017). Institute of Electrical and Electronics Engineers (IEEE), p. 236-243 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  104. Speech Recognition Challenge in the Wild: Arabic MGB-3

    Ali, A., Vogel, S. & Renals, S., 25 Jan 2018, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017). Institute of Electrical and Electronics Engineers (IEEE), p. 316-322 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  105. WERd: Using Social Text Spelling Variants for Evaluating Dialectal Speech Recognition

    Ali, A., Nakov, P., Bell, P. & Renals, S., 25 Jan 2018, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2017). Institute of Electrical and Electronics Engineers (IEEE), p. 141-148 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  106. On the Usefulness of the Speech Phase Spectrum for Pitch Extraction

    Loweimi, E., Barker, J. & Hain, T., 2018, Proc. Interspeech 2018. ISCA, p. 696-700 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  107. The CSTR entry to the 2018 Blizzard Challenge

    Espic calderón, F., Govender, A., Ribeiro, M. S., Valentini Botinhao, C. & Watts, O., 2018, Blizzard Challenge 2018 workshop. Hyderabad, India, 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  108. 2017
  109. Recognizing Emotions in Spoken Dialogue with Acoustic and Lexical Cues

    Tian, L., Moore, J. & Lai, C., 13 Nov 2017, ICMI 2017 Satellite Workshop Investigating Social Interactions with Artificial Agents. ACM, p. 45-46 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  110. Investigating very deep highway networks for parametric speech synthesis

    Wang, X., Takaki, S. & Yamagishi, J., 8 Nov 2017, In : Speech Communication. 96, p. 1-9 9 p.

    Research output: Contribution to journalArticle

  111. Influence of speaker familiarity on blind and visually impaired children’s and young adults’ perception of synthetic voices

    Puchera, M., Zillinger, B., Toman, M., Schabus, D., Valentini Botinhao, C., Yamagishi, J., Schmid, E. & Woltron, T., 1 Nov 2017, In : Computer Speech and Language. 46, p. 179-195 10 p.

    Research output: Contribution to journalArticle

  112. Distant Speech Recognition Experiments Using the AMI Corpus

    Renals, S. & Swietojanski, P., 30 Oct 2017, New Era for Robust Speech Recognition: Exploiting Deep Learning. Springer, p. 355-368 14 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  113. End-to-end neural segmental models for speech recognition

    Hao, T., Lu, L., Kong, L., Gimpel, K., Livescu, K., Dyer, C., Smith, N. A. & Renals, S., 14 Sep 2017, In : IEEE Journal of Selected Topics in Signal Processing. 11, 8, p. 1254-1264 11 p.

    Research output: Contribution to journalArticle

  114. A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis

    Ronanki, S., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 1133-1137 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  115. A system for real time collaborative transcription correction

    Bell, P., Fainberg, J., Lai, C. & Sinclair, M., 24 Aug 2017, Interspeech 2017. 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  116. An RNN-based Quantized F0 Model with Multi-tier Feedback Links for Text-to-Speech Synthesis

    Wang, X., Takaki, S. & Yamagishi, J., 24 Aug 2017, Proceedings Interspeech 2017. p. 1059-1063 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  117. Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR

    Loweimi, E., Barker, J. & Hain, T., 24 Aug 2017, Proc. Interspeech 2017. Stockholm, Sweden: ISCA, p. 2466-2470 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  118. Complex-valued restricted Boltzmann machine for direct learning of frequency spectra

    Nakashika, T., Takaki, S. & Yamagishi, J., 24 Aug 2017, Proceedings Interspeech 2017. p. 4021-4025 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  119. Factorised representations for neural network adaptation to diverse acoustic environments

    Fainberg, J., Renals, S. & Bell, P., 24 Aug 2017, Proceedings Interspeech 2017. p. 749-753 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  120. Hierarchical Recurrent Neural Network for Story Segmentation

    Tsunoo, E., Bell, P. & Renals, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 2919-2923 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  121. Learning word vector representations based on acoustic counts

    Ribeiro, M. S., Watts, O. & Yamagishi, J., 24 Aug 2017, Interspeech 2017. p. 799-803 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  122. Misperceptions of the emotional content of natural and vocoded speech in a car

    Lorenzo-Trueba, J., Valentini Botinhao, C., Henter, G. & Yamagishi, J., 24 Aug 2017, Proceedings Interspeech 2017. International Speech Communication Association, p. 606-610 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  123. Nativization of foreign names in TTS for automatic reading of world news in Swahili

    Mendelson, J., Oplustil, P., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 2188-2192 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  124. Principles for learning controllable TTS from annotated and latent variation

    Henter, G., Lorenzo-Trueba, J., Wang, X. & Yamagishi, J., 24 Aug 2017, Proceedings Interspeech 2017. p. 3956-3960 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  125. Reducing mismatch in training of DNN-based glottal excitation models in a statistical parametric text-to-speech system

    Juvela, L., Bollepalli, B., Yamagishi, J. & Alku, P., 24 Aug 2017, Proceedings Interspeech 2017. p. 1368-1372 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  126. Robust Source-Filter Separation of Speech Signal in the Phase Domain

    Loweimi, E., Barker, J., Torralba, O. S. & Hain, T., 24 Aug 2017, Proc. Interspeech 2017. Stockholm, Sweden: ISCA, p. 414-418 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  127. Speech intelligibility in cars: the effect of speaking style, noise and listener age

    Valentini Botinhao, C. & Yamagishi, J., 24 Aug 2017, Proceedings Interspeech 2017. International Speech Communication Association, p. 2944-2948 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  128. The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection

    Kinnunen, T., Sahidullah, M., Delgado, H., Todisco, M., Evans, N., Yamagishi, J. & Aik Lee, K., 24 Aug 2017, Proceedings Interspeech 2017. p. 2-6 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  129. Using Prosody to Classify Discourse Relations

    Kleinhans, J., Farrus, M., Gravano, A., Perez, J. M., Lai, C. & Wanner, L., 24 Aug 2017, Proceedings Interspeech 2017. p. 3201-3205 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  130. Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis

    Espic calderón, F., Valentini Botinhao, C. & King, S., 20 Aug 2017, Interspeech 2017. 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  131. Direct modeling of frequency spectra and waveform generation based on phase recovery for DNN-based speech synthesis

    Takaki, S., Kameoka, H. & Yamagishi, J., 20 Aug 2017, Proceedings Interspeech 2017. p. 1128-1132 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  132. Generative Adversarial Network-based Postfilter for STFT Spectrograms

    Kaneko, T., Takaki, S., Kameoka, H. & Yamagishi, J., 20 Aug 2017, Proceedings Interspeech 2017. p. 3389-3393 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  133. Investigating different representations for modeling multiple emotions in DNN-based speech synthesis

    Lorenzo-Trueba, J., Eje Henter, G., Takahashi, S., Yamagishi, J., Morino, Y. & Ochiai, Y., 18 Jul 2017, (Accepted/In press). 6 p.

    Research output: Contribution to conferencePaper

  134. Small-footprint highway deep neural networks for speech recognition

    LU, L. & Renals, S., Jul 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 7, p. 1502-1511 10 p.

    Research output: Contribution to journalArticle

  135. Adapting and Controlling DNN-Based Speech Synthesis Using Input Codes

    Luong, H-T., Takaki, S., Henter, G. & Yamagishi, J., 19 Jun 2017, The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2017. Institute of Electrical and Electronics Engineers (IEEE), p. 1905-1909 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  136. An Autoregressive Recurrent Mixture density Network For Parametric Speech Synthesis

    Wang, X., Takaki, S. & Yamagishi, J., 19 Jun 2017, The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2017. Institute of Electrical and Electronics Engineers (IEEE), p. 4895-4899 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  137. Knowledge Distillation for Small-footprint Highway Networks

    Lu, L., Guo, M. & Renals, S., 19 Jun 2017, 2017 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP 2017). Institute of Electrical and Electronics Engineers (IEEE), p. 4280-4284 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  138. Non-Parallel Voice Conversion Using I-Vector PLDA: Towards Unifying Speaker Verification and Transformation

    Kinnunen, T., Juvela, L., Alku, P. & Yamagishi, J., 19 Jun 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5535-5539 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  139. Sequence-to-Sequence Models for Punctuated Transcription Combing Lexical and Acoustic Features

    Klejch, O., Bell, P. & Renals, S., 19 Jun 2017, The 42nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017). Institute of Electrical and Electronics Engineers (IEEE), p. 5700-5704 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  140. Introduction to the Issue on Spoofing and Countermeasures for Automatic Speaker Verification

    Yamagishi, J., Kinnunen, T. H., Evans, N., Leon, P. D. & Trancoso, I., 1 Jun 2017, In : IEEE Journal of Selected Topics in Signal Processing. 11, 4, p. 585-587 3 p.

    Research output: Contribution to journalSpecial issue

  141. ASVspoof: the Automatic Speaker Verification Spoofing and Countermeasures Challenge

    Wu, Z., Yamagishi, J., Kinnunen, T., Hanilc, C., Sahidullah, M., Sizov, A., Evans, N., Todisco, M. & Delgado, H., Jun 2017, In : IEEE Journal of Selected Topics in Signal Processing. 11, 4, p. 588-604 17 p.

    Research output: Contribution to journalArticle

  142. Multiplicative LSTM for sequence modelling

    Krause, B., Murray, I., Renals, S. & LU, L., 26 Apr 2017, International Conference on Learning Representations - ICLR 2017 - Workshop Track. p. 2872-2880 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  143. User Generated Dialogue Systems: uDialogue

    Tokuda, K., Lee, A., Nankaku, Y., Oura, K., Hashimoto, K., Yamamoto, D., Takumi, I., Uchiya, T., Tsutsumi, S., Renals, S. & Yamagishi, J., 21 Apr 2017, Human-Harmonized Information Technology, Volume 2: Horizontal Expansion. Nishida, T. (ed.). Tokyo: Springer Japan, p. 77-114 38 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter

  144. The SUMMA Platform Prototype

    Liepins, R., Germann, U., Barzdins, G., Birch, A., Renals, S., Weber, S., Kreeft, P. V. D., Bourlard, H., Prieto, J., Klejch, O., Bell, P., Lazaridis, A., Mendes, A., Riedel, S., Almeida, M. S. C., Balage, P., Cohen, S., Dwojak, T., Garner, P., Giefer, A. & 25 others, Junczys-Dowmunt, M., Imrani, H., Nogueira, D., Ali, A., Miranda, S., Popescu-Belis, A., Werlen, L. M., Papasarantopoulos, N., Obamuyide, A., Jones, C., Dalvi, F., Vlachos, A., Wang, Y., Tong, S., Sennrich, R., Pappas, N., Narayan, S., Damonte, M., Durrani, N., Khurana, S., Abdelali, A., Sajjad, H., Vogel, S., Sheppey, D. & Hernon, C., 7 Apr 2017, Proceedings of the EACL 2017 Software Demonstrations. London. UK: Association for Computational Linguistics (ACL), p. 116–119 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  145. Using eigenvoices and nearest-neighbours in HMM-based cross-lingual speaker adaptation with limited data

    Sarfjoo, S. S., Demiroglu, C. & King, S., Apr 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 4, p. 839-851 13 p.

    Research output: Contribution to journalArticle

  146. Statistical normalisation of phase-based feature representation for robust speech recognition

    Loweimi, E., Barker, J. & Hain, T., 1 Mar 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). New Orleans, LA, USA: Institute of Electrical and Electronics Engineers (IEEE), p. 5310-5314 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  147. Blind Speech Segmentation using Spectrogram Image-based Features and Mel Cepstral Coefficients

    Stan, A., Valentini Botinhao, C., Orza, B. & Giurgiu, M., 9 Feb 2017, 2016 IEEE Workshop on Spoken Language Technology. Institute of Electrical and Electronics Engineers (IEEE), p. 597-602 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  148. Median-based generation of synthetic speech durations using a non-parametric approach

    Ronanki, S., Watts, O., King, S. & Henter, G. E., 9 Feb 2017, 2016 IEEE Spoken Language Technology Workshop (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 686-692 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  149. Punctuated Transcription of Multi-genre Broadcasts Using Acoustic and Lexical Approaches

    Klejch, O., Bell, P. & Renals, S., 9 Feb 2017, 2016 IEEE Workshop on Spoken Language Technology. Institute of Electrical and Electronics Engineers (IEEE), p. 433-440 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  150. Recognising emotions in spoken dialogue with hierarchically fused acoustic and lexical features

    Tian, L., Moore, J. & Lai, C., 9 Feb 2017, 2016 IEEE Workshop on Spoken Language Technology. Institute of Electrical and Electronics Engineers (IEEE), p. 565-572 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  151. The MGB-2 Challenge: Arabic Multi-Device Broadcast Media Recognition

    Ali, A., Bell, P., Glass, J., Messaoui, Y., Mubarak, H., Renals, S. & Zhang, Y., 9 Feb 2017, 2016 IEEE Workshop on Spoken Language Technology. Institute of Electrical and Electronics Engineers (IEEE), p. 279-284 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  152. Multitask learning of context-dependent targets in deep neural network acoustic models

    Bell, P., Swietojanski, P. & Renals, S., Feb 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 2, p. 238 - 247 10 p.

    Research output: Contribution to journalArticle

  153. 2016
  154. Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM

    Lorenzo-Trueba, J., Barra-Chicote, R., Gallardo-Antolin, A., Yamagishi, J. & Montero, J. M., 16 Dec 2016, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers. Association for Computational Linguistics (ACL), p. 369-376 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  155. Bidirectional LSTM Networks Employing Stacked Bottleneck Features for Expressive Speech-Driven Head Motion Synthesis

    Haag, K. & Shimodaira, H., 19 Oct 2016, Intelligent Virtual Agents: 16th International Conference, IVA 2016, Los Angeles, CA, USA, September 20--23, 2016, Proceedings. Traum, D., Swartout, W., Khooshabeh, P., Kopp, S., Scherer, S. & Leuski, A. (eds.). Cham: Springer International Publishing, p. 198-207 10 p. (Lecture Notes in Computer Science; vol. 10011).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  156. Investigation of Using Continuous Representation of Various Linguistic Units in Neural Network Based Text-to-Speech Synthesis

    Wang, X., Takaki, S. & Yamagishi, J., 1 Oct 2016, In : IEICE Transactions on Information and Systems. E99.D, 10, p. 2471-2480 10 p.

    Research output: Contribution to journalArticle

  157. The NII speech synthesis entry for Blizzard Challenge 2016

    Juvela, L., Wang, X., Takaki, S., Kim, S., Airaksinen, M. & Yamagishi, J., 16 Sep 2016, Blizzard Challenge workshop 2016. 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  158. A Comparative Study of the Performance of HMM, DNN, and RNN based Speech Synthesis Systems Trained on Very Large Speaker-Dependent Corpora

    Wang, X., Takaki, S. & Yamagishi, J., 15 Sep 2016, Proceedings of 9th ISCA Speech Synthesis Workshop. p. 125-128 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  159. DNN-based Speech Synthesis for Indian Languages from ASCII text

    Ronanki, S., Gangireddy, S. R., Bollepalli, B. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 74-79 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  160. Development of a statistical parametric synthesis system for operatic singing in German

    Pucher, M., Villavicencio, F. & Yamagishi, J., 15 Sep 2016, Proceedings of 9th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 68-73 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  161. Investigating RNN-based speech enhancement methods for noise-robust Text-to-Speech

    Valentini Botinhao, C., Wang, X., Takaki, S. & Yamagishi, J., 15 Sep 2016, Proceedings of 9th ISCA Speech Synthesis Workshop. Sunnyvale, United States, p. 159-165 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  162. Investigating Very Deep Highway Networks for Parametric Speech Synthesis

    Wang, X., Takaki, S. & Yamagishi, J., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 166-171 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  163. Merlin: An Open Source Neural Network Speech Synthesis System

    Wu, Z., Watts, O. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop (2016). p. 202-207 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  164. Multidimensional scaling of systems in the Voice Conversion Challenge 2016

    Wester, M., Wu, Z. & Yamagishi, J., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 38-43 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  165. Parallel and cascaded deep neural networks for text-to-speech synthesis

    Ribeiro, M. S., Watts, O. & Yamagishi, J., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 107-112 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  166. Speaker Adaptation of Various Components in Deep Neural Network based Speech Synthesis

    Takaki, S., Kim, S. & Yamagishi, J., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 167-173 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  167. A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks

    Yoshimura, T., Henter, G. E., Watts, O., Wester, M., Yamagishi, J. & Tokuda, K., 12 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 342-346 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  168. Analysis of the Voice Conversion Challenge 2016 Evaluation Results

    Wester, M., Wu, Z. & Yamagishi, J., 12 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 1637-1641 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  169. Applying Spectral Normalisation and Efficient Envelope Estimation and Statistical Transformation for the Voice Conversion Challenge 2016

    Villavicencio, F., Yamagishi, J., Bonada, J. & Espic, F., 12 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 1661 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  170. Automatic Dialect Detection in Arabic Broadcast Speech

    Ali, A., Dehak, N., Cardinal, P., Khurana, S., Yella, S. H., Glass, J., Bell, P. & Renals, S., 12 Sep 2016, Interspeech 2016. p. 2934-2938 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  171. Automatic Paragraph Segmentation with Lexical and Prosodic Features

    Lai, C., Farrús, M. & Moore, J., 12 Sep 2016, Interspeech 2016. San Francisco, United States, p. 1034-1038 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  172. Enhance the Word Vector with Prosodic Information for the Recurrent Neural Network Based TTS System

    Wang, X., Takaki, S. & Yamagishi, J., 12 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 2856-2860 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  173. Improving Children's Speech Recognition through Out-of-Domain Data Augmentation

    Fainberg, J., Bell, P., Lincoln, M. & Renals, S., 12 Sep 2016, Interspeech 2016. p. 1598-1602 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  174. Just-in-time prepared captioning for live transmissions

    Simpson, M. N., Barrett, J., Bell, P. & Renals, S., 12 Sep 2016, IBC 2016 Conference. Amsterdam, Netherlands: IET, 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  175. Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering

    Juvela, L., Kameok, H., Airaksinen, M., Yamagishi, J. & Alku, P., 12 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 968-972 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  176. Segmental Recurrent Neural Networks for End-to-end Speech Recognition

    Lu, L., Kong, L., Dyer, C., Smith, N. A. & Renals, S., 12 Sep 2016, Proceedings of Interspeech 2016. San Francisco, United States, p. 385-389 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  177. Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition

    Lu, L. & Renals, S., 12 Sep 2016, Proceedings of Interspeech 2016. San Francisco, United States, 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  178. Syllable-Level Representations of Suprasegmental Features for DNN-Based Text-to-Speech Synthesis

    Ribeiro, M. S., Watts, O. & Yamagishi, J., 12 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 3186-3190 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  179. The Voice Conversion Challenge 2016

    Toda, T., Chen, L-H., Saito, D., Villavicencio, F., Wester, M., Wu, Z. & Yamagishi, J., 12 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 1632-1636 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  180. Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition

    Loweimi, E., Barker, J. & Hain, T., 12 Sep 2016, Proc. Interspeech 2016. ISCA, p. 3798-3802 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  181. Waveform generation based on signal reshaping for statistical parametric speech synthesis

    Espic, F., Valentini Botinhao, C., Wu, Z. & King, S., 12 Sep 2016, Interspeech 2016. San Francisco, United States, p. 2263-2267 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  182. GlottDNN - A full-band glottal vocoder for statistical parametric speech synthesis

    Airaksinen, M., Bollepalli, B., Juvela, L., Wu, Z., King, S. & Alku, P., 8 Sep 2016.

    Research output: Contribution to conferencePaper

  183. Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks

    Valentini Botinhao, C., Wang, X., Takaki, S. & Yamagishi, J., 8 Sep 2016, Proceedings of Interspeech 2016. San Francisco, United States, p. 352-356 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  184. The SIWIS Database: A Multilingual Speech Database with Acted Emphasis

    Goldman, J., Honnet, P., Clark, R., Garner, P. N., Ivanova, M., Lazaridis, A., Liang, H., Macedo, T., Pfister, B., Ribeiro, M. S., Wehrli, E. & Yamagishi, J., 8 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 1532-1535 4 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  185. Unsupervised Adaptation of Recurrent Neural Network Language Models

    Gangireddy, S. R., Swietojanski, P., Bell, P. & Renals, S., 8 Sep 2016, Interspeech 2016. p. 2333-2337 5 p. (Interspeech).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  186. Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks

    Juvela, L., Wang, X., Takaki, S., Airaksinen, M., Yamagishi, J. & Alku, P., 8 Sep 2016, Interspeech 2016. International Speech Communication Association, p. 2283-2287 5 p. (International Speech Communication Association).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  187. Differentiable Pooling for Unsupervised Acoustic Model Adaptation

    Swietojanski, P. & Renals, S., Aug 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 10, p. 1773-1784 12 p.

    Research output: Contribution to journalArticle

  188. Paragraph-based Prosodic Cues for Speech Synthesis Applications

    Farrús, M., Lai, C. & Moore, J., Jun 2016, Proceedings of Speech Prosody 2016. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  189. Voice Liveness Detection for Speaker Verification based on a Tandem

    Shiota, S., Villavicencio, F., Yamagishi, J., Ono, N., Echizen, I. & Matsui, T., Jun 2016, Odyssey 2016: The Speaker and Language Recognition Workshop. International Speech Communication Association, p. 259-263 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  190. From HMMs to DNNs: Where Do the Improvements Come From?

    Watts, O., Henter, G. E., Merritt, T., Wu, Z. & King, S., 19 May 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5505-5509 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  191. Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation

    Swietojanski, P., Li, J. & Renals, S., May 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 8, p. 1450-1463 14 p.

    Research output: Contribution to journalArticle

  192. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance

    Wu, Z., De Leon, P., Demiroglu, C., Khodabakhsh, A., King, S., Ling, Z., Saito, D., Stewart, B., Toda, T., Wester, M. & Yamagishi, J., Apr 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 4, p. 768 - 783 17 p.

    Research output: Contribution to journalArticle

  193. SAT-LHUC: Speaker adaptive training for learning hidden unit contributions

    Swietojanski, P. & Renals, S., 1 Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5010-5014 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  194. A deep auto-encoder based low-dimensional feature extraction from FFT spectral envelopes for statistical parametric speech synthesis

    Takaki, S. & Yamagishi, J., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5535-5539 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  195. Deep neural network-guided unit selection synthesis

    Merritt, T., Clark, R., Wu, Z. & Yamagishi, J., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5145-5149 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  196. Initial investigation of speech synthesis based on complex-valued neural networks

    Hu, Q., Richmond, K., Yamagishi, J., Subramanian, K. & Stylianou, Y., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5630 - 5634 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  197. Investigating gated recurrent neural networks for speech synthesis

    Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 1-5 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  198. On training the recurrent neural network encoder-decoder for large vocabulary end-to-end speech recognition

    Lu, L., Zhang, X. & Renals, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5060-5064 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  199. Privacy-preserving sound to degrade automatic speaker verification performance

    Takaki, S. & Yamagishi, J., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5500-5504 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  200. Robust TTS Duration Modelling Using DNNs

    Henter, G., Ronanki, S., Watts, O., Wester, M., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5130-5134 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  201. Testing the Consistency Assumption: Pronunciation Variant Forced Alignment in Read and Spontaneous Speech Synthesis

    Dall, R., Brognaux, S., Richmond, K., Valentini Botinhao, C., Henter, G., Hirschberg, J., Yamagishi, J. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5155-5159 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  202. Wavelet-based decomposition of F0 as a secondary task for DNN-based speech synthesis with multi-task learning

    Ribeiro, M. S., Watts, O., Yamagishi, J. & Clark, R., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5525-5529 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  203. Speech synthesis

    King, S., 25 Feb 2016, Oxford Bibliographies in Linguistics. Aronoff, M. (ed.). New York: Oxford University Press, 29 p.

    Research output: Chapter in Book/Report/Conference proceedingOther chapter contribution

  204. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., 31 Jan 2016, In : Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journalArticle

  205. Evaluating the predictions of objective intelligibility metrics for modified and synthetic speech

    Tang, Y., Cooke, M. & Valentini-Botinhao, C., Jan 2016, In : Computer Speech and Language. 35, p. 73-92 30 p.

    Research output: Contribution to journalArticle

  206. Character-level neural translation for multilingual media monitoring in the SUMMA project

    Barzdins, G., Renals, S. & Gosko, D., 2016, Proceedings of the 10th International Conference on Language Resources and Evaluation, LREC 2016. European Language Resources Association (ELRA), p. 1789-1793 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  207. Evaluating comprehension of natural and synthetic conversational speech

    Wester, M., Watts, O. & Henter, G. E., 2016, Speech Prosody 2016. p. 766-770 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  208. Smooth talking: articulatory join costs for unit selection

    Richmond, K. & King, S., 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5150-5154 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  209. The CSTR entry to the Blizzard Challenge 2016

    Merritt, T., Ronanki, S., Wu, Z. & Watts, O., 2016, Proceedings of Blizzard Challenge 2016. 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  210. Tongue mesh extraction from 3D MRI data of the human vocal tract

    Hewer, A., Wuhrer, S., Steiner, I. & Richmond, K., 2016, Perspectives in Shape Analysis.

    Research output: Chapter in Book/Report/Conference proceedingChapter

  211. 2015
  212. Combining Lightly-supervised Learning and User Feedback to Construct Andimprove a Statistical Parametric Speech Synthesizer for Malay

    Chee Yong, L., Watts, O. & King, S., 15 Dec 2015, In : Research Journal of Applied Sciences, Engineering and Technology. 11, 11, p. 1227-1232 6 p.

    Research output: Contribution to journalArticle

  213. Multi-reference WER for evaluating ASR for languages with no orthographic rule

    Ali, A., Magdy, W., Bell, P. & Renals, S., 13 Dec 2015, Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Institute of Electrical and Electronics Engineers (IEEE), p. 576-580 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  214. On the Efficiency of Recurrent Neural Network Optimization Algorithms

    Krause, B., Lu, L., Murray, I. & Renals, S., Dec 2015, OPT2015 Optimization for Machine Learning at the Neural Information Processing Systems Conference, 2015. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  215. The use of articulatory movement data in speech synthesis applications: An overview — Application of articulatory movements using machine learning algorithms

    Richmond, K., Ling, Z. & Yamagishi, J., 1 Nov 2015, In : Acoustical Science and Technology. 36, 6, p. 467-477 11 p.

    Research output: Contribution to journalArticle

  216. Emotion transplantation through adaptation in HMM-based speech synthesis

    Lorenzo-Trueba, J., Barra-Chicote, R., San-Segundo, R., Ferreiros, J., Yamagishi, J. & Montero, J. M., Nov 2015, In : Computer Speech and Language. 34, 1, p. 292-307 16 p.

    Research output: Contribution to journalArticle

  217. Intelligibility of time-compressed synthetic speech: Compression method and speaking style

    Valentini-Botinhao, C., Toman, M., Pucher, M., Schabus, D. & Yamagishi, J., Nov 2015, In : Speech Communication. 74, p. 52-64 13 p.

    Research output: Contribution to journalArticle

  218. Cue phrases in Spoken Language: Discourse Pragmatics at the Forefront

    Lai, C. & Moore, J., 1 Oct 2015, Proceedings of DiSpol 2015: Identification and Annotation of Discourse Relations in Spoken Language.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  219. 調音運動の機械学習に基づく応用(<小特集>調音運動の計測とその応用)

    Richmond, K., Yamagishi, J. & Ling, Z-H., 1 Oct 2015, In : The Journal of the Acoustical Society of Japan. 71, 10, p. 539–545 7 p.

    Research output: Contribution to journalArticle

  220. Sentence-level control vectors for deep neural network speech synthesis

    Watts, O., Wu, Z. & King, S., 30 Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2217-2221 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  221. Complementary tasks for context-dependent deep neural network acoustic models

    Bell, P. & Renals, S., 11 Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. p. 3610-3614 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  222. Feature-space Speaker Adaptation for Probabilistic Linear Discriminant Analysis Acoustic Models

    Lu, L. & Renals, S., 11 Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. p. 2862-2866 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  223. The NST–GlottHMM entry to the Blizzard Challenge 2015

    Watts, O., Ronanki, S., Wu, Z., Raitio, T. & Suni, A., 11 Sep 2015, Proceedings of Blizzard Challenge 2015. 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  224. A study of speaker adaptation for DNN-based speech synthesis

    Wu, Z., Swietojanski, P., Veaux, C., Renals, S. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  225. Deep neural network context embeddings for model selection in rich-context HMM synthesis

    Merritt, T., Yamagishi, J., Wu, Z., Watts, O. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. Dresden: International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  226. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  227. A Function-wise Pre-training Technique for Constructing a Deep Neural Network based Spectral Model in Statistical Parametric Speech Synthesis

    Takaki, S., Wu, Z. & Yamagishi, J., Sep 2015, First International Workshop on Machine Learning in Spoken Language Processing. 11 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  228. A Study of the Recurrent Neural Network Encoder-Decoder for Large Vocabulary Speech Recognition

    Lu, L., Zhang, X., Cho, K. & Renals, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. p. 3249-3253 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  229. A system for automatic broadcast news summarisation, geolocation and translation

    Bell, P., Lai, C., Llewellyn, C., Birch, A. & Sinclair, M., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. p. 730-731 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  230. ASVspoof 2015: the First Automatic Speaker Verification Spoofing and Countermeasures Challenge

    Wu, Z., Kinnunen, T., Evans, N., Yamagishi, J., Hanilci, C., Sahidullah, M. & Sizov, A., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2037-2041 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  231. Are we using enough listeners? No! An empirically-supported critique of Interspeech 2014 TTS evaluations

    Wester, M., Valentini-Botinhao, C. & Henter, G. E., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 3476-3480 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  232. Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning

    Hu, Q., Wu, Z., Richmond, K., Yamagishi, J., Stylianou, Y. & Maia, R., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 854-858 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  233. Human vs Machine Spoofing Detection on Wideband and Narrowband Data

    Wester, M., Wu, Z. & Yamagishi, J., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 2047-2051 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  234. Influence of speaker familiarity on blind and visually impaired children’s perception of synthetic voices in audio games

    Pucher, M., Toman, M., Schabus, D., Valentini-Botinhao, C., Yamagishi, J., Zillinger, B. & Schmid, E., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 1625-1629 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  235. Prosodically-enhanced Recurrent Neural Network Language Models

    Gangireddy, S. R., Renals, S., Nankaku, Y. & Lee, A., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. p. 2390-2394 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  236. Reconstructing Voices within the Multiple-Average-Voice-Model framework

    Lanchantin, P., Veaux, C., Gales, M. J. F., King, S. & Yamagishi, J., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2232-2236 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  237. Structured Output Layer with Auxiliary Targets for Context-Dependent Acoustic Modelling

    Swietojanski, P., Bell, P. & Renals, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 3605-3609 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  238. Towards automatic detection of reported speech in dialogue using prosodic cues

    Cervone, A., Lai, C., Pareti, S. & Bell, P., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 3061-3065 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  239. Towards minimum perceptual error training for DNN-based speech synthesis

    Valentini-Botinhao, C., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 869-873 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  240. Voice liveness detection algorithms based on pop noise caused by human breath for automatic speaker verification

    Shiota, S., Villavicencio, F., Yamagishi, J., Ono, N., Echizen, I. & Matsui, T., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 239-243 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  241. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  242. A reading list of recent advances in speech synthesis

    King, S., 10 Aug 2015, Proc. 18th International Congress of Phonetic Sciences (ICPhS). T. S. C. F. ICP. . (ed.). Glasgow, UK: University of Glasgow

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  243. Recognising emotions in dialogues with disfluencies and non-verbal vocalisations

    Tian, L., Lai, C. & Moore, J., 8 Aug 2015, Proceedings of The 7th Workshop on Disfluencies in Spontaneous Speech (DiSS 2015). 3 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  244. A Multi-Level Representation of f0 using the Continuous Wavelet Transform and the Discrete Cosine Transform

    Ribeiro, M. S. & Clark, R. A. J., 6 Aug 2015, IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. Brisbane, Australia: Institute of Electrical and Electronics Engineers (IEEE), 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  245. Regularization of context-dependent deep neural networks with context-independent multi-task training

    Bell, P. & Renals, S., 6 Aug 2015, Proc IEEE International Conference on Acoustics, Speech and Signal Processing. Brisbane, QLD, Australia: Institute of Electrical and Electronics Engineers (IEEE), p. 4290-4294 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  246. Multi-Reference Evaluation for Dialectal Speech Recognition System: A Study for Egyptian ASR

    Ali, A., Magdy, W. & Renals, S., 1 Aug 2015, Proceedings of the Second Workshop on Arabic Natural Language Processing. Association for Computational Linguistics, p. 118-126 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  247. Open Challenges in Modelling, Analysis and Synthesis of Human Behaviour in Human--Human and Human--Machine Interactions

    Vinciarelli, A., Esposito, A., André, E., Bonin, F., Chetouani, M., Cohn, J. F., Cristani, M., Fuhrmann, F., Gilmartin, E., Hammal, Z., Heylen, D., Kaiser, R., Koutsombogera, M., Potamianos, A., Renals, S., Riccardi, G. & Salah, A. A., Aug 2015, In : Cognitive Computation. 7, 4, p. 397-413 17 p.

    Research output: Contribution to journalArticle

  248. A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis

    Chen, L-H., Raitio, T., Valentini-Botinhao, C., Ling, Z-H. & Yamagishi, J., Jul 2015, In : IEEE Transactions on Audio, Speech and Language Processing. 99, 13 p.

    Research output: Contribution to journalArticle

  249. Constructing a Deep Neural Network based Spectral Model for Statistical Speech Synthesis

    Takaki, S. & Yamagishi, J., 17 Jun 2015, International Conference on NONLINEAR SPEECH PROCESSING 2015. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  250. Emotion Recognition from the Speech Signal by Effective Combination of Generative and Discriminative Models

    Loweimi, E., Doulaty, M., Barker, J. & Hain, T., 1 Jun 2015, USES 2015 - The University of Sheffield Engineering Symposium. 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  251. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

    Poblete, V., Espic, F., King, S., Stem, R. M., Huenupan, F., Fredes, J. & Yoma, N. B., May 2015, In : Computer Speech and Language. 31, 1, p. 1-27 27 p.

    Research output: Contribution to journalArticle

  252. Recognizing emotions in dialogue with disfluences and non-verbal vocalisations

    Tian, L., Lai, C. & Moore, J., 14 Apr 2015, Proceedings of The 4th Interdisciplinary Workshop on Laughter and other Non-Verbal Vocalisations in Speech. p. 39-41 3 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  253. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis.

    Wu, Z., Valentini-Botinhao, C., Watts, O. & King, S., 1 Apr 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, Australia, p. 4460-4464 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  254. Spoofing and countermeasures for speaker verification: A survey

    Wu, Z., Evans, N., Kinnunen, T., Yamagishi, J., Alegre, F. & Li, H., Feb 2015, In : Speech Communication. 66, p. 130-153 24 p.

    Research output: Contribution to journalArticle

  255. Soft context clustering for F0 modeling in HMM-based speech synthesis

    Khorram, S., Sameti, H. & King, S., 9 Jan 2015, In : EURASIP Journal on Advances in Signal Processing. 2015, 1

    Research output: Contribution to journalArticle

  256. A perceptual investigation of wavelet-based decomposition of f0 for text-to-speech synthesis

    Ribeiro, M. S., Yamagishi, J. & Clark, R. A. J., 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1586-1590 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  257. A statistical shape space model of the palate surface trained on 3D MRI scans of the vocal tract

    Hewer, A., Steiner, I., Bolkart, T., Wuhrer, S. & Richmond, K., 2015, Proceedings of ICPhS 2015. University of Glasgow, 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  258. A system for automatic alignment of broadcast media captions using weighted finite-state transducers

    Bell, P. & Renals, S., 2015, 2015 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2015). Institute of Electrical and Electronics Engineers (IEEE), p. 675-680 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  259. Differentiable pooling for unsupervised speaker adaptation

    Swietojanski, P. & Renals, S., 2015, Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  260. Emotion Recognition in Spontaneous and Acted Dialogues

    Tian, L., Moore, J. & Lai, C., 2015, Affective Computing and Intelligent Interaction (ACII), 2015 International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 698 - 704 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  261. Knowledge versus data in TTS: evaluation of a continuum of synthesis systems

    Kay, R., Watts, O., Barra-Chicote, R. & Mayo, C., 2015, INTERSPEECH 2015, 16th Annual Conference of the International Speech Communication Association, Dresden, Germany, September 6-10, 2015. p. 3335-3339 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  262. Long-Term Statistical Feature Extraction from Speech Signal and Its Application in Emotion Recognition

    Loweimi, E., Doulaty, M., Barker, J. & Hain, T., 2015, Statistical Language and Speech Processing: SLSP 2015. Dediu, A-H., Martín-Vide, C. & Vicsi, K. (eds.). Springer, Cham, p. 173-184 12 p. (Lecture Notes in Computer Science; vol. 9449).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  263. Methods for Applying Dynamic Sinusoidal Models to Statistical Parametric Speech Synthesis

    Hu, Q., Stylianou, Y., Maia, R., Richmond, K. & Yamagishi, J., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 4889-4893 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  264. Modelling acoustic feature dependencies with artificial neural networks: Trajectory-RNADE

    Uria, B., Murray, I., Renals, S., Valentini-Botinhao, C. & Bridle, J., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on . IEEE Signal Processing Society Press, p. 4465-4469 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  265. Multi-frame factorisation for long-span acoustic modelling

    Lu, L. & Renals, S., 2015, Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  266. Multiple Feed-forward Deep Neural Networks for Statistical Parametric Speech Synthesis

    Takaki, S., Kim, S., Yamagishi, J. & Kim, J., 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2242-2246 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  267. Recognizing emotions in dialogues with acoustic and lexical features

    Tian, L., Moore, J. & Lai, C., 2015, Affective Computing and Intelligent Interaction (ACII), 2015 International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 737-742 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  268. SAS: A Speaker Verification Spoofing Database Containing Diverse Attacks

    Wu, Z., Khodabakhsh, A., Demiroglu, C., Yamagishi, J., Saito, D., Toda, T. & King, S., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on . Institute of Electrical and Electronics Engineers (IEEE), p. 4440-4444 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  269. Source-filter Separation of Speech Signal in the Phase Domain

    Loweimi, E., Barker, J. & Hain, T., 2015, Proc. Interspeech 2015. ISCA, p. 598-602 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  270. Spoofing and Anti-Spoofing: A Shared View of Speaker Verification, Speech Synthesis and Voice Conversion

    Wu, Z., Kinnunen, T., Evans, N. & Yamagishi, J., 2015. 3 p.

    Research output: Contribution to conferenceOther

  271. The MGB Challenge: Evaluating Multi-Genre Broadcast Media Recognition

    Bell, P., Gales, MJF., Hain, T., Kilgour, J., Lanchantin, P., Liu, X., McParland, A., Renals, S., Saz, O., Wester, M. & Woodland, PC., 2015, Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on. Institute of Electrical and Electronics Engineers (IEEE), p. 687 - 693 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  272. The University of Edinburgh Speaker Personality and MoCap Dataset

    Haag, K. & Shimodaira, H., 2015, FAA '15 Proceedings of the Facial Analysis and Animation. New York, NY, USA: ACM, 2 p. 8

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  273. 音声の障がい者のための最先端音声合成技術

    Yamagishi, J., 2015, In : Journal of Information Processing and Management (Joho Kanri). 57, 12, p. 882-889 8 p.

    Research output: Contribution to journalArticle

  274. 2014
  275. Tied Probabilistic Linear Discriminant Analysis for Speech Recognition

    Lu, L. & Renals, S., 30 Nov 2014, (Submitted) 5 p.

    Research output: Working paper

  276. Translation and Prosody in Swiss Languages

    Garner, P. N., Clark, R., Goldman, J-P., Honnet, P-E., Ivanova, M., Lazaridis, A., Lang, H., Pfister, B., Ribeiro, M. S., Wehrli, E. & Yamagishi, J., 30 Sep 2014, Nouveaux cahiers de linguistique francaise. p. 1-12 12 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  277. The Simple4All entry to the Blizzard Challenge 2014

    Suni, A., Raitio, T., Gowda, D., Karhila, R., Gibson, M. & Watts, O., 1 Sep 2014, Proc. Blizzard Challenge 2014. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  278. Using linguistic predictability and the Lombard effect to increase the intelligibility of synthetic speech in noise

    Valentini-Botinhao, C. & Wester, M., 1 Sep 2014, Proc. Interspeech. p. 2063-2067 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  279. Voice source modelling using deep neural networks for statistical parametric speech synthesis

    Raitio, T., Lu, H., Kane, J., Suni, A., Vainio, M., King, S. & Alku, P., 1 Sep 2014, European Signal Processing Conference. European Signal Processing Conference, EUSIPCO, p. 2290-2294 5 p. 6952838

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  280. An investigation of the application of dynamic sinusoidal models to statistical parametric speech synthesis

    Hu, Q., Stylianou, Y., Maia, R., Richmond, K., Yamagishi, J. & Latorre, J., Sep 2014, Interspeech 2014. p. 780-784 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  281. DNN-based stochastic postfilter for HMM-based speech synthesis

    Chen, L-H., Raitio, T., Valentini-Botinhao, C., Yamagishi, J. & Ling, Z-H., Sep 2014, Interspeech 2014. Singapore: International Speech Communication Association, p. 1954-1958 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  282. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In : Computer Speech and Language. 28, 5, p. 1083–1114 32 p.

    Research output: Contribution to journalArticle

  283. Intelligibility Enhancement of Speech in Noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., Sep 2014, Proceedings of the Institute of Acoustics 2014. Vol. 36. 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  284. Intelligibility analysis of fast synthesized speech

    Valentini-Botinhao, C., Toman, M., Pucher, M., Schabus, D. & Yamagishi, J., Sep 2014, Interspeech. Singapore: International Speech Communication Association, 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  285. Compression of Model-based Group Delay Function for Robust Speech Recognition

    Loweimi, E., Barker, J. & Hain, T., 1 Jun 2014, 2 p.

    Research output: Other contribution

  286. Probabilistic Linear Discriminant Analysis for Acoustic Modeling

    Lu, L. & Renals, S., 1 Jun 2014, In : IEEE Signal Processing Letters. 21, 6, p. 702-706 5 p.

    Research output: Contribution to journalArticle

  287. Multiple-average-voice-based speech synthesis

    Lanchantin, P., Gales, M. J. F., King, S. & Yamagishi, J., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 285-289 5 p. 6853603

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  288. Neural net word representations for phrase-break prediction without a part of speech tagger

    Watts, O., Gangireddy, S., Yamagishi, J., King, S., Renals, S., Stan, A. & Giurgiu, M., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 2599-2603 5 p. 6854070

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  289. RSS-TOBI - a Prosodically Enhanced Romanian Speech Corpus

    Boroș, T., Stan, A., Watts, O. & Dumitrescu, S. D., 1 May 2014, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  290. Speech driven talking head from estimated articulatory features

    Ben-Youssef, A., Shimodaira, H. & Braude, D. A., 1 May 2014, Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 4573-4577 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  291. A fixed dimension and perceptually based dynamic sinusoidal model of speech

    Hu, Q., Stylianou, Y., Richmond, K., Maia, R., Yamagishi, J. & Latorre, J., May 2014, Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on . Institute of Electrical and Electronics Engineers (IEEE), p. 6311-6315

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  292. A Generative Model for User Simulation in a Spatial Navigation Domain

    Eshky, A., Allison, B., Ramamoorthy, S. & Steedman, M., 1 Apr 2014, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics. Gothenburg, Sweden: Association for Computational Linguistics, p. 626-635 10 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  293. Combining Vocal Tract Length Normalization With Hierarchical Linear Transformations

    Saheer, L., Yamagishi, J., Garner, P. N. & Dines, J., 1 Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 262-272 11 p.

    Research output: Contribution to journalArticle

  294. Glottal spectral separation for speech synthesis

    Cabral, J., Richmond, K., Yamagishi, J. & Renals, S., Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 195-208

    Research output: Contribution to journalArticle

  295. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journalEditorial

  296. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In : Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journalArticle

  297. Introduction to the Special Issue on The listening talker: Context-dependent speech production and perception

    Cooke, M., King, S., Kleijn, W. B. & Stylianou, Y., Mar 2014, In : Computer Speech and Language. 28, 2, p. 540-542

    Research output: Contribution to journalArticle

  298. The listening talker: A review of human and algorithmic context-induced modifications of speech

    Cooke, M., King, S., Garnier, M. & Aubanel, V., Mar 2014, In : Computer Speech and Language. 28, 2, p. 543-571 29 p.

    Research output: Contribution to journalLiterature review

  299. Measuring a decade of progress in Text-to-Speech

    King, S., Jan 2014, In : Loquens. 1, 1, e006.

    Research output: Contribution to journalArticle

  300. Statistical parametric speech synthesis for Ibibio

    Ekpenyong, M., Urua, E-A., Watts, O., King, S. & Yamagishi, J., Jan 2014, In : Speech Communication. 56, p. 243-251 9 p.

    Research output: Contribution to journalArticle

  301. A Semi-Markov Model for Speech Segmentation with an Utterance-Break Prior

    Sinclair, M., Bell, P., Birch, A. & McInnes, F., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2351-2355 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  302. Anti-Spoofing: Voice Databases

    Alegre, F., Evans, N., Kinnunen, T., Wu, Z. & Yamagishi, J., 2014, Encyclopedia of Biometrics. Li, S. Z. & Jain, A. K. (eds.). Springer US, p. 1-7 7 p.

    Research output: Chapter in Book/Report/Conference proceedingEntry for encyclopedia/dictionary

  303. Automated Production of True-Cased Punctuated Subtitles for Weather and News Broadcasts

    Driesen, J., Birch, A., Grimsey, S., Safarfashandi, S., Gauthier, J., Simpson, M. & Renals, S., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2146-2147 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  304. Convolutional Neural Networks for Distant Speech Recognition

    Swietojanski, P., Ghoshal, A. & Renals, S., 2014, In : IEEE Signal Processing Letters. 21, 9, p. 1120-1124 5 p.

    Research output: Contribution to journalArticle

  305. Cross-Lingual Adaptation with Multi-Task Adaptive Networks

    Bell, P., Driesen, J. & Renals, S., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 21-25 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  306. Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition

    Lu, L., Ghoshal, A. & Renals, S., 2014, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 22, 1, p. 17-27 11 p.

    Research output: Contribution to journalArticle

  307. Detecting Attribution Relations in Speech: a Corpus Study

    Cervone, A., Pareti, S., Bell, P., Prodanof, I. & Caselli, T., 2014, Proc. Italian Conference on Computational Linguistics. p. 103-107 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  308. Development of a Genre-Dependent TTS System with Cross-Speaker Speaking-Style Transplantation

    Lorenzo-Trueba, J., Echeverry-Correa, J. D., Barra-Chicote, R., San-Segundo, R., Ferreiros, J., Gallardo-Antolin, A., Yamagishi, J., King, S. & Montero, J. M., 2014, 2nd International Workshop on Speech, Language and Audio in Multimedia (SLAM2014). International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  309. Feed Forward Pre-training for Recurrent Neural Network Language Models

    Gangireddy, S. R., McInnes, F. & Renals, S., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2620-2624 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  310. Generating Segmental Foreign Accent

    Lecumberri, M. L. G., Barra-Chicote, R., Ruben Perez, R., Yamagishi, J. & Cooke, M., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1302-1306 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  311. Incorporating Lexical and Prosodic Information at Different Levels for Meeting Summarization

    Lai, C. & Renals, S., 2014, Proceedings of Interspeech 2014. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  312. Interpreting Final Rises: Task and Role Factors

    Lai, C., 2014, Proceedings of Speech Prosody 7.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  313. Investigating Automatic & Human Filled Pause Insertion for Speech Synthesis

    Dall, R., Tomalin, M., Wester, M., Byrne, W. & King, S., 2014, Proc. Interspeech. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  314. Learning Hidden Unit Contributions for Unsupervised Speaker Adaptation of Neural Network Acoustic Models

    Swietojanski, P. & Renals, S., 2014, Spoken Language Technology Workshop (SLT), 2014 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 171-176 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  315. Measuring the Perceptual Effects of Modelling Assumptions in Speech Synthesis Using Stimuli Constructed from Repeated Natural Speech

    Henter, G. E., Merritt, T., Shannon, M., Mayo, C. & King, S., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1504-1508 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  316. Neural networks for distant speech recognition

    Renals, S. & Swietojanski, P., 2014, Proceedings 2014 Workshop on Hands-Free Speech Communication and Microphone Arrays . Institute of Electrical and Electronics Engineers (IEEE), p. 172-176 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  317. Probabilistic Linear Discriminant Analysis with Bottleneck Features for Speech Recognition

    Lu, L. & Renals, S., 2014, INTERSPEECH-2014. International Speech Communication Association, p. 910-914 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  318. ROCKIT: Roadmap for Conversational Interaction Technologies

    Renals, S., Carletta, J., Edwards, K., Bourlard, H., Garner, P. N., Popescu-Belis, A., Klakow, D., Girenko, A., Petukhova, V., Wacker, P., Joscelyne, A., Kompis, C., Aliwell, S., Stevens, W. & Sabbah, Y., 2014, RFMIR '14 Proceedings of the 2014 Workshop on Roadmapping the Future of Multimodal Interaction Research including Business Opportunities and Challenges. p. 39-42 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  319. Rating Naturalness in Speech Synthesis: The Effect of Style and Expectation

    Dall, R., Yamagishi, J. & King, S., 2014, Speech Prosody 2014.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  320. Speaker Recognition Anti-spoofing

    Evans, N., Kinnunen, T., Yamagishi, J., Wu, Z., Alegre, F. & Leon, P. D., 2014, Handbook of Biometric Anti-Spoofing. Marcel, S., Nixon, M. S. & Li, S. Z. (eds.). Springer London, p. 125-146 22 p. (Advances in Computer Vision and Pattern Recognition).

    Research output: Chapter in Book/Report/Conference proceedingChapter

  321. The UEDIN ASR Systems for the IWSLT 2014 Evaluation

    Bell, P., Swietojanski, P., Driesen, J., Sinclair, M., McInnes, F. & Renals, S., 2014, 11th International Workshop on Spoken Language Translation (IWSLT 2014). 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  322. Towards Cross-Lingual Emotion Transplantation

    Lorenzo-trueba, J., Barra-chicote, R., Yamagishi, J. & Montero, J. M., 2014, Advances in Speech and Language Technologies for Iberian Languages: Second International Conference, IberSPEECH 2014, Las Palmas de Gran Canaria, Spain, November 19-21, 2014. Proceedings. Springer International Publishing, p. 199-208 10 p. Chapter 21. (Lecture Notes in Computer Science; vol. 8854).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  323. Unsupervised lexical clustering of speech segments using fixed dimensional acoustic embeddings

    Kamper, H., Jansen, A., King, S. & Goldwater, S., 2014, Proceedings of the IEEE Spoken Language Technology Workshop. Institute of Electrical and Electronics Engineers (IEEE), 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  324. Word-Level Emotion Recognition Using High-Level Features

    Moore, J. D., Tian, L. & Lai, C., 2014, Computational Linguistics and Intelligent Text Processing: 15th International Conference, CICLing 2014, Kathmandu, Nepal, April 6-12, 2014, Proceedings, Part II. Gelbukh, A. (ed.). Springer Berlin Heidelberg, p. 17-31 15 p. (Lecture Notes in Computer Science; vol. 8404).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  325. 2013
  326. Lightly supervised automatic subtitling of weather forecasts

    Driesen, J. & Renals, S., 1 Dec 2013, Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. Institute of Electrical and Electronics Engineers (IEEE), p. 452-457 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  327. Cross-Lingual Automatic Speech Recognition Using Tandem Features

    Lal, P. & King, S., Dec 2013, In : IEEE Transactions on Audio, Speech and Language Processing. 21, 12, p. 2506-2515 10 p.

    Research output: Contribution to journalArticle

  328. Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup

    Geng, C., Turk, A., Scobbie, J. M., Macmartin, C., Hoole, P., Richmond, K., Wrench, A., Pouplier, M., Bard, E., Campbell, Z., Dickie, C., Dubourg, E., Hardcastle, W., Kainada, E., King, S., Lickley, R., Nakai, S., Renals, S., White, K. & Wiegand, R., Nov 2013, In : Journal of Phonetics. 41, 6, p. 421-431 11 p.

    Research output: Contribution to journalArticle

  329. The voice bank corpus: Design, collection and data analysis of a large regional accent speech database

    Veaux, C., Yamagishi, J. & King, S., Nov 2013, Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference. Institute of Electrical and Electronics Engineers (IEEE), 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  330. Recognition of overlapping speech using digital MEMS microphone arrays

    Zwyssig, E., Faubel, F., Renals, S. & Lincoln, M., 21 Oct 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. p. 7068-7072 5 p. 6639033

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  331. Where are the challenges in speaker diarization?

    Sinclair, M. & King, S., 21 Oct 2013, IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013. Institute of Electrical and Electronics Engineers (IEEE), p. 7741-7745 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  332. Building personalised synthetic voices for individuals with severe speech impairment

    Creer, S., Cunningham, S., Green, P. & Yamagishi, J., Sep 2013, In : Computer Speech and Language. 27, 6, p. 1178-1193 16 p.

    Research output: Contribution to journalArticle

  333. Head Motion Analysis and Synthesis over Different Tasks

    Ben Youssef, A., Shimodaira, H. & Braude, D. A., Sep 2013, Intelligent Virtual Agents: 13th International Conference, IVA 2013, Edinburgh, UK, August 29-31, 2013. Proceedings. Aylett, R., Krenn, B., Pelachaud, C. & Shimodaira, H. (eds.). Springer-Verlag GmbH, p. 285-294 10 p. (Lecture Notes in Computer Science; vol. 8108).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  334. Joint Uncertainty Decoding for Noise Robust Subspace Gaussian Mixture Models

    Lu, L., Chin, K. K., Ghoshal, A. & Renals, S., Sep 2013, In : IEEE Transactions on Audio, Speech and Language Processing. 21, 9, p. 1791-1804 14 p.

    Research output: Contribution to journalArticle

  335. Mage-HMM-based speech synthesis reactively controlled by the articulators

    Astrinaki, M., Moinet, A., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dutoit, T., Sep 2013, 8th ISCA Speech Synthesis Workshop. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  336. Spoofing and countermeasures for automatic speaker verification

    Evans, N. W. D., Kinnunen, T. & Yamagishi, J., 25 Aug 2013, INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  337. Articulatory features for speech-driven head motion synthesis

    Ben Youssef, A., Shimodaira, H. & Braude, D. A., 1 Aug 2013, Proc. Interspeech. p. 2758-2762 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  338. Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis

    Lu, H., King, S. & Watts, O., 1 Aug 2013, 8th ISCA Speech Synthesis Workshop. p. 261-265 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  339. Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech

    Christensen, H., Aniol, M., Bell, P., Green, P., Hain, T., King, S. & Swietojanski, P., 1 Aug 2013, Proc. Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  340. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Stylianou, Y., 1 Aug 2013, Interspeech 2013.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  341. Investigating the shortcomings of HMM synthesis

    Merritt, T. & King, S., 1 Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 185-190 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  342. Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data

    Stan, A., Bell, P., Yamagishi, J. & King, S., 1 Aug 2013, Proc Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  343. Noise adaptive training for subspace Gaussian mixture models

    Lu, L., Ghoshal, A. & Renals, S., 1 Aug 2013, Proceedings of Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  344. Template-Warping Based Speech Driven Head Motion Synthesis

    Braude, D. A., Shimodaira, H. & Ben Youssef, A., 1 Aug 2013, Interspeech 2013: 14th Annual Conference of the International Speech Communication Association. p. 2763-2767 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  345. The Edinburgh Speech Production Facility DoubleTalk Corpus

    Scobbie, J., Turk, A., Geng, C., King, S., Lickley, R. & Richmond, K., 1 Aug 2013, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  346. The Simple4All entry to the Blizzard Challenge 2013

    Watts, O., Stan, A., Mamiya, Y., Suni, A., Burgos, J. M. & Montero, J. M., 1 Aug 2013, Proc. Blizzard Challenge 2013. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  347. An experimental comparison of multiple vocoder types

    Hu, Q., Richmond, K., Yamagishi, J. & Latorre, J., Aug 2013, 8th ISCA Workshop on Speech Synthesis: Barcelona, Spain. p. 155-160 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  348. Gesture Control of HMM-Based Singing Voice Synthesis

    Veaux, C., Astrinaki, M., Oura, K., Clark, R. & Yamagishi, J., Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 247-248 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  349. Intelligibility-enhancing speech modifications: the Hurricane Challenge

    Cooke, M., Mayo, C. & Valentini-Botinhao, C., Aug 2013, Interspeech. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  350. Mage - Reactive articulatory feature control of HMM-based parametric speech synthesis

    Astrinaki, M., Moinet, A., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dutoit, T., Aug 2013, 8th ISCA Workshop on Speech Synthesis: Barcelona, Spain. p. 227-231 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  351. On the Evaluation of Inversion Mapping Performance in the Acoustic Domain

    Richmond, K., Ling, Z., Yamagishi, J. & Ur?a, B., Aug 2013, Proc. Interspeech. p. 1012-1016 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  352. Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project

    Bourlard, H., Ferras, M., Pappas, N., Popescu-Belis, A., Renals, S., McInnes, F., Bell, P., Ingram, S. & Guillemot, M., Aug 2013, Proceedings of SLAM 2013 (First Workshop on Speech, Language and Audio in Multimedia). CEUR Workshop Proceedings, p. 3-8 6 p. (CEUR Workshop Proceedings; vol. 1012).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  353. Reactive accent interpolation through an interactive map application

    Astrinaki, M., Yamagishi, J., King, S., d'Alessandro, N. & Dutoit, T., Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 245-246 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  354. Towards Speaking Style Transplantation in Speech Synthesis

    Lorenzo-Trueba, J., Barra-Chicote, R., Yamagishi, J., Watts, O. & Montero, J. M., Aug 2013, 8th ISCA Workshop on Speech Synthesis - Barcelona, Spain. ISCA, p. 159-163 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  355. Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from `found' data: evaluation and analysis

    Watts, O., Stan, A., Clark, R., Mamiya, Y., Giurgiu, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Speech Synthesis Workshop: Barcelona, Spain. ISCA-INST SPEECH COMMUNICATION ASSOC, p. 101-106 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  356. Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments

    Mamiya, Y., Stan, A., Yamagishi, J., Bell, P., Watts, O., Clark, R. & King, S., Aug 2013, Proc. 8th ISCA Speech Synthesis Workshop. p. 61-66 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  357. Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise

    Valentini-Botinhao, C., Wester, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Workshop on Speech Synthesis. p. 133-138 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  358. Discriminative Tandem Features for HMM-based EEG Classification

    Ting, C-M., King, S., Salleh, S-H. & Ariff, A. K., 1 Jul 2013, Proc. 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 13). IEEE Engineering in Medicine and Biology Society, Vol. 2013. p. 3957-3960

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  359. A new phase-based feature representation for robust speech recognition

    Loweimi, E., Ahadi, S. M. & Drugman, T., 1 May 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. Institute of Electrical and Electronics Engineers (IEEE), p. 7155-7159 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  360. Evaluating the intelligibility benefit of speech modifications in known noise conditions

    Cooke, M., Tang, Y., Mayo, C., Valentini-Botinhao, C., Stylianou, Y. & Sauert, B., 1 May 2013, In : Speech Communication. 55, 4, p. 572-585 14 p.

    Research output: Contribution to journalArticle

  361. Grapheme and multilingual posterior features for under-resourced speech recognition: a study on Scottish Gaelic

    Rasipuram, R., Bell, P. & Magimai-Doss, M., 1 May 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7334-7338 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  362. Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods.

    Valentini-Botinhao, C., Godoy, E., Stylianou, Y., Sauert, B., King, S. & Yamagishi, J., May 2013, Proc. ICASSP - Vancouver, Canada.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  363. Speech Synthesis Based on Hidden Markov Models

    Tokuda, K., Nankaku, Y., Toda, T., Zen, H., Yamagishi, J. & Oura, K., May 2013, In : Proceedings of the IEEE. 101, 5, p. 1234-1252 19 p.

    Research output: Contribution to journalArticle

  364. Grapheme-to-phoneme conversion methods for minority language conditions

    Cao, M., Renals, S., Bell, P., Li, A. & Fang, Q., 1 Feb 2013, Proceedings of the 2012 International Conference on Speech Database and Assessments, Oriental COCOSDA 2012. Institute of Electrical and Electronics Engineers (IEEE), p. 151-156 6 p. 6422470

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  365. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

    Dines, J., Liang, H., Saheer, L., Gibson, M., Byrne, W., Oura, K., Tokuda, K., Yamagishi, J., King, S., Wester, M., Hirsimäki, T., Karhila, R. & Kurimo, M., Feb 2013, In : Computer Speech and Language. 27, 2, p. 420-437 18 p.

    Research output: Contribution to journalArticle

  366. Articulatory Control of HMM-based Parametric Speech Synthesis using Feature-Space-Switched Multiple Regression

    Ling, Z., Richmond, K. & Yamagishi, J., Jan 2013, In : IEEE Transactions on Audio, Speech and Language Processing. 21, 1, p. 207-219 13 p.

    Research output: Contribution to journalArticle

  367. A distortion-weighted glimpse-based intelligibility metric for modified and synthetic speech

    Tang, Y., Cooke, M. & Valentini-Botinhao, C., 2013, Proc. SPIN.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  368. A lecture transcription system combining neural network acoustic and language models

    Bell, P., Yamamoto, H., Swietojanski, P., Wu, Y., McInnes, F., Hori, C. & Renals, S., 2013, In Proc. Interspeech 2013.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  369. Acoustic Data-driven Pronunciation Lexicon for Large Vocabulary Speech Recognition

    Lu, L., Ghoshal, A. & Renals, S., 2013, Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. Institute of Electrical and Electronics Engineers (IEEE)

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  370. Applying rhythm metrics to non-native spontaneous speech

    Lai, C., Evanini, K. & Zechner, K., 2013, Proceedings of SLaTE 2013.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  371. Automatic Transcription of Multi-genre Media Archives

    Lanchantin, P., Bell, P., Gales, M., Hain, T., Liu, X., Long, Y., Quinnell, J., Renals, S., Saz, O., Seigel, M., Swietojanski, P. & Woodland, P., 2013, SLAM 2013 Speech, Language and Audio in Multimedia: Proceedings of the First Workshop on Speech, Language and Audio in Multimedia. CEUR Workshop Proceedings, Vol. 1012. p. 26-31 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  372. Description of the UEDIN System for German ASR

    Driesen, J., Bell, P., Sinclair, M. & Renals, S., 2013, Proceedings of the 10th International Workshop on Spoken Language Translation (IWSLT 2013). 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  373. Detecting Summarization Hot Spots in Meetings Using Group Level Involvement and Turn-Taking Features

    Lai, C., Carletta, J. & Renals, S., 2013, Proceedings of Interspeech 2013. ISCA, p. 2723-2727 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  374. Evaluation of a Transplantation Algorithm for Expressive Speech Synthesis

    Lorenzo-Trueba, J., Barra-Chicote, R., Yamagishi, J., Watts, O. & Montero, J. M., 2013, Proccedings of Workshop en Tecnologicas Accesibles, IV Congreso Espanol de Informatica CEDI2013. 10 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  375. Factorized context modelling for Text-to-Speech synthesis

    Lu, H. & King, S., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7849-7853 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  376. Hybrid acoustic models for distant and multichannel large vocabulary speech recognition

    Swietojanski, P., Ghoshal, A. & Renals, S., 2013, Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on. Institute of Electrical and Electronics Engineers (IEEE), p. 285-290 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  377. Lightly supervised GMM VAD to use audiobook for speech synthesiser

    Mamiya, Y., Yamagishi, J., Watts, O., Clark, R. A. J., King, S. & Stan, A., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7987-7991 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  378. Modelling Participant Affect in Meetings with Turn-Taking Features

    Lai, C., Carletta, J. & Renals, S., 2013, Proceedings of WASSS 2013.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  379. Multi-level adaptive networks in tandem and hybrid ASR systems

    Bell, P., Swietojanski, P. & Renals, S., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 6975-6979 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  380. Multilingual training of deep neural networks

    Ghoshal, A., Swietojanski, P. & Renals, S., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7319-7323 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  381. On the Importance of Pre-emphasis and Window Shape in Phase-Based Speech Recognition

    Loweimi, E., Ahadi, S. M., Drugman, T. & Loveymi, S., 2013, Advances in Nonlinear Speech Processing: 6th International Conference, NOLISP 2013, Mons, Belgium, June 19-21, 2013. Proceedings. Drugman, T. & Dutoit, T. (eds.). Berlin, Heidelberg: Springer Berlin Heidelberg, p. 160-167 8 p. (Lecture Notes in Computer Science; vol. 7911).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  382. Real-time control of expressive speech synthesis using kinect body tracking

    Veaux, C., Astrinaki, M., Oura, K., Clark, R. A. J. & Yamagishi, J., 2013, The Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, Barcelona, Spain, August 31-September 2, 2013. p. 247-248 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  383. Revisiting Hybrid and GMM-HMM system combination techniques

    Swietojanski, P., Ghoshal, A. & Renals, S., 2013, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 6744-6748 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  384. Socio-material design for computer mediated social sensemaking

    Hartswood, M., Anderson, S., Wolters, M., Pagliari, C. & Renals, S., 2013, CHI’13 CHI Conference on Human Factors in Computing Systems. ACM, (Proc. CHI Workshop on Explorations in Social Interaction Design).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  385. Speech animation using electromagnetic articulography as motion capture data

    Steiner, I., Richmond, K. & Ouni, S., 2013, Proc. 12th International Conference on Auditory-Visual Speech Processing. p. 55-60 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  386. TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

    Stan, A., Watts, O., Mamiya, Y., Giurgiu, M., Clark, R. A. J., Yamagishi, J. & King, S., 2013, INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association: Lyon, France, August 25-29, 2013. ISCA-INST SPEECH COMMUNICATION ASSOC, p. 2331-2335 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  387. The UEDIN English ASR System for the IWSLT 2013 Evaluation

    Bell, P., McInnes, F., Gangireddy, S. R., Sinclair, M., Birch, A. & Renals, S., 2013, Proceedings of the 10th International Workshop on Spoken Language Translation (IWSLT 2013). 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  388. The University of Edinburgh Head-Motion and Audio Storytelling (UoE-HAS) Dataset

    Braude, D. A., Shimodaira, H. & Ben Youssef, A., 2013, Proc. of Intelligent Virtual Agents. p. 466-467 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  389. Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction

    Veaux, C., Yamagishi, J. & King, S., 2013, SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies. ISCA, p. 107-111 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  390. 2012
  391. Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation

    Yang, C-Y., Brown, G., Lu, L., Yamagishi, J. & King, S., 4 Dec 2012, Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on. Institute of Electrical and Electronics Engineers (IEEE), p. 220-223 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  392. Reactive Control of Expressive Speech Synthesis Using Kinect Skeleton Tracking

    Clark, R., Konkiewicz, M. A., Astrinaki, M. & Yamagishi, J., 1 Dec 2012, In : IEICE technical report. Speech. 112, 369, p. 175-178 4 p.

    Research output: Contribution to journalArticle

  393. Evaluation of Speaker Verification Security and Detection of HMM-Based Synthetic Speech

    De Leon, P. L., Pucher, M., Yamagishi, J., Hernaez, I. & Saratxaga, I., Oct 2012, In : IEEE Transactions on Audio, Speech and Language Processing. 20, 8, p. 2280-2290 11 p.

    Research output: Contribution to journalArticle

  394. Analysis of Speaker Clustering Strategies for HMM-Based Speech Synthesis

    Dall, R., Veaux, C., Yamagishi, J. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  395. Automatic detection of sigmatism in children

    Valentini-Botinhao, C., Degenkolb-Weyers, S., Maier, A., Eysholdt, U., Bocklet, T. & Nöth, E., 1 Sep 2012, Proc. WOCCI. Portland, USA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  396. Detecting Acronyms from Capital Letter Sequences in Spanish

    San-Segundo, R., Montero, J. M., Lopez-Luden, V. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  397. Joint Uncertainty Decoding with Unscented Transform for Noise Robust Subspace Gaussian Mixture Models

    Lu, L., Ghoshal, A. & Renals, S., 1 Sep 2012, SAPA - SCALE Conference Proceedings.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  398. Ultrax: An Animated Midsagittal Vocal Tract Display for Speech Therapy

    Richmond, K. & Renals, S., 1 Sep 2012, INTERSPEECH 2012: 13th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 74-77 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  399. Using Bayesian Networks to find relevant context features for HMM-based speech synthesis

    Lu, H. & King, S., 1 Sep 2012, INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. ISCA-INST SPEECH COMMUNICATION ASSOC

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  400. Deep Architectures for Articulatory Inversion

    Uria, B., Murray, I., Renals, S. & Richmond, K., Sep 2012, INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. ISCA, p. 867-870 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  401. Synthetic Speech Discrimination using Pitch Pattern Statistics Derived from Image Analysis

    Leon, P. L. D., Stewart, B. & Yamagishi, J., Sep 2012, Proc. Interspeech: Portland, Oregon, USE.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  402. Towards an Unsupervised Speaking Style Voice Building Framework: Multi-Style Speaker Diarization

    Lorenzo, J., Martinez, B., Barra-Chicote, R., Lopez-Ludena, V., Ferreiros, J., Yamagishi, J. & Montero, J. M., Sep 2012, Proc. Interspeech 2012: 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  403. Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disorders

    Veaux, C., Yamagishi, J. & King, S., Sep 2012, Proceedings of INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. p. 967-970 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  404. Vowel Creation by Articulatory Control in HMM-based Parametric Speech Synthesis

    Ling, Z-H., Richmond, K. & Yamagishi, J., Sep 2012, INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. p. 72 1 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  405. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J., Vipperla, R., Evans, N. & Troncy, R., Aug 2012, In : ACM Transactions on Information Systems. 30, 3, p. - 34 p., 16.

    Research output: Contribution to journalArticle

  406. Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

    Oura, K., Yamagishi, J., Wester, M., King, S. & Tokuda, K., 1 Jul 2012, In : Speech Communication. 54, 6, p. 703-714 12 p.

    Research output: Contribution to journalArticle

  407. Multimodal Signal Processing: Human Interactions in Meetings

    Renals, S., Bourlard, H., Carletta, J. & Popescu-Belis, A., Jul 2012, Cambridge University Press.

    Research output: Book/ReportBook

  408. Using an intelligibility measure to create noise robust cepstral coefficients for HMM-based speech synthesis

    Valentini-Botinhao, C., Yamagishi, J. & King, S., May 2012, Proc. LISTA Workshop: Edinburgh, UK.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  409. Vowel Creation by Articulatory Control in HMM-based Parametric Speech Synthesis

    Ling, Z., Richmond, K. & Yamagishi, J., May 2012, p. 72. 1 p.

    Research output: Contribution to conferencePoster

  410. Evaluating language understanding accuracy with respect to objective outcomes in a dialogue system

    Dzikovska, M. O., Bell, P., Isard, A. & Moore, J. D., 1 Apr 2012, Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Avignon, France: Association for Computational Linguistics, p. 471-481 11 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  411. Combining vocal tract length normalization with hierarchial linear transformations

    Saheer, L., Yamagishi, J., Garner, P. N. & Dines, J., 1 Mar 2012, Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. p. 4493-4496 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  412. Term-dependent Confidence Normalization for Out-of-Vocabulary Spoken Term Detection

    Wang, D., Tejedor, J., King, S. & Frankel, J., Mar 2012, In : Journal of Computer Science and Technology. 27, 2, p. 358-375 17 p.

    Research output: Contribution to journalArticle

  413. Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis

    Andersson, S., Yamagishi, J. & Clark, R. A. J., Feb 2012, In : Speech Communication. 54, 2, p. 175-188 14 p.

    Research output: Contribution to journalArticle

  414. The magnetic resonance imaging subset of the mngu0 articulatory corpus

    Steiner, I., Richmond, K., Marshall, I. & Gray, C., Feb 2012, In : The Journal of the Acoustical Society of America. 131, 2, p. EL106-EL111 6 p.

    Research output: Contribution to journalArticle

  415. A grapheme-based method for automatic alignment of speech and text data

    Stan, A., Bell, P. & King, S., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 286-290 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  416. A tutorial dialogue system with unrestricted spoken input

    Bell, P., Dzikovska, M. & Isard, A., 2012, INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. p. 2113-2114 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  417. Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Maia, R., Yamagishi, J., King, S. & Zen, H., 2012, 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP): Kyoto, Japan. NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 3997-4000 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  418. Designing a spoken language interface for a tutorial dialogue system

    Bell, P., Dzikovska, M. & Isard, A., 2012, INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. p. 1283-1286 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  419. Determining the number of speakers in a meeting using microphone array features

    Zwyssig, E., Renals, S. & Lincoln, M., 2012, Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. p. 4765-4768 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  420. Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise

    King, S., Yamagishi, J. & Valentini-Botinhao, C., 2012, Proc. SAPA-SCALE Workshop on Statistical and Perceptual Audition (SAPA-SCALE 2012). Portland, OR, USA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  421. Generative Goal-driven User Simulation for Dialog Management

    Eshky, A., Allison, B. & Steedman, M., 2012, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Stroudsburg, PA, USA: Association for Computational Linguistics, p. 71-81 11 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  422. Impacts of machine translation and speech synthesis on speech-to-speech translation

    Hashimoto, K., Yamagishi, J., Byrne, W., King, S. & Tokuda, K., 2012, In : Speech Communication. 54, 7, p. 857-866 10 p.

    Research output: Contribution to journalArticle

  423. Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition

    Lu, L., Ghoshal, A. & Renals, S., 2012, Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 4877-4880 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  424. Noise Compensation for Subspace Gaussian Mixture Models.

    Lu, L., Chin, K. K., Ghoshal, A. & Renals, S., 2012, INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. ISCA, p. 306-309 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  425. On the effect of SNR and superdirective beamforming in speaker diarisation in meetings

    Zwyssig, E., Renals, S. & Lincoln, M., 2012, Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. p. 4177-4180 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  426. Simple4All proposals for the Albayzin Evaluations in Speech Synthesis

    Lorenzo-Trueba, J., Watts, O., Barra-Chicote, R., Yamagishi, J., King, S. & Montero, J. M., 2012, Proc. Iberspeech 2012.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  427. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction

    Yamagishi, J., Veaux, C., King, S. & Renals, S., 2012, In : Acoustical Science and Technology. 33, 1, p. 1-5 5 p.

    Research output: Contribution to journalArticle

  428. Spoken dialogue interfaces for older people

    Vipperla, R., Wolters, M. & Renals, S., 2012, Advances in Home Care Technologies. Turner, K. J. (ed.). IOS Press, Vol. Volume 31: Advances in Home Care Technologies. p. 118-137 21 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  429. The UEDIN system for the IWSLT 2012 evaluation

    Hasler, E., Bell, P., Ghoshal, A., Haddow, B., Koehn, P., McInnes, F., Renals, S. & Swietojanski, P., 2012, Proc. International Workshop on Spoken Language Translation. p. 46-53 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  430. Towards Glottal Source Controllability in Expressive Speech Synthesis

    Lorenzo-Trueba, J., Barra-Chicote, R., Raitio, T., Obin, N., Alku, P., Yamagishi, J. & Montero, J. M., 2012, INTERSPEECH: 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  431. Transcription of multi-genre media archives using out-of-domain data

    Bell, P. J., Gales, M. J. F., Lanchantin, P., Liu, X., Long, Y., Renals, S., Swietojanski, P. & Woodland, P. C., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 324-329 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  432. Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR

    Swietojanski, P., Ghoshal, A. & Renals, S., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 246-251 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  433. Using multimodal speech production data to evaluate articulatory animation for audiovisual speech synthesis

    Steiner, I., Richmond, K. & Ouni, S., 2012, 3rd International Symposium on Facial Analysis and Animation.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  434. 2011
  435. A Deep Neural Network for Acoustic-Articulatory Speech Inversion

    Uria, B., Renals, S. & Richmond, K., Dec 2011, Proc. NIPS 2011 Workshop on Deep Learning and Unsupervised Feature Learning.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  436. Automatic analysis of multiparty meetings

    Renals, S., 1 Oct 2011, In : Sadhana - Academy Proceedings in Engineering Sciences. 36, 5, p. 917-932 16 p.

    Research output: Contribution to journalArticle

  437. An introduction to statistical parametric speech synthesis

    King, S., Oct 2011, In : Sadhana-Academy proceedings in engineering sciences. 36, 5, p. 837-852 16 p.

    Research output: Contribution to journalArticle

  438. Carnival - Combining Speech Technology and Computer Animation

    Berger, M. A., Hofer, G. & Shimodaira, H., 1 Sep 2011, In : IEEE Computer Graphics and Applications. 31, 5, p. 80-89 10 p.

    Research output: Contribution to journalArticle

  439. Speech Synthesis

    King, S., Sep 2011, Speech and Audio Signal Processing. Gold, B., Morgan, N. & Ellis, D. (eds.). 2nd ed. Wiley, p. 431-454 23 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  440. Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus

    Richmond, K., Hoole, P. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1505-1508 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  441. Can Objective Measures Predict the Intelligibility of Modified HMM-based Synthetic Speech in Noise?

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1837-1840 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  442. Formant-controlled HMM-based speech synthesis

    Lei, M., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dai, L-R., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2777-2780 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  443. Feature-space transform tying in unified acoustic-articulatory modelling of articulatory control of HMM-based speech synthesis

    Ling, Z-H., Richmond, K. & Yamagishi, J., Aug 2011, Proc. Interspeech: 12th Annual Conference of the International Speech Communication Association . p. 117-120 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  444. Unsupervised continuous-valued word features for phrase-break prediction without a part-of-speech tagger.

    Watts, O., Yamagishi, J. & King, S., Aug 2011, Proceedings of the 12th Annual Conference of the International Speech Communication Association. Cosi, P., De Mori, R., Di Fabbrizio, G. & Pieraccini, R. (eds.). ISCA, p. 2157-2160 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  445. A new group delay-based feature for robust speech recognition

    Loweimi, E. & Ahadi, S. M., 1 Jul 2011, 2011 IEEE International Conference on Multimedia and Expo. Institute of Electrical and Electronics Engineers (IEEE), p. 1-5 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  446. Beetle II: an adaptable tutorial dialogue system

    Dzikovska, M., Isard, A., Bell, P., Moore, J., Steinhauser, N. & Campbell, G., 1 Jun 2011, Proceedings of the SIGDIAL 2011 Conference. Portland, Oregon: Association for Computational Linguistics, p. 338-340 3 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  447. Detection of synthetic speech for the problem of imposture

    De Leon, P. L., Hernaez, I., Saratxaga, I., Pucher, M. & Yamagishi, J., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP): 2011 IEEE International Conference. p. 4844-4847 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  448. Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5112-5115 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  449. HMM-based speech synthesiser using the LF-model of the glottal source

    Cabral, J., Renals, S., Yamagishi, J. & Richmond, K., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 4704-4707 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  450. Handling overlaps in spoken term detection

    Wang, D., Evans, N., Troncy, R. & King, S., 1 May 2011, Proc. International Conference on Acoustics, Speech and Signal Processing. p. 5656-5659 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  451. On the importance of phase and magnitude spectra in speech enhancement

    Loweimi, E., Ahadi, S. M. & Loveymi, S., 1 May 2011, 2011 19th Iranian Conference on Electrical Engineering. Institute of Electrical and Electronics Engineers (IEEE), p. 1-6 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  452. Vocal attractiveness of statistical speech synthesisers

    Andraszewicz, S., Yamagishi, J. & King, S., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5368-5371 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  453. An analysis of machine translation and speech synthesis in speech-to-speech translation system

    Hashimoto, K., Yamagishi, J., Byrne, W., King, S. & Tokuda, K., May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5108-5111 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  454. Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S. & Frankel, J., May 2011, In : IEEE Transactions on Audio, Speech and Language Processing. 19, 4, p. 688-698 11 p.

    Research output: Contribution to journalArticle

  455. Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis

    Mayo, C., Clark, R. A. J. & King, S., 1 Mar 2011, In : Speech Communication. 53, 3, p. 311-326 15 p.

    Research output: Contribution to journalArticle

  456. The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate

    Stan, A., Yamagishi, J., King, S. & Aylett, M., Mar 2011, In : Speech Communication. 53, 3, p. 442-450 9 p.

    Research output: Contribution to journalArticle

  457. Letter-to-Sound Pronunciation Prediction Using Conditional Random Fields

    Wang, D. & King, S., 1 Feb 2011, In : IEEE Signal Processing Letters. 18, 2, p. 122-125 4 p.

    Research output: Contribution to journalArticle

  458. HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering

    Raitio, T., Suni, A., Yamagishi, J., Pulakka, H., Nurminen, J., Vainio, M. & Alku, P., Jan 2011, In : IEEE Transactions on Audio, Speech and Language Processing. 19, 1, p. 153-165 13 p.

    Research output: Contribution to journalArticle

  459. Adaptive Intelligent Tutorial Dialogue in the BEETLE II System

    Dzikovska, M. O., Isard, A., Bell, P., Moore, J. D., Steinhauser, N. B., Campbell, G. E., Taylor, L. S., Caine, S. & Scott, C., 2011, Artificial Intelligence in Education: 15th International Conference, AIED 2011, Auckland, New Zealand, June 28 – July 2011. Biswas, G., Bull, S., Kay, J. & Mitrovic, A. (eds.). Springer-Verlag GmbH, p. 621-621 1 p. (Lecture Notes in Computer Science; vol. 6738).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  460. Phase-Only Speech Reconstruction Using Very Short Frames

    Loweimi, E., Ahadi, S. M. & Sheikhzadeh, H., 2011, Proc. Interspeech 2011. ISCA, p. 2501-2504 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  461. Regularized Subspace Gausian Mixture Models for Speech Recognition

    Lu, L., Ghoshal, A. & Renals, S., 2011, In : IEEE Signal Processing Letters. 18, 7, p. 419-422 4 p.

    Research output: Contribution to journalArticle

  462. Regularized subspace Gaussian mixture models for cross-lingual speech recognition

    Lu, L., Ghoshal, A. & Renals, S., 2011, Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on. Institute of Electrical and Electronics Engineers (IEEE), p. 365-370 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  463. Speech Synthesis

    King, S., Ellis, D. & Morgan, N., 2011, Speech and Audio Signal Processing. Gold, B., Morgan, N. & Ellis, D. (eds.). 2nd ed. Wiley, p. 431-454 24 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter

  464. The Ambient Spotlight: Personal meeting capture with a microphone array

    Kilgour, J., Carletta, J. & Renals, S., 2011, Hands-free Speech Communication and Microphone Arrays (HSCMA), 2011 Joint Workshop on. Institute of Electrical and Electronics Engineers (IEEE), p. 163-164 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  465. Unsupervised Features from Text for Speech Synthesis in a Speech-to-Speech Translation System

    Watts, O. & Zhou, B., 2011, INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association, Florence, Italy, August 27-31, 2011. ISCA, p. 2153-2156 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  466. Voice Banking and Voice Reconstruction for MND patients

    Veaux, C., Yamagishi, J. & King, S., 2011, ASSETS 11: Proceedings of the 13th International ACM Sigaccess conference on computers and accessibility. New York: ASSOC COMPUTING MACHINERY, p. 305-306 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  467. 2010
  468. Measuring the Gap Between HMM-Based ASR and TTS

    Dines, J., Yamagishi, J. & King, S., 1 Dec 2010, In : IEEE Journal of Selected Topics in Signal Processing. 4, 6, p. 1046-1058 13 p.

    Research output: Contribution to journalArticle

  469. Hierarchical Bayesian Language Models for Conversational Speech Recognition

    Huang, S. & Renals, S., Nov 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 8, p. 1941-1954 14 p.

    Research output: Contribution to journalArticle

  470. An Analysis of HMM-based Prediction of Articulatory Movements

    Ling, Z-H., Richmond, K. & Yamagishi, J., Oct 2010, In : Speech Communication. 52, 10, p. 834-846 13 p.

    Research output: Contribution to journalArticle

  471. A Unified and Automatic Approach Of Mandarin HTS System

    Guan, Y., Tian, J., Wu, Y-J., Yamagishi, J. & Nurminen, J., 1 Sep 2010, Proc. SSW7.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  472. CRF-based Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, Interspeech 2010: 11th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1668-1671 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  473. Direct Posterior Confidence For Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, SSCS '10 Proceedings of the 2010 international workshop on Searching spontaneous conversational speech. ACM, p. 21-26 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  474. Synthesis of fast speech with interpolation of adapted HSMMs and its evaluation by blind and sighted listeners

    Pucher, M., Schabus, D. & Yamagishi, J., 1 Sep 2010, Proc. Interspeech. p. 2186-2189 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  475. Utilising Spontaneous Conversational Speech in HMM-Based Speech Synthesis

    Andersson, S., Yamagishi, J. & Clark, R., 1 Sep 2010, The 7th ISCA Tutorial and Research Workshop on Speech Synthesis.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  476. Comparison of HMM and TMDN Methods for Lip Synchronisation

    Hofer, G. & Richmond, K., Sep 2010, INTERSPEECH 2010 11th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 454-457 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  477. HMM-based Text-to-Articulatory-Movement Prediction and Analysis of Critical Articulators

    Ling, Z-H., Richmond, K. & Yamagishi, J., Sep 2010, INTERSPEECH 2010 11th Annual Conference of the International Speech Communication Association. ISCA, p. 2194-2197 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  478. Letter-based speech synthesis

    Watts, O., Yamagishi, J. & King, S., Sep 2010, Proc. Speech Synthesis Workshop 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  479. On Generating Combilex Pronunciations via Morphological Analysis

    Richmond, K., Clark, R. & Fitt, S., Sep 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  480. Relying on critical articulators to estimate vocal tract spectra in an articulatory-acoustic database

    Richmond, K., Felps, D., Geng, C., Berger, M. & Gutierrez-Osuna, R., Sep 2010, Proc. Interspeech. p. 1900-1993 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  481. An Edinburgh Speech Production Facility

    Turk, A., Scobbie, J., Geng, C., Dickie, C., Bard, E., Hardcastle, W., Hartinger, M., King, S., Lickley, R., Renals, S., Richmond, K., Schaeffler, S., White, K. & Wrench, A., Jul 2010, (Unpublished).

    Research output: Contribution to conferencePoster

  482. Personalising speech-to-speech translation in the EMIME project

    Kurimo, M., Byrne, W., Dines, J., Garner, P. N., Gibson, M., Guan, Y., Hirsimaki, T., Karhila, R., King, S., Liang, H., Oura, K., Saheer, L., Shannon, M., Shiota, S., Tian, J., Tokuda, K., Wester, M., Wu, Y-J. & Yamagishi, J., Jul 2010, Proceedings of the ACL 2010 System Demonstrations. p. 48-53 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  483. Synthesis of Child Speech With HMM Adaptation and Voice Conversion

    Watts, O., Yamagishi, J., King, S. & Berkling, K., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 1005-1016 12 p.

    Research output: Contribution to journalArticle

  484. Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora

    Yamagishi, J., Usabaev, B., King, S., Watts, O., Dines, J., Tian, J., Guan, Y., Hu, R., Oura, K., Wu, Y-J., Tokuda, K., Karhila, R. & Kurimo, M., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 984-1004 21 p.

    Research output: Contribution to journalArticle

  485. Recognition and Understanding of Meetings

    Renals, S., Jun 2010, Proc. NAACL/HLT. Association for Computational Linguistics, p. 1-9

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  486. Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech

    Barra-Chicote, R., Yamagishi, J., King, S., Montero, J. M. & Macias-Guarasa, J., 1 May 2010, In : Speech Communication. 52, 5, p. 394-404 11 p.

    Research output: Contribution to journalArticle

  487. Objective evaluation of phase and magnitude only reconstructed speech: New considerations

    Loweimi, E. & Ahadi, S. M., 1 May 2010, 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010). Institute of Electrical and Electronics Engineers (IEEE), p. 117-120 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  488. Evaluation of a hierarchical reinforcement learning spoken dialogue system

    Cuayáhuitl, H., Renals, S., Lemon, O. & Shimodaira, H., Apr 2010, In : Computer Speech and Language. 24, 2, p. 395-429 35 p.

    Research output: Contribution to journalArticle

  489. Objective evaluation of magnitude and phase only spectrum-based reconstruction of the Speech signal

    Loveimi, E. & Ahadi, S. M., 1 Mar 2010, 2010 4th International Symposium on Communications, Control and Signal Processing (ISCCSP). Institute of Electrical and Electronics Engineers (IEEE), p. 1-4 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  490. Stochastic Pronunciation Modelling and Soft Match for Out-of-vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J. & Bell, P., 1 Mar 2010, Proceedings of the 2010 IEEE International conference on Acoustic Speech and Signal Processing (ICASSP). NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 5294-5297 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  491. Automatic speech recognition

    Renals, S. & King, S., Feb 2010, Handbook of Phonetic Sciences. Hardcastle, W. J., Laver, J. & Gibbon, F. E. (eds.). 2nd ed. Wiley-Blackwell, Vol. 1.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  492. Modeling and interpolation of Austrian German and Viennese dialect in HMM-based speech synthesis

    Pucher, M., Schabus, D., Yamagishi, J., Neubarth, F. & Strom, V., Feb 2010, In : Speech Communication. 52, 2, p. 164-179 16 p.

    Research output: Contribution to journalArticle

  493. Building personalised synthesised voices for individuals with dysarthia using the HTS toolkit

    Creer, S., Green, P., Cunningham, S. & Yamagishi, J., 31 Jan 2010, Computer Synthesized Speech Technologies: Tools for Aiding Impairment. Mullennix, J. M. & Stern, S. E. (eds.). 1 ed. IGI Global, p. 92-115

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  494. A Classifier-based target cost for unit selection speech synthesis trained on perceptual data

    Strom, V. & King, S., 2010, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  495. A Digital Microphone Array for Distant Speech Recognition

    Zwyssig, E., Lincoln, M. & Renals, S., 2010, Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on . New York: Institute of Electrical and Electronics Engineers (IEEE), p. 5106-5109 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  496. A tutorial on HMM speech synthesis (Invited paper)

    King, S., 2010, Sadhana -- Academy Proceedings in Engineering Sciences, Indian Institute of Sciences.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  497. Ageing voices: The effect of changes in voice parameters on ASR performance

    Vipperla, R., Renals, S. & Frankel, J., 2010, In : EURASIP Journal on Audio, Speech, and Music Processing. 10 p., 525783.

    Research output: Contribution to journalArticle

  498. Augmentation of adaptation data

    Vipperla, R., Renals, S. & Frankel, J., 2010, INTERSPEECH 2010 11th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 530-533 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  499. Augmented set of features for confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Bautista, M., King, S., Wang, D. & Colas, J., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  500. Carnival: a modular framework for automated facial animation

    Berger, M., Hofer, G. & Shimodaira, H., 2010, ACM SIGGRAPH 2010 Posters. New York, NY, USA: ACM, p. 5:1-5:1 (SIGGRAPH '10).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  501. Evaluating speech synthesis intelligibility using Amazon Mechanical Turk

    Wolters, M. K., Isaac, K. B. & Renals, S., 2010, Proc. 7th Speech Synthesis Workshop (SSW7). p. 136-141 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  502. Evaluation of the Vulnerability of Speaker Verification to Synthetic Speech

    Leon, P. L. D., Pucher, M. & Yamagishi, J., 2010, Proc. Odyssey (The speaker and language recognition workshop) 2010: Brno, Czech Republic.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  503. Lip Synchronization by Acoustic Inversion

    Hofer, G., Richmond, K. & Berger, M., 2010, ACM SIGGRAPH 2010 Posters. ACM, 1 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  504. Power Law Discounting for N-Gram Language Models

    Huang, S. & Renals, S., 2010, Proc. IEEE ICASSP--10.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  505. Querying Linguistic Trees

    Lai, C. & Bird, S., 2010, In : Journal of Logic, Language and Information. 19, 1, p. 53-73 21 p.

    Research output: Contribution to journalArticle

  506. Revisiting the security of speaker verification systems against imposture using synthetic speech

    Leon, P. L. D., Apsingekar, V. R., Pucher, M. & Yamagishi, J., 2010, Proc. ICASSP 2010: Dallas, TX, USE.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  507. Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis

    Yamagishi, J., Watts, O., King, S. & Usabaev, B., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  508. Simple methods for improving speaker-similarity of HMM-based speech synthesis

    Yamagishi, J. & King, S., 2010, Proc. ICASSP 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  509. Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project

    Wester, M., Dines, J., Gibson, M., Liang, H., Wu, Y-J., Saheer, L., King, S., Oura, K., Garner, P. N., Byrne, W., Guan, Y., Hirsimaki, T., Karhila, R., Kurimo, M., Shannon, M., Shiota, S., Tian, J., Tokuda, K. & Yamagishi, J., 2010, Proc. of 7th ISCA Speech Synthesis Workshop.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

Previous 1 2 Next