Edinburgh Research Explorer

Prof Simon King

Personal Chair of Speech Processing

  1. 2020
  2. A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis

    Wang, X., Takaki, S., Yamagishi, J., King, S. & Tokuda, K., 1 Jan 2020, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28, p. 157-170 13 p.

    Research output: Contribution to journal › Article

  3. 2019
  4. Measuring the contribution to cognitive load of each predicted vocoder speech parameter in DNN-based speech synthesis

    Govender, A., Valentini-Botinhao, C. & King, S., 22 Sep 2019, Proceedings of the 10th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 121-126 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  5. Evaluating Near End Listening Enhancement Algorithms in Realistic Environments

    Chermaz, C., Valentini Botinhao, C., Schepker, H. & King, S., 19 Sep 2019, Proceedings Interspeech 2019. International Speech Communication Association, p. 1373-1377 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  6. Improving speech synthesis with discourse relations

    Aubin, A., Cervone, A., Watts, O. & King, S., 19 Sep 2019, Interspeech 2019. ISCA, Vol. 2019-September. p. 4470-4474 5 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  7. Attentive filtering networks for audio replay attack detection

    Lai, C-I., Abad, A., Richmond, K., Yamagishi, J., Dehak, N. & King, S., 17 Apr 2019, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing. p. 6316-6320

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  8. Speech Waveform Reconstruction using Convolutional Neural Networks with Noise and Periodic Inputs

    Watts, O., Valentini Botinhao, C. & King, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 7045-7049 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  9. Exemplar-based speech waveform generation for text-to-speech

    Valentini Botinhao, C., Watts, O., Espic Calderón, F. & King, S., 14 Feb 2019, 2018 IEEE Workshop on Spoken Language Technology (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 332-338 7 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  10. Comunicación enriquecida a lo largo de la vida

    Cooke, M., King, S., Hazan, V., Stylianou, Y., Janse, E., Baskent, D., Hohmann, V., Winneke, A. & Hernaez, I., 2019, In: Procesamiento del Lenguaje Natural. 63, p. 175-178

    Research output: Contribution to journal › Article

  11. 2018
  12. Exemplar-based Speech Waveform Generation

    Watts, O., Valentini Botinhao, C., Espic Calderón, F. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 2022-2026 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  13. Impact of different speech types on listening effort

    Symantiraki, O., Cooke, M. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2267-2271

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  14. Learning interpretable control dimensions for speech synthesis by using external data

    Hodari, Z., Watts, O., Ronanki, S. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 32-36 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  15. Measuring the cognitive load of synthetic speech using a dual task paradigm

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  16. Using pupillometry to measure the cognitive load of synthetic speech

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  17. 2017
  18. A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis

    Ronanki, S., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 1133-1137 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  19. Nativization of foreign names in TTS for automatic reading of world news in Swahili

    Mendelson, J., Oplustil, P., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 2188-2192 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  20. Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis

    Espic Calderón, F., Valentini Botinhao, C. & King, S., 20 Aug 2017, Interspeech 2017. 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  21. Using eigenvoices and nearest-neighbours in HMM-based cross-lingual speaker adaptation with limited data

    Sarfjoo, S. S., Demiroglu, C. & King, S., Apr 2017, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 4, p. 839-851 13 p.

    Research output: Contribution to journal › Article

  22. Median-based generation of synthetic speech durations using a non-parametric approach

    Ronanki, S., Watts, O., King, S. & Henter, G. E., 9 Feb 2017, 2016 IEEE Spoken Language Technology Workshop (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 686-692 7 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  23. 2016
  24. DNN-based Speech Synthesis for Indian Languages from ASCII text

    Ronanki, S., Gangireddy, S. R., Bollepalli, B. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 74-79 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  25. Merlin: An Open Source Neural Network Speech Synthesis System

    Wu, Z., Watts, O. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop (2016). p. 202-207 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  26. Waveform generation based on signal reshaping for statistical parametric speech synthesis

    Espic, F., Valentini Botinhao, C., Wu, Z. & King, S., 12 Sep 2016, Interspeech 2016. San Francisco, United States, p. 2263-2267 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  27. GlottDNN - A full-band glottal vocoder for statistical parametric speech synthesis

    Airaksinen, M., Bollepalli, B., Juvela, L., Wu, Z., King, S. & Alku, P., 8 Sep 2016.

    Research output: Contribution to conference › Paper

  28. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance

    Wu, Z., De Leon, P., Demiroglu, C., Khodabakhsh, A., King, S., Ling, Z., Saito, D., Stewart, B., Toda, T., Wester, M. & Yamagishi, J., Apr 2016, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 4, p. 768-783 17 p.

    Research output: Contribution to journal › Article

  29. From HMMs to DNNs: Where Do the Improvements Come From?

    Watts, O., Henter, G. E., Merritt, T., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5505-5509 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  30. Investigating gated recurrent neural networks for speech synthesis

    Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 1-5 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  31. Robust TTS Duration Modelling Using DNNs

    Henter, G., Ronanki, S., Watts, O., Wester, M., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5130-5134 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  32. Testing the Consistency Assumption: Pronunciation Variant Forced Alignment in Read and Spontaneous Speech Synthesis

    Dall, R., Brognaux, S., Richmond, K., Valentini Botinhao, C., Henter, G., Hirschberg, J., Yamagishi, J. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5155-5159 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  33. Speech synthesis

    King, S., 25 Feb 2016, Oxford Bibliographies in Linguistics. Aronoff, M. (ed.). New York: Oxford University Press, 29 p.

    Research output: Chapter in Book/Report/Conference proceeding › Other chapter contribution

  34. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., Jan 2016, In: Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journal › Article

  35. Smooth talking: articulatory join costs for unit selection

    Richmond, K. & King, S., 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5150-5154 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  36. 2015
  37. Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay

    Chee Yong, L., Watts, O. & King, S., 15 Dec 2015, In: Research Journal of Applied Sciences, Engineering and Technology. 11, 11, p. 1227-1232 6 p.

    Research output: Contribution to journal › Article

  38. A study of speaker adaptation for DNN-based speech synthesis

    Wu, Z., Swietojanski, P., Veaux, C., Renals, S. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  39. Deep neural network context embeddings for model selection in rich-context HMM synthesis

    Merritt, T., Yamagishi, J., Wu, Z., Watts, O. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. Dresden: International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  40. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  41. Reconstructing Voices within the Multiple-Average-Voice-Model framework

    Lanchantin, P., Veaux, C., Gales, M. J. F., King, S. & Yamagishi, J., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2232-2236 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  42. Sentence-level control vectors for deep neural network speech synthesis

    Watts, O., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2217-2221 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  43. Towards minimum perceptual error training for DNN-based speech synthesis

    Valentini-Botinhao, C., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 869-873 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  44. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  45. A reading list of recent advances in speech synthesis

    King, S., 10 Aug 2015, Proc. 18th International Congress of Phonetic Sciences (ICPhS). The Scottish Consortium for ICPhS 2015 (ed.). Glasgow, UK: University of Glasgow

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  46. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

    Poblete, V., Espic, F., King, S., Stern, R. M., Huenupan, F., Fredes, J. & Yoma, N. B., May 2015, In: Computer Speech and Language. 31, 1, p. 1-27 27 p.

    Research output: Contribution to journal › Article

  47. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis

    Wu, Z., Valentini-Botinhao, C., Watts, O. & King, S., 1 Apr 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, Australia, p. 4460-4464 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  48. Soft context clustering for F0 modeling in HMM-based speech synthesis

    Khorram, S., Sameti, H. & King, S., 9 Jan 2015, In: EURASIP Journal on Advances in Signal Processing. 2015, 1

    Research output: Contribution to journal › Article

  49. SAS: A Speaker Verification Spoofing Database Containing Diverse Attacks

    Wu, Z., Khodabakhsh, A., Demiroglu, C., Yamagishi, J., Saito, D., Toda, T. & King, S., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 4440-4444 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  50. 2014
  51. Voice source modelling using deep neural networks for statistical parametric speech synthesis

    Raitio, T., Lu, H., Kane, J., Suni, A., Vainio, M., King, S. & Alku, P., 1 Sep 2014, European Signal Processing Conference. European Signal Processing Conference, EUSIPCO, p. 2290-2294 5 p. 6952838

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  52. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In: Computer Speech and Language. 28, 5, p. 1083-1114 32 p.

    Research output: Contribution to journal › Article

  53. Intelligibility Enhancement of Speech in Noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., Sep 2014, Proceedings of the Institute of Acoustics 2014. Vol. 36. 8 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  54. Multiple-average-voice-based speech synthesis

    Lanchantin, P., Gales, M. J. F., King, S. & Yamagishi, J., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 285-289 5 p. 6853603

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  55. Neural net word representations for phrase-break prediction without a part of speech tagger

    Watts, O., Gangireddy, S., Yamagishi, J., King, S., Renals, S., Stan, A. & Giurgiu, M., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 2599-2603 5 p. 6854070

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  56. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In: IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journal › Editorial

  57. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In: Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journal › Article

  58. Introduction to the Special Issue on The listening talker: Context-dependent speech production and perception

    Cooke, M., King, S., Kleijn, W. B. & Stylianou, Y., Mar 2014, In: Computer Speech and Language. 28, 2, p. 540-542

    Research output: Contribution to journal › Article

  59. The listening talker: A review of human and algorithmic context-induced modifications of speech

    Cooke, M., King, S., Garnier, M. & Aubanel, V., Mar 2014, In: Computer Speech and Language. 28, 2, p. 543-571 29 p.

    Research output: Contribution to journal › Literature review

  60. Measuring a decade of progress in Text-to-Speech

    King, S., Jan 2014, In: Loquens. 1, 1, e006.

    Research output: Contribution to journal › Article

  61. Statistical parametric speech synthesis for Ibibio

    Ekpenyong, M., Urua, E-A., Watts, O., King, S. & Yamagishi, J., Jan 2014, In: Speech Communication. 56, p. 243-251 9 p.

    Research output: Contribution to journal › Article

  62. Development of a Genre-Dependent TTS System with Cross-Speaker Speaking-Style Transplantation

    Lorenzo-Trueba, J., Echeverry-Correa, J. D., Barra-Chicote, R., San-Segundo, R., Ferreiros, J., Gallardo-Antolin, A., Yamagishi, J., King, S. & Montero, J. M., 2014, 2nd International Workshop on Speech, Language and Audio in Multimedia (SLAM2014). International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  63. Investigating Automatic & Human Filled Pause Insertion for Speech Synthesis

    Dall, R., Tomalin, M., Wester, M., Byrne, W. & King, S., 2014, Proc. Interspeech. 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  64. Measuring the Perceptual Effects of Modelling Assumptions in Speech Synthesis Using Stimuli Constructed from Repeated Natural Speech

    Henter, G. E., Merritt, T., Shannon, M., Mayo, C. & King, S., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1504-1508 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  65. Rating Naturalness in Speech Synthesis: The Effect of Style and Expectation

    Dall, R., Yamagishi, J. & King, S., 2014, Speech Prosody 2014.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  66. Unsupervised lexical clustering of speech segments using fixed dimensional acoustic embeddings

    Kamper, H., Jansen, A., King, S. & Goldwater, S., 2014, Proceedings of the IEEE Spoken Language Technology Workshop. Institute of Electrical and Electronics Engineers (IEEE), 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  67. 2013
  68. Cross-Lingual Automatic Speech Recognition Using Tandem Features

    Lal, P. & King, S., Dec 2013, In: IEEE Transactions on Audio, Speech and Language Processing. 21, 12, p. 2506-2515 10 p.

    Research output: Contribution to journal › Article

  69. Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup

    Geng, C., Turk, A., Scobbie, J. M., Macmartin, C., Hoole, P., Richmond, K., Wrench, A., Pouplier, M., Bard, E., Campbell, Z., Dickie, C., Dubourg, E., Hardcastle, W., Kainada, E., King, S., Lickley, R., Nakai, S., Renals, S., White, K. & Wiegand, R., Nov 2013, In: Journal of Phonetics. 41, 6, p. 421-431 11 p.

    Research output: Contribution to journal › Article

  70. The voice bank corpus: Design, collection and data analysis of a large regional accent speech database

    Veaux, C., Yamagishi, J. & King, S., Nov 2013, Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference. Institute of Electrical and Electronics Engineers (IEEE), 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  71. Where are the challenges in speaker diarization?

    Sinclair, M. & King, S., 21 Oct 2013, IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013. Institute of Electrical and Electronics Engineers (IEEE), p. 7741-7745 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  72. Mage-HMM-based speech synthesis reactively controlled by the articulators

    Astrinaki, M., Moinet, A., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dutoit, T., Sep 2013, 8th ISCA Speech Synthesis Workshop. 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  73. Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis

    Lu, H., King, S. & Watts, O., 1 Aug 2013, 8th ISCA Speech Synthesis Workshop. p. 261-265 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  74. Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech

    Christensen, H., Aniol, M., Bell, P., Green, P., Hain, T., King, S. & Swietojanski, P., 1 Aug 2013, Proc. Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  75. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Stylianou, Y., 1 Aug 2013, Interspeech 2013.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  76. Investigating the shortcomings of HMM synthesis

    Merritt, T. & King, S., 1 Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 185-190 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  77. Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data

    Stan, A., Bell, P., Yamagishi, J. & King, S., 1 Aug 2013, Proc Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  78. The Edinburgh Speech Production Facility DoubleTalk Corpus

    Scobbie, J., Turk, A., Geng, C., King, S., Lickley, R. & Richmond, K., 1 Aug 2013, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  79. Mage - Reactive articulatory feature control of HMM-based parametric speech synthesis

    Astrinaki, M., Moinet, A., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dutoit, T., Aug 2013, 8th ISCA Workshop on Speech Synthesis: Barcelona, Spain. p. 227-231 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  80. Reactive accent interpolation through an interactive map application

    Astrinaki, M., Yamagishi, J., King, S., d'Alessandro, N. & Dutoit, T., Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 245-246 2 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  81. Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from 'found' data: evaluation and analysis

    Watts, O., Stan, A., Clark, R., Mamiya, Y., Giurgiu, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Speech Synthesis Workshop: Barcelona, Spain. International Speech Communication Association, p. 101-106 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  82. Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments

    Mamiya, Y., Stan, A., Yamagishi, J., Bell, P., Watts, O., Clark, R. & King, S., Aug 2013, Proc. 8th ISCA Speech Synthesis Workshop. p. 61-66 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  83. Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise

    Valentini-Botinhao, C., Wester, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Workshop on Speech Synthesis. p. 133-138 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  84. Discriminative Tandem Features for HMM-based EEG Classification

    Ting, C-M., King, S., Salleh, S-H. & Ariff, A. K., 1 Jul 2013, Proc. 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 13). IEEE Engineering in Medicine and Biology Society, Vol. 2013. p. 3957-3960

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  85. Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods

    Valentini-Botinhao, C., Godoy, E., Stylianou, Y., Sauert, B., King, S. & Yamagishi, J., May 2013, Proc. ICASSP - Vancouver, Canada.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  86. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

    Dines, J., Liang, H., Saheer, L., Gibson, M., Byrne, W., Oura, K., Tokuda, K., Yamagishi, J., King, S., Wester, M., Hirsimäki, T., Karhila, R. & Kurimo, M., Feb 2013, In: Computer Speech and Language. 27, 2, p. 420-437 18 p.

    Research output: Contribution to journal › Article

  87. Factorized context modelling for Text-to-Speech synthesis

    Lu, H. & King, S., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7849-7853 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  88. Lightly supervised GMM VAD to use audiobook for speech synthesiser

    Mamiya, Y., Yamagishi, J., Watts, O., Clark, R. A. J., King, S. & Stan, A., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7987-7991 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  89. TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

    Stan, A., Watts, O., Mamiya, Y., Giurgiu, M., Clark, R. A. J., Yamagishi, J. & King, S., 2013, INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association: Lyon, France, August 25-29, 2013. International Speech Communication Association, p. 2331-2335 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  90. Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction

    Veaux, C., Yamagishi, J. & King, S., 2013, SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies. ISCA, p. 107-111 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  91. 2012
  92. Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation

    Yang, C-Y., Brown, G., Lu, L., Yamagishi, J. & King, S., 4 Dec 2012, Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on. Institute of Electrical and Electronics Engineers (IEEE), p. 220-223 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  93. Analysis of Speaker Clustering Strategies for HMM-Based Speech Synthesis

    Dall, R., Veaux, C., Yamagishi, J. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  94. Detecting Acronyms from Capital Letter Sequences in Spanish

    San-Segundo, R., Montero, J. M., Lopez-Luden, V. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  95. Using Bayesian Networks to find relevant context features for HMM-based speech synthesis

    Lu, H. & King, S., 1 Sep 2012, INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  96. Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disorders

    Veaux, C., Yamagishi, J. & King, S., Sep 2012, Proceedings of INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. p. 967-970 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  97. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J., Vipperla, R., Evans, N. & Troncy, R., Aug 2012, In: ACM Transactions on Information Systems. 30, 3, 34 p., 16.

    Research output: Contribution to journal › Article

  98. Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

    Oura, K., Yamagishi, J., Wester, M., King, S. & Tokuda, K., 1 Jul 2012, In: Speech Communication. 54, 6, p. 703-714 12 p.

    Research output: Contribution to journal › Article

  99. Using an intelligibility measure to create noise robust cepstral coefficients for HMM-based speech synthesis

    Valentini-Botinhao, C., Yamagishi, J. & King, S., May 2012, Proc. LISTA Workshop: Edinburgh, UK.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  100. Term-dependent Confidence Normalization for Out-of-Vocabulary Spoken Term Detection

    Wang, D., Tejedor, J., King, S. & Frankel, J., Mar 2012, In: Journal of Computer Science and Technology. 27, 2, p. 358-375 17 p.

    Research output: Contribution to journal › Article

  101. A grapheme-based method for automatic alignment of speech and text data

    Stan, A., Bell, P. & King, S., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 286-290 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  102. Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Maia, R., Yamagishi, J., King, S. & Zen, H., 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): Kyoto, Japan. New York: Institute of Electrical and Electronics Engineers (IEEE), p. 3997-4000 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  103. Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise

    King, S., Yamagishi, J. & Valentini-Botinhao, C., 2012, Proc. SAPA-SCALE Workshop on Statistical and Perceptual Audition (SAPA-SCALE 2012). Portland, OR, USA

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  104. Impacts of machine translation and speech synthesis on speech-to-speech translation

    Hashimoto, K., Yamagishi, J., Byrne, W., King, S. & Tokuda, K., 2012, In: Speech Communication. 54, 7, p. 857-866 10 p.

    Research output: Contribution to journal › Article

  105. Simple4All proposals for the Albayzin Evaluations in Speech Synthesis

    Lorenzo-Trueba, J., Watts, O., Barra-Chicote, R., Yamagishi, J., King, S. & Montero, J. M., 2012, Proc. Iberspeech 2012.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  106. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction

    Yamagishi, J., Veaux, C., King, S. & Renals, S., 2012, In: Acoustical Science and Technology. 33, 1, p. 1-5 5 p.

    Research output: Contribution to journal › Article

  107. 2011
  108. An introduction to statistical parametric speech synthesis

    King, S., Oct 2011, In: Sadhana-Academy proceedings in engineering sciences. 36, 5, p. 837-852 16 p.

    Research output: Contribution to journal › Article

  109. Speech Synthesis

    King, S., Sep 2011, Speech and Audio Signal Processing. Gold, B., Morgan, N. & Ellis, D. (eds.). 2nd ed. Wiley, p. 431-454 23 p.

    Research output: Chapter in Book/Report/Conference proceeding › Chapter (peer-reviewed)
