Edinburgh Research Explorer

Prof Simon King

Personal Chair of Speech Processing

  1. 2019
  2. Measuring the contribution to cognitive load of each predicted vocoder speech parameter in DNN-based speech synthesis

    Govender, A., Valentini-Botinhao, C. & King, S., 2 Jul 2019, (Accepted/In press) 10th ISCA Speech Synthesis Workshop. 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  3. Evaluating Near End Listening Enhancement Algorithms in Realistic Environments

    Chermaz, C., Valentini Botinhao, C., Schepker, H. & King, S., 17 Jun 2019, (Accepted/In press) Proceedings Interspeech 2019. 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  4. Attentive filtering networks for audio replay attack detection

    Lai, C-I., Abad, A., Richmond, K., Yamagishi, J., Dehak, N. & King, S., 17 Apr 2019, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing. p. 6316-6320

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  5. Speech Waveform Reconstruction using Convolutional Neural Networks with Noise and Periodic Inputs

    Watts, O., Valentini Botinhao, C. & King, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 7045-7049 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  6. Exemplar-based speech waveform generation for text-to-speech

    Valentini Botinhao, C., Watts, O., Espic Calderón, F. & King, S., 14 Feb 2019, 2018 IEEE Workshop on Spoken Language Technology (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 332-338 7 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  7. 2018
  8. Exemplar-based Speech Waveform Generation

    Watts, O., Valentini Botinhao, C., Espic Calderón, F. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 2022-2026 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  9. Impact of different speech types on listening effort

    Symantiraki, O., Cooke, M. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2267-2271

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  10. Learning interpretable control dimensions for speech synthesis by using external data

    Hodari, Z., Watts, O., Ronanki, S. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 32-36 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  11. Measuring the cognitive load of synthetic speech using a dual task paradigm

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  12. Using pupillometry to measure the cognitive load of synthetic speech

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  13. 2017
  14. A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis

    Ronanki, S., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 1133-1137 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  15. Nativization of foreign names in TTS for automatic reading of world news in Swahili

    Mendelson, J., Oplustil, P., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 2188-2192 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  16. Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis

    Espic Calderón, F., Valentini Botinhao, C. & King, S., 20 Aug 2017, Interspeech 2017. 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  17. Using eigenvoices and nearest-neighbours in HMM-based cross-lingual speaker adaptation with limited data

    Sarfjoo, S. S., Demiroglu, C. & King, S., Apr 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 4, p. 839-851 13 p.

    Research output: Contribution to journal › Article

  18. Median-based generation of synthetic speech durations using a non-parametric approach

    Ronanki, S., Watts, O., King, S. & Henter, G. E., 9 Feb 2017, 2016 IEEE Spoken Language Technology Workshop (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 686-692 7 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  19. 2016
  20. DNN-based Speech Synthesis for Indian Languages from ASCII text

    Ronanki, S., Gangireddy, S. R., Bollepalli, B. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 74-79 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  21. Merlin: An Open Source Neural Network Speech Synthesis System

    Wu, Z., Watts, O. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop (2016). p. 202-207 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  22. Waveform generation based on signal reshaping for statistical parametric speech synthesis

    Espic, F., Valentini Botinhao, C., Wu, Z. & King, S., 12 Sep 2016, Interspeech 2016. San Francisco, United States, p. 2263-2267 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  23. GlottDNN - A full-band glottal vocoder for statistical parametric speech synthesis

    Airaksinen, M., Bollepalli, B., Juvela, L., Wu, Z., King, S. & Alku, P., 8 Sep 2016.

    Research output: Contribution to conference › Paper

  24. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance

    Wu, Z., De Leon, P., Demiroglu, C., Khodabakhsh, A., King, S., Ling, Z., Saito, D., Stewart, B., Toda, T., Wester, M. & Yamagishi, J., Apr 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 4, p. 768-783 17 p.

    Research output: Contribution to journal › Article

  25. From HMMs to DNNs: Where Do the Improvements Come From?

    Watts, O., Henter, G. E., Merritt, T., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5505-5509 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  26. Investigating gated recurrent neural networks for speech synthesis

    Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 1-5 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  27. Robust TTS Duration Modelling Using DNNs

    Henter, G., Ronanki, S., Watts, O., Wester, M., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5130-5134 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  28. Testing the Consistency Assumption: Pronunciation Variant Forced Alignment in Read and Spontaneous Speech Synthesis

    Dall, R., Brognaux, S., Richmond, K., Valentini Botinhao, C., Henter, G., Hirschberg, J., Yamagishi, J. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5155-5159 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  29. Speech synthesis

    King, S., 25 Feb 2016, Oxford Bibliographies in Linguistics. Aronoff, M. (ed.). New York: Oxford University Press, 29 p.

    Research output: Chapter in Book/Report/Conference proceeding › Other chapter contribution

  30. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., Jan 2016, In : Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journal › Article

  31. Smooth talking: articulatory join costs for unit selection

    Richmond, K. & King, S., 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5150-5154 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  32. 2015
  33. Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay

    Chee Yong, L., Watts, O. & King, S., 15 Dec 2015, In : Research Journal of Applied Sciences, Engineering and Technology. 11, 11, p. 1227-1232 6 p.

    Research output: Contribution to journal › Article

  34. A study of speaker adaptation for DNN-based speech synthesis

    Wu, Z., Swietojanski, P., Veaux, C., Renals, S. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  35. Deep neural network context embeddings for model selection in rich-context HMM synthesis

    Merritt, T., Yamagishi, J., Wu, Z., Watts, O. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. Dresden: International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  36. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  37. Reconstructing Voices within the Multiple-Average-Voice-Model framework

    Lanchantin, P., Veaux, C., Gales, M. J. F., King, S. & Yamagishi, J., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2232-2236 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  38. Sentence-level control vectors for deep neural network speech synthesis

    Watts, O., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2217-2221 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  39. Towards minimum perceptual error training for DNN-based speech synthesis

    Valentini-Botinhao, C., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 869-873 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  40. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  41. A reading list of recent advances in speech synthesis

    King, S., 10 Aug 2015, Proc. 18th International Congress of Phonetic Sciences (ICPhS). The Scottish Consortium for ICPhS 2015 (ed.). Glasgow, UK: University of Glasgow

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  42. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

    Poblete, V., Espic, F., King, S., Stern, R. M., Huenupan, F., Fredes, J. & Yoma, N. B., May 2015, In : Computer Speech and Language. 31, 1, p. 1-27 27 p.

    Research output: Contribution to journal › Article

  43. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis

    Wu, Z., Valentini-Botinhao, C., Watts, O. & King, S., 1 Apr 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, Australia, p. 4460-4464 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  44. Soft context clustering for F0 modeling in HMM-based speech synthesis

    Khorram, S., Sameti, H. & King, S., 9 Jan 2015, In : EURASIP Journal on Advances in Signal Processing. 2015, 1

    Research output: Contribution to journal › Article

  45. SAS: A Speaker Verification Spoofing Database Containing Diverse Attacks

    Wu, Z., Khodabakhsh, A., Demiroglu, C., Yamagishi, J., Saito, D., Toda, T. & King, S., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 4440-4444 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  46. 2014
  47. Voice source modelling using deep neural networks for statistical parametric speech synthesis

    Raitio, T., Lu, H., Kane, J., Suni, A., Vainio, M., King, S. & Alku, P., 1 Sep 2014, European Signal Processing Conference. European Signal Processing Conference, EUSIPCO, p. 2290-2294 5 p. 6952838

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  48. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In : Computer Speech and Language. 28, 5, p. 1083-1114 32 p.

    Research output: Contribution to journal › Article

  49. Intelligibility Enhancement of Speech in Noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., Sep 2014, Proceedings of the Institute of Acoustics 2014. Vol. 36. 8 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  50. Multiple-average-voice-based speech synthesis

    Lanchantin, P., Gales, M. J. F., King, S. & Yamagishi, J., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 285-289 5 p. 6853603

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  51. Neural net word representations for phrase-break prediction without a part of speech tagger

    Watts, O., Gangireddy, S., Yamagishi, J., King, S., Renals, S., Stan, A. & Giurgiu, M., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 2599-2603 5 p. 6854070

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  52. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journal › Editorial

  53. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In : Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journal › Article

  54. Introduction to the Special Issue on The listening talker: Context-dependent speech production and perception

    Cooke, M., King, S., Kleijn, W. B. & Stylianou, Y., Mar 2014, In : Computer Speech and Language. 28, 2, p. 540-542

    Research output: Contribution to journal › Article

  55. The listening talker: A review of human and algorithmic context-induced modifications of speech

    Cooke, M., King, S., Garnier, M. & Aubanel, V., Mar 2014, In : Computer Speech and Language. 28, 2, p. 543-571 29 p.

    Research output: Contribution to journal › Literature review

  56. Measuring a decade of progress in Text-to-Speech

    King, S., Jan 2014, In : Loquens. 1, 1, e006.

    Research output: Contribution to journal › Article
