Edinburgh Research Explorer

Prof Simon King

Personal Chair of Speech Processing

  1. 2019
  2. Speech Waveform Reconstruction using Convolutional Neural Networks with Noise and Periodic Inputs

    Watts, O., Valentini Botinhao, C. & King, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: IEEE, p. 7045-7049 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  3. Attentive filtering networks for audio replay attack detection

    Lai, C-I., Abad, A., Richmond, K., Yamagishi, J., Dehak, N. & King, S., 1 Feb 2019, (Accepted/In press) 2019 IEEE International Conference on Acoustics, Speech and Signal Processing.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  4. 2018
  5. Exemplar-based Speech Waveform Generation

    Watts, O., Valentini Botinhao, C., Espic calderón, F. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 2022-2026 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  6. Impact of different speech types on listening effort

    Symantiraki, O., Cooke, M. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2267-2271

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  7. Learning interpretable control dimensions for speech synthesis by using external data

    Hodari, Z., Watts, O., Ronanki, S. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 32-36 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  8. Measuring the cognitive load of synthetic speech using a dual task paradigm

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  9. Using pupillometry to measure the cognitive load of synthetic speech

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  10. Exemplar-based speech waveform generation for text-to-speech

    Valentini Botinhao, C., Watts, O., Espic calderón, F. & King, S., 3 Sep 2018, (Accepted/In press) 2018 IEEE Workshop on Spoken Language Technology (SLT). 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  11. 2017
  12. Using eigenvoices and nearest-neighbours in HMM-based cross-lingual speaker adaptation with limited data

    Sarfjoo, S. S., Demiroglu, C. & King, S., Apr 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 4, p. 839-851 13 p.

    Research output: Contribution to journalArticle

  13. Median-based generation of synthetic speech durations using a non-parametric approach

    Ronanki, S., Watts, O., King, S. & Henter, G. E., 9 Feb 2017, 2016 IEEE Spoken Language Technology Workshop (SLT). IEEE, p. 686-692 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  14. A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis

    Ronanki, S., Watts, O. & King, S., 2017, Proceedings Interspeech 2017. p. 1133-1137 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  15. Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis

    Espic calderón, F., Valentini Botinhao, C. & King, S., 2017, Interspeech 2017. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  16. Nativization of foreign names in TTS for automatic reading of world news in Swahili

    Mendelson, J., Oplustil, P., Watts, O. & King, S., 2017, Proceedings Interspeech 2017. p. 2188-2192 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  17. 2016
  18. GlottDNN - A full-band glottal vocoder for statistical parametric speech synthesis

    Airaksinen, M., Bollepalli, B., Juvela, L., Wu, Z., King, S. & Alku, P., 8 Sep 2016

    Research output: Contribution to conferencePaper

  19. DNN-based Speech Synthesis for Indian Languages from ASCII text

    Ronanki, S., Gangireddy, S. R., Bollepalli, B. & King, S., Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 74-79 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  20. Merlin: An Open Source Neural Network Speech Synthesis System

    Wu, Z., Watts, O. & King, S., Sep 2016, 9th ISCA Speech Synthesis Workshop (2016). p. 202-207 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  21. Waveform generation based on signal reshaping for statistical parametric speech synthesis

    Espic, F., Valentini Botinhao, C., Wu, Z. & King, S., Sep 2016, Interspeech 2016. San Francisco, United States, p. 2263-2267 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  22. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance

    Wu, Z., De Leon, P., Demiroglu, C., Khodabakhsh, A., King, S., Ling, Z., Saito, D., Stewart, B., Toda, T., Wester, M. & Yamagishi, J., Apr 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 4, p. 768 - 783 17 p.

    Research output: Contribution to journalArticle

  23. From HMMs to DNNs: Where Do the Improvements Come From?

    Watts, O., Henter, G. E., Merritt, T., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 5505-5509 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  24. Investigating gated recurrent neural networks for speech synthesis

    Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 1-5 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  25. Robust TTS Duration Modelling Using DNNs

    Henter, G., Ronanki, S., Watts, O., Wester, M., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 5130-5134 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  26. Testing the Consistency Assumption: Pronunciation Variant Forced Alignment in Read and Spontaneous Speech Synthesis

    Dall, R., Brognaux, S., Richmond, K., Valentini Botinhao, C., Henter, G., Hirschberg, J., Yamagishi, J. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 5155-5159 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  27. Speech synthesis

    King, S., 25 Feb 2016, Oxford Bibliographies in Linguistics. Aronoff, M. (ed.). New York: Oxford University Press, 29 p.

    Research output: Chapter in Book/Report/Conference proceedingOther chapter contribution

  28. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., Jan 2016, In : Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journalArticle

  29. Smooth talking: articulatory join costs for unit selection

    Richmond, K. & King, S., 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, p. 5150-5154 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  30. 2015
  31. Combining Lightly-supervised Learning and User Feedback to Construct Andimprove a Statistical Parametric Speech Synthesizer for Malay

    Chee Yong, L., Watts, O. & King, S., 15 Dec 2015, In : Research Journal of Applied Sciences, Engineering and Technology. 11, 11, p. 1227-1232 6 p.

    Research output: Contribution to journalArticle

  32. A study of speaker adaptation for DNN-based speech synthesis

    Wu, Z., Swietojanski, P., Veaux, C., Renals, S. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  33. Deep neural network context embeddings for model selection in rich-context HMM synthesis

    Merritt, T., Yamagishi, J., Wu, Z., Watts, O. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. Dresden: International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  34. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  35. Reconstructing Voices within the Multiple-Average-Voice-Model framework

    Lanchantin, P., Veaux, C., Gales, M. J. F., King, S. & Yamagishi, J., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2232-2236 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  36. Sentence-level control vectors for deep neural network speech synthesis

    Watts, O., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2217-2221 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  37. Towards minimum perceptual error training for DNN-based speech synthesis

    Valentini-Botinhao, C., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 869-873 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  38. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  39. A reading list of recent advances in speech synthesis

    King, S., 10 Aug 2015, Proc. 18th International Congress of Phonetic Sciences (ICPhS). T. S. C. F. ICP. . (ed.). Glasgow, UK: University of Glasgow

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  40. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

    Poblete, V., Espic, F., King, S., Stem, R. M., Huenupan, F., Fredes, J. & Yoma, N. B., May 2015, In : Computer Speech and Language. 31, 1, p. 1-27 27 p.

    Research output: Contribution to journalArticle

  41. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis.

    Wu, Z., Valentini-Botinhao, C., Watts, O. & King, S., 1 Apr 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, Australia, p. 4460-4464 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  42. Soft context clustering for F0 modeling in HMM-based speech synthesis

    Khorram, S., Sameti, H. & King, S., 9 Jan 2015, In : EURASIP Journal on Advances in Signal Processing. 2015, 1

    Research output: Contribution to journalArticle

  43. SAS: A Speaker Verification Spoofing Database Containing Diverse Attacks

    Wu, Z., Khodabakhsh, A., Demiroglu, C., Yamagishi, J., Saito, D., Toda, T. & King, S., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on . IEEE, p. 4440-4444 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  44. 2014
  45. Voice source modelling using deep neural networks for statistical parametric speech synthesis

    Raitio, T., Lu, H., Kane, J., Suni, A., Vainio, M., King, S. & Alku, P., 1 Sep 2014, European Signal Processing Conference. European Signal Processing Conference, EUSIPCO, p. 2290-2294 5 p. 6952838

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  46. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In : Computer Speech and Language. 28, 5, p. 1083–1114 32 p.

    Research output: Contribution to journalArticle

  47. Intelligibility Enhancement of Speech in Noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., Sep 2014, Proceedings of the Institute of Acoustics 2014. Vol. 36, 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  48. Multiple-average-voice-based speech synthesis

    Lanchantin, P., Gales, M. J. F., King, S. & Yamagishi, J., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 285-289 5 p. 6853603

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  49. Neural net word representations for phrase-break prediction without a part of speech tagger

    Watts, O., Gangireddy, S., Yamagishi, J., King, S., Renals, S., Stan, A. & Giurgiu, M., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 2599-2603 5 p. 6854070

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  50. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journalEditorial

  51. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In : Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journalArticle

  52. Introduction to the Special Issue on The listening talker: Context-dependent speech production and perception

    Cooke, M., King, S., Kleijn, W. B. & Stylianou, Y., Mar 2014, In : Computer Speech and Language. 28, 2, p. 540-542

    Research output: Contribution to journalArticle

  53. The listening talker: A review of human and algorithmic context-induced modifications of speech

    Cooke, M., King, S., Garnier, M. & Aubanel, V., Mar 2014, In : Computer Speech and Language. 28, 2, p. 543-571 29 p.

    Research output: Contribution to journalLiterature review

  54. Measuring a decade of progress in Text-to-Speech

    King, S., Jan 2014, In : Loquens. 1, 1, e006

    Research output: Contribution to journalArticle

  55. Statistical parametric speech synthesis for Ibibio

    Ekpenyong, M., Urua, E-A., Watts, O., King, S. & Yamagishi, J., Jan 2014, In : Speech Communication. 56, p. 243-251 9 p.

    Research output: Contribution to journalArticle

  56. Development of a Genre-Dependent TTS System with Cross-Speaker Speaking-Style Transplantation

    Lorenzo-Trueba, J., Echeverry-Correa, J. D., Barra-Chicote, R., San-Segundo, R., Ferreiros, J., Gallardo-Antolin, A., Yamagishi, J., King, S. & Montero, J. M., 2014, 2nd International Workshop on Speech, Language and Audio in Multimedia (SLAM2014). International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

Previous 1 2 3 4 5 Next