Edinburgh Research Explorer

Prof Simon King

Personal Chair of Speech Processing

  1. GlottDNN - A full-band glottal vocoder for statistical parametric speech synthesis

    Airaksinen, M., Bollepalli, B., Juvela, L., Wu, Z., King, S. & Alku, P., 8 Sep 2016.

    Research output: Contribution to conferencePaper

  2. Vocal attractiveness of statistical speech synthesisers

    Andraszewicz, S., Yamagishi, J. & King, S., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5368-5371 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  3. Mage - Reactive articulatory feature control of HMM-based parametric speech synthesis

    Astrinaki, M., Moinet, A., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dutoit, T., Aug 2013, 8th ISCA Workshop on Speech Synthesis: Barcelona, Spain. p. 227-231 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  4. Mage-HMM-based speech synthesis reactively controlled by the articulators

    Astrinaki, M., Moinet, A., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dutoit, T., Sep 2013, 8th ISCA Speech Synthesis Workshop. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  5. Reactive accent interpolation through an interactive map application

    Astrinaki, M., Yamagishi, J., King, S., d'Alessandro, N. & Dutoit, T., Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 245-246 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  6. Single Speaker Segmentation and Inventory Selection Using Dynamic Time Warping Self Organization and Joint Multigram Mapping

    Aylett, M. & King, S., 2008, SSW06. p. 258-263

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  7. Speech synthesis without a phone inventory

    Aylett, M., King, S. & Yamagishi, J., 2009, Interspeech. p. 2087-2090 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  8. An Accent-Independent Lexicon for Automatic Speech Recognition

    Bael, C. V. & King, S., 2003, Proc. ICPhS. p. 1165-1168 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  9. Generacion de una voz sintetica en Castellano basada en HSMM para la Evaluacion Albayzin 2008: conversion texto a voz

    Barra-Chicote, R., Yamagishi, J., Montero, J. M., King, S., Lutfi, S. & Macias-Guarasa, J., Nov 2008, V Jornadas en Tecnologia del Habla. p. 115-118 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  10. Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech

    Barra-Chicote, R., Yamagishi, J., King, S., Montero, J. M. & Macias-Guarasa, J., 1 May 2010, In : Speech Communication. 52, 5, p. 394-404 11 p.

    Research output: Contribution to journalArticle

  11. Diagonal priors for full covariance speech recognition

    Bell, P. & King, S., 2009, IEEE Workshop on automatic speech recognition and understanding. Institute of Electrical and Electronics Engineers (IEEE), p. 113-117 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  12. A Shrinkage Estimator for Speech Recognition with Full Covariance HMMs

    Bell, P. & King, S., Sep 2008, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  13. Sparse Gaussian Graphical Models for Speech Recognition

    Bell, P. & King, S., 1 Aug 2007, Interspeech 2007: 8th Annual Conference of the International Speech Communication Association. p. 2113-2116 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  14. Covariance Updates for Discriminative Training by Constrained Line Search

    Bell, P. & King, S., Sep 2008, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  15. An articulatory feature-based tandem approach and factored observation modeling

    Cetin, O., Kantor, A., King, S., Bartels, C., Magimai-Doss, M., Frankel, J. & Livescu, K., 1 Apr 2007, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007 (ICASSP 2007). Vol. 4. p. 645-648 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  16. Combining Lightly-supervised Learning and User Feedback to Construct Andimprove a Statistical Parametric Speech Synthesizer for Malay

    Chee Yong, L., Watts, O. & King, S., 15 Dec 2015, In : Research Journal of Applied Sciences, Engineering and Technology. 11, 11, p. 1227-1232 6 p.

    Research output: Contribution to journalArticle

  17. Evaluating Near End Listening Enhancement Algorithms in Realistic Environments

    Chermaz, C., Valentini Botinhao, C., Schepker, H. & King, S., 17 Jun 2019, (Accepted/In press) Proceedings Interspeech 2019. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  18. Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech

    Christensen, H., Aniol, M., Bell, P., Green, P., Hain, T., King, S. & Swietojanski, P., 1 Aug 2013, Proc. Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  19. Multisyn voices from ARCTIC data for the Blizzard challenge

    Clark, R. A. J., Richmond, K. & King, S., 1 Sep 2005, Interspeech 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 101-104

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  20. Multisyn: Open-domain unit selection for the Festival speech synthesis system

    Clark, R. A. J., Richmond, K. & King, S., 2007, In : Speech Communication. 49, 4, p. 317-330 14 p.

    Research output: Contribution to journalArticle

  21. Joint Prosodic and Segmental Unit Selection Speech Synthesis

    Clark, R. A. J. & King, S., 1 Sep 2006, Interspeech 2006- ICSLP: 9th International Conference on Spoken Language Processing. International Speech Communication Association, 1262

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  22. Statistical Analysis of the Blizzard Challenge 2007 Listening Test Results

    Clark, R. A. J., Podsiadlo, M., Fraser, M., Mayo, C. & King, S., 1 Aug 2007, Proc. Blizzard 2007 (in Proc. Sixth ISCA Workshop on Speech Synthesis). International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  23. Multisyn Voices for the Blizzard Challenge 2006

    Clark, R., Richmond, K., Strom, V. & King, S., 1 Sep 2006, Proc. Blizzard Challenge Workshop (Interspeech Satellite).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  24. Festival 2 -- build your own general purpose unit selection speech synthesiser

    Clark, R. A. J., Richmond, K. & King, S., 2004, 5th ISCA workshop on speech synthesis. International Speech Communication Association, p. 173-178 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  25. Introduction to the Special Issue on The listening talker: Context-dependent speech production and perception

    Cooke, M., King, S., Kleijn, W. B. & Stylianou, Y., Mar 2014, In : Computer Speech and Language. 28, 2, p. 540-542

    Research output: Contribution to journalArticle

  26. The listening talker: A review of human and algorithmic context-induced modifications of speech

    Cooke, M., King, S., Garnier, M. & Aubanel, V., Mar 2014, In : Computer Speech and Language. 28, 2, p. 543-571 29 p.

    Research output: Contribution to journalLiterature review

  27. Rating Naturalness in Speech Synthesis: The Effect of Style and Expectation

    Dall, R., Yamagishi, J. & King, S., 2014, Speech Prosody 2014.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  28. Analysis of Speaker Clustering Strategies for HMM-Based Speech Synthesis

    Dall, R., Veaux, C., Yamagishi, J. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  29. Testing the Consistency Assumption: Pronunciation Variant Forced Alignment in Read and Spontaneous Speech Synthesis

    Dall, R., Brognaux, S., Richmond, K., Valentini Botinhao, C., Henter, G., Hirschberg, J., Yamagishi, J. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5155-5159 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  30. Investigating Automatic & Human Filled Pause Insertion for Speech Synthesis

    Dall, R., Tomalin, M., Wester, M., Byrne, W. & King, S., 2014, Proc. Interspeech. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  31. Measuring the Gap Between HMM-Based ASR and TTS

    Dines, J., Yamagishi, J. & King, S., 1 Dec 2010, In : IEEE Journal of Selected Topics in Signal Processing. 4, 6, p. 1046-1058 13 p.

    Research output: Contribution to journalArticle

  32. Measuring the gap between HMM-based ASR and TTS

    Dines, J., Yamagishi, J. & King, S., 1 Sep 2009, Interspeech 2009: 10th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1391-1394 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  33. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

    Dines, J., Liang, H., Saheer, L., Gibson, M., Byrne, W., Oura, K., Tokuda, K., Yamagishi, J., King, S., Wester, M., Hirsimäki, T., Karhila, R. & Kurimo, M., Feb 2013, In : Computer Speech and Language. 27, 2, p. 420-437 18 p.

    Research output: Contribution to journalArticle

  34. Statistical parametric speech synthesis for Ibibio

    Ekpenyong, M., Urua, E-A., Watts, O., King, S. & Yamagishi, J., Jan 2014, In : Speech Communication. 56, p. 243-251 9 p.

    Research output: Contribution to journalArticle

  35. Waveform generation based on signal reshaping for statistical parametric speech synthesis

    Espic, F., Valentini Botinhao, C., Wu, Z. & King, S., 12 Sep 2016, Interspeech 2016. San Francisco, United States, p. 2263-2267 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  36. Direct Modelling of Magnitude and Phase Spectra for Statistical Parametric Speech Synthesis

    Espic calderón, F., Valentini Botinhao, C. & King, S., 20 Aug 2017, Interspeech 2017. 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  37. An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces

    Frankel, J., Richmond, K., King, S. & Taylor, P., Oct 2000, Sixth International Conference on Spoken Language Processing (ICSLP 2000). International Speech Communication Association, p. 254-257 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  38. ASR - Articulatory Speech Recognition

    Frankel, J. & King, S., 1 Sep 2001, Eurospeech 2001: 7th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 599-602 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  39. A Hybrid ANN/DBN Approach to Articulatory Feature Recognition

    Frankel, J. & King, S., 1 Sep 2005, Interspeech 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 3045-3048 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  40. Articulatory feature recognition using dynamic Bayesian networks

    Frankel, J., Wester, M. & King, S., 1 Sep 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1477-1480 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  41. Speech Recognition using Linear Dynamic Models

    Frankel, J. & King, S., 1 Jan 2007, In : IEEE Transactions on Audio, Speech and Language Processing. 15, 1, p. 246-256 11 p.

    Research output: Contribution to journalArticle

  42. Articulatory feature recognition using dynamic Bayesian networks

    Frankel, J., Wester, M. & King, S., 1 Oct 2007, In : Computer Speech and Language. 21, 4, p. 620-640 21 p.

    Research output: Contribution to journalArticle

  43. Speech recognition in the articulatory domain: investigating an alternative to acoustic HMMs

    Frankel, J. & King, S., 1 Apr 2001, Proc. Workshop on Innovations in Speech Processing.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  44. Articulatory Feature Classifiers Trained on 2000 hours of Telephone Speech

    Frankel, J., Magimai-Doss, M., King, S., Livescu, K. & Çetin, Ã., 1 Aug 2007, Interspeech 2007: 8th Annual Conference of the International Speech Communication Association. p. 2485-2488 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  45. Growing bottleneck features for tandem ASR

    Frankel, J., Wang, D. & King, S., 2008, Proc. Interspeech. p. 1549

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  46. Observation Process Adaptation for Linear Dynamic Models

    Frankel, J. & King, S., 1 Sep 2006, In : Speech Communication. 48, 9, p. 1192-1199 8 p.

    Research output: Contribution to journalArticle

  47. Factoring Gaussian Precision Matrices for Linear Dynamic Models

    Frankel, J. & King, S., 1 Dec 2007, In : Pattern Recognition Letters. 28, 16, p. 2264-2272 9 p.

    Research output: Contribution to journalArticle

  48. The Blizzard Challenge 2007

    Fraser, M. & King, S., 1 Aug 2007, The Blizzard Challenge 2007: Workshop : in Sixth ISCA Workshop on Speech Synthesis. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  49. Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup

    Geng, C., Turk, A., Scobbie, J. M., Macmartin, C., Hoole, P., Richmond, K., Wrench, A., Pouplier, M., Bard, E., Campbell, Z., Dickie, C., Dubourg, E., Hardcastle, W., Kainada, E., King, S., Lickley, R., Nakai, S., Renals, S., White, K. & Wiegand, R., Nov 2013, In : Journal of Phonetics. 41, 6, p. 421-431 11 p.

    Research output: Contribution to journalArticle

  50. Transforming F0 Contours

    Gillett, B. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 101-104 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  51. Transforming Voice Quality

    Gillett, B. & King, S., 1 Sep 2003, EUROSPEECH 2003 - INTERSPEECH 2003 : 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 1713-1716 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  52. Bayesian networks for phone duration prediction

    Goubanova, O. & King, S., Apr 2008, In : Speech Communication. 50, 4, p. 301-311 11 p.

    Research output: Contribution to journalArticle

  53. Predicting Consonant Duration with Bayesian Belief Networks

    Goubanova, O. & King, S., 2005, Interspeech 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 1941-1944 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  54. Measuring the cognitive load of synthetic speech using a dual task paradigm

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  55. Measuring the contribution to cognitive load of each predicted vocoder speech parameter in DNN-based speech synthesis

    Govender, A., Valentini-Botinhao, C. & King, S., 2 Jul 2019, (Accepted/In press) 10th ISCA Speech Synthesis Workshop. 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  56. Using pupillometry to measure the cognitive load of synthetic speech

    Govender, A. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2843-2847 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  57. Inductive String Template-Based Learning of Spoken Language

    Gutkin, A. & King, S., 1 May 2005, Proc. 5th International Workshop on Pattern Recognition in Information Systems (PRIS-2005), : In conjunction with the 7th International Conference on Enterprise Information Systems (ICEIS-2005). Gamboa, H. & Fred, A. (eds.). Miami, USA: INSTICC Press, p. 43-51 9 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  58. Detection of Symbolic Gestural Events in Articulatory Data for Use in Structural Representations of Continuous Speech

    Gutkin, A. & King, S., 1 Mar 2005, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005 (ICASSP-05). Philadelphia, PA, USA: IEEE Signal Processing Society Press, Vol. I. p. 885-888 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  59. Structural Representation of Speech for Phonetic Classification

    Gutkin, A. & King, S., 1 Aug 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004 (ICPR 2004). Cambridge, UK: Institute of Electrical and Electronics Engineers (IEEE), Vol. 3. p. 438-441 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  60. Phone classification in pseudo-Euclidean Vector Spaces

    Gutkin, A. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1453-1457 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  61. Impacts of machine translation and speech synthesis on speech-to-speech translation

    Hashimoto, K., Yamagishi, J., Byrne, W., King, S. & Tokuda, K., 2012, In : Speech Communication. 54, 7, p. 857-866 10 p.

    Research output: Contribution to journalArticle

  62. An analysis of machine translation and speech synthesis in speech-to-speech translation system

    Hashimoto, K., Yamagishi, J., Byrne, W., King, S. & Tokuda, K., May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5108-5111 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  63. Measuring the Perceptual Effects of Modelling Assumptions in Speech Synthesis Using Stimuli Constructed from Repeated Natural Speech

    Henter, G. E., Merritt, T., Shannon, M., Mayo, C. & King, S., 2014, INTERSPEECH 2014 15th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1504-1508 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  64. Robust TTS Duration Modelling Using DNNs

    Henter, G., Ronanki, S., Watts, O., Wester, M., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5130-5134 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  65. Learning interpretable control dimensions for speech synthesis by using external data

    Hodari, Z., Watts, O., Ronanki, S. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 32-36 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  66. Discriminative Methods for Improving Named Entity Extraction on Speech Data

    Horlock, J. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 2765-2768 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  67. Named Entity Extraction from Word Lattices

    Horlock, J. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 1265-1268 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  68. Prosodic Information in a Speech Recognition System intended for Dialogue

    Isard, S., King, S., Taylor, P. & Kowtko, J., 1995, IEEE Workshop in speech recognition. Institute of Electrical and Electronics Engineers (IEEE)

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  69. Unsupervised lexical clustering of speech segments using fixed dimensional acoustic embeddings

    Kamper, H., Jansen, A., King, S. & Goldwater, S., 2014, Proceedings of the IEEE Spoken Language Technology Workshop. Institute of Electrical and Electronics Engineers (IEEE), 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  70. The Blizzard Challenge 2008

    Karaiskos, V., King, S., Clark, R. A. J. & Mayo, C., 2008, Proc. Blizzard Challenge Workshop.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  71. Soft context clustering for F0 modeling in HMM-based speech synthesis

    Khorram, S., Sameti, H. & King, S., 9 Jan 2015, In : EURASIP Journal on Advances in Signal Processing. 2015, 1

    Research output: Contribution to journalArticle

  72. Measuring a decade of progress in Text-to-Speech

    King, S., Jan 2014, In : Loquens. 1, 1, e006.

    Research output: Contribution to journalArticle

  73. Inventory design for Verbmobil Teilprojekt 4.4

    King, S., 1 Oct 1996.

    Research output: Working paper

  74. Speech recognition via phonetically-featured syllables

    King, S., Taylor, P., Frankel, J. & Richmond, K., 2000, PHONUS 5 : Proceedings of the Workshop on "Phonetics and Phonology in ASR. Parameters and Features, and their Implications". Barry, W. J. & Koreman, J. (eds.). Saarbruken: Institute of Phonetics, Vol. 5. p. 15-34 20 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  75. SVitchboard 1: Small Vocabulary Tasks from Switchboard 1

    King, S., Bartels, C. & Bilmes, J., 2005, Interspeech 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 3385-3388 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  76. Unsupervised adaptation for HMM-based speech synthesis

    King, S., Tokuda, K., Zen, H. & Yamagishi, J., Sep 2008, Proc. Interspeech. ISCA, p. 1869-1872 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  77. A reading list of recent advances in speech synthesis

    King, S., 10 Aug 2015, Proc. 18th International Congress of Phonetic Sciences (ICPhS). T. S. C. F. ICP. . (ed.). Glasgow, UK: University of Glasgow

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  78. Speech production knowledge in automatic speech recognition

    King, S., Frankel, J., Livescu, K., McDermott, E., Richmond, K. & Wester, M., 1 Feb 2007, In : Journal of the acoustical society of america. 121, 2, p. 723-742 20 p.

    Research output: Contribution to journalArticle

  79. Dynamical System Modelling of Articulator Movement

    King, S. & Wrench, A., 1 Aug 1999, ICPhS 99: Proceedings of the XIVth International Congress of Phonetic Sciences. San Francisco: International Congress of Phonetic Sciences, p. 2259-2262 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  80. Speech Synthesis

    King, S., Ellis, D. & Morgan, N., 2011, Speech and Audio Signal Processing. Gold, B., Morgan, N. & Ellis, D. (eds.). 2nd ed. Wiley, p. 431-454 24 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter

  81. Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noise

    King, S., Yamagishi, J. & Valentini-Botinhao, C., 2012, Proc. SAPA-SCALE Workshop on Statistical and Perceptual Audition (SAPA-SCALE 2012). Portland, OR, USA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  82. Users Manual for Verbmobil Teilprojekt 4.4

    King, S., 1 Oct 1996, Rheinische Friedrich-Wilhelms-Universität Bonn.

    Research output: Book/ReportBook

  83. A tutorial on HMM speech synthesis (Invited paper)

    King, S., 2010, Sadhana -- Academy Proceedings in Engineering Sciences, Indian Institute of Sciences.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  84. Dependence and independence in automatic speech recognition and synthesis

    King, S., 2003, In : Journal of Phonetics. 31, 3-4, p. 407-411 5 p.

    Research output: Contribution to journalArticle

  85. Speech Recognition via Phonetically Featured Syllables

    King, S., Stephenson, T., Isard, S., Taylor, P. & Strachan, A., 1 Dec 1998, ICSLP `98: 5th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1031-1034 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  86. Speech synthesis

    King, S., 25 Feb 2016, Oxford Bibliographies in Linguistics. Aronoff, M. (ed.). New York: Oxford University Press, 29 p.

    Research output: Chapter in Book/Report/Conference proceedingOther chapter contribution

  87. The Blizzard Challenge 2009

    King, S. & Karaiskos, V., 2009.

    Research output: Contribution to conferencePaper

  88. An introduction to statistical parametric speech synthesis

    King, S., Oct 2011, In : Sadhana-Academy proceedings in engineering sciences. 36, 5, p. 837-852 16 p.

    Research output: Contribution to journalArticle

  89. Final report for Verbmobil Teilprojekt 4.4

    King, S., 1 Jan 1997.

    Research output: Working paper

  90. Detection of Phonological Features in Continuous Speech using Neural Networks

    King, S. & Taylor, P., 2000, In : Computer Speech and Language. 14, 4, p. 333-353 21 p.

    Research output: Contribution to journalArticle

  91. Handling Variation in Speech and Language Processing

    King, S., Mar 2006, Encyclopedia of Language and Linguistics. Brown, K. (ed.). 2nd ed. Elsevier, p. 199-203 5 p.

    Research output: Chapter in Book/Report/Conference proceedingOther chapter contribution

  92. Speech synthesis using non-uniform units in the Verbmobil project

    King, S., Portele, T. & Höfer, F., 1 Sep 1997, Proc. Eurospeech 97: 5th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 569-572 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  93. Speech Synthesis

    King, S., Sep 2011, Speech and Audio Signal Processing. Gold, B., Morgan, N. & Ellis, D. (eds.). 2nd ed. Wiley, p. 431-454 23 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  94. Using Information Above the Word Level for Automatic Speech Recognition

    King, S., 1998, University of Edinburgh.

    Research output: ThesisDoctoral Thesis

  95. Speech Technologies: Language Variation

    King, S., Mar 2006, Encyclopedia of Language and Linguistics. Brown, K. (ed.). 2nd ed. Elsevier, p. 56-61 6 p.

    Research output: Chapter in Book/Report/Conference proceedingOther chapter contribution

  96. Personalising speech-to-speech translation in the EMIME project

    Kurimo, M., Byrne, W., Dines, J., Garner, P. N., Gibson, M., Guan, Y., Hirsimaki, T., Karhila, R., King, S., Liang, H., Oura, K., Saheer, L., Shannon, M., Shiota, S., Tian, J., Tokuda, K., Wester, M., Wu, Y-J. & Yamagishi, J., Jul 2010, Proceedings of the ACL 2010 System Demonstrations. p. 48-53 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  97. Attentive filtering networks for audio replay attack detection

    Lai, C-I., Abad, A., Richmond, K., Yamagishi, J., Dehak, N. & King, S., 17 Apr 2019, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing. p. 6316-6320

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  98. Cross-Lingual Automatic Speech Recognition Using Tandem Features

    Lal, P. & King, S., Dec 2013, In : IEEE Transactions on Audio, Speech and Language Processing. 21, 12, p. 2506-2515 10 p.

    Research output: Contribution to journalArticle

  99. Multiple-average-voice-based speech synthesis

    Lanchantin, P., Gales, M. J. F., King, S. & Yamagishi, J., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 285-289 5 p. 6853603

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  100. Reconstructing Voices within the Multiple-Average-Voice-Model framework

    Lanchantin, P., Veaux, C., Gales, M. J. F., King, S. & Yamagishi, J., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2232-2236 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  101. Formant-controlled HMM-based speech synthesis

    Lei, M., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dai, L-R., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2777-2780 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  102. Manual transcription of conversational speech at the articulatory feature level

    Livescu, K., Bezman, A., Borges, N., Yung, L., C‡etin, O., Frankel, J., King, S., Magimai-Doss, M., Chi, X. & Lavoie, L., 1 Apr 2007, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007 (ICASSP 2007). Vol. 4. p. 953-956 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  103. Articulatory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU Summer Workshop

    Livescu, K., C‡etin, O., Hasegawa-Johnson, M., King, S., Bartels, C., Borges, N., Kantor, A., Lal, P., Yung, L., Bezman Dawson-Haggerty, S., Woods, B., Frankel, J., Magimai-Doss, M. & Saenko, K., 1 Apr 2007, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007 (ICASSP 2007). Vol. 4. p. 621-621 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  104. Simple4All proposals for the Albayzin Evaluations in Speech Synthesis

    Lorenzo-Trueba, J., Watts, O., Barra-Chicote, R., Yamagishi, J., King, S. & Montero, J. M., 2012, Proc. Iberspeech 2012.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  105. Development of a Genre-Dependent TTS System with Cross-Speaker Speaking-Style Transplantation

    Lorenzo-Trueba, J., Echeverry-Correa, J. D., Barra-Chicote, R., San-Segundo, R., Ferreiros, J., Gallardo-Antolin, A., Yamagishi, J., King, S. & Montero, J. M., 2014, 2nd International Workshop on Speech, Language and Audio in Multimedia (SLAM2014). International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  106. Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis

    Lu, H., King, S. & Watts, O., 1 Aug 2013, 8th ISCA Speech Synthesis Workshop. p. 261-265 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  107. Using Bayesian Networks to find relevant context features for HMM-based speech synthesis

    Lu, H. & King, S., 1 Sep 2012, INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. ISCA-INST SPEECH COMMUNICATION ASSOC

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  108. Factorized context modelling for Text-to-Speech synthesis

    Lu, H. & King, S., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7849-7853 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  109. Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments

    Mamiya, Y., Stan, A., Yamagishi, J., Bell, P., Watts, O., Clark, R. & King, S., Aug 2013, Proc. 8th ISCA Speech Synthesis Workshop. p. 61-66 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  110. Lightly supervised GMM VAD to use audiobook for speech synthesiser

    Mamiya, Y., Yamagishi, J., Watts, O., Clark, R. A. J., King, S. & Stan, A., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7987-7991 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  111. Multidimensional Scaling of Listener Responses to Synthetic Speech

    Mayo, C., Clark, R. A. J. & King, S., 1 Sep 2005, Interspeech 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 1725-1728 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  112. Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis

    Mayo, C., Clark, R. A. J. & King, S., 1 Mar 2011, In : Speech Communication. 53, 3, p. 311-326 15 p.

    Research output: Contribution to journalArticle

  113. Nativization of foreign names in TTS for automatic reading of world news in Swahili

    Mendelson, J., Oplustil, P., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 2188-2192 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  114. Deep neural network context embeddings for model selection in rich-context HMM synthesis

    Merritt, T., Yamagishi, J., Wu, Z., Watts, O. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. Dresden: International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  115. Investigating the shortcomings of HMM synthesis

    Merritt, T. & King, S., 1 Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 185-190 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  116. Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

    Oura, K., Tokuda, K., Yamagishi, J., Wester, M. & King, S., 2010, Proceedings of ICASSP. Vol. 1. p. 4954-4957 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  117. Unsupervised English-to-Japanese speaker adaptation for HMM-based speech synthesis.

    Oura, K., Yamagishi, J., King, S., Wester, M. & Tokuda, K., Dec 2009, Proceedings of the Acoustical Society of Japan : Autmn meeting. Vol. I 3-P-18. p. 401-402 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  118. Unsupervised speaker adaptation for speech-to-speech translation system.

    Oura, K., Yamagishi, J., Wester, M., King, S. & Tokuda, K., Dec 2009, Proceedings SLP 2009. 356 ed. Tokyo, Vol. 109. p. 13-18 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  119. Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

    Oura, K., Yamagishi, J., Wester, M., King, S. & Tokuda, K., 1 Jul 2012, In : Speech Communication. 54, 6, p. 703-714 12 p.

    Research output: Contribution to journalArticle

  120. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

    Poblete, V., Espic, F., King, S., Stem, R. M., Huenupan, F., Fredes, J. & Yoma, N. B., May 2015, In : Computer Speech and Language. 31, 1, p. 1-27 27 p.

    Research output: Contribution to journalArticle

  121. Voice source modelling using deep neural networks for statistical parametric speech synthesis

    Raitio, T., Lu, H., Kane, J., Suni, A., Vainio, M., King, S. & Alku, P., 1 Sep 2014, European Signal Processing Conference. European Signal Processing Conference, EUSIPCO, p. 2290-2294 5 p. 6952838

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  122. Automatic speech recognition

    Renals, S. & King, S., Feb 2010, Handbook of Phonetic Sciences. Hardcastle, W. J., Laver, J. & Gibbon, F. E. (eds.). 2nd ed. Wiley-Blackwell, Vol. 1.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  123. Modelling the Uncertainty in Recovering Articulation from Acoustics

    Richmond, K., King, S. & Taylor, P., Apr 2003, In : Computer Speech and Language. 17, 2-3, p. 153-172 20 p.

    Research output: Contribution to journalArticle

  124. Smooth talking: articulatory join costs for unit selection

    Richmond, K. & King, S., 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5150-5154 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  125. Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus

    Richmond, K., Hoole, P. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1505-1508 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  126. DNN-based Speech Synthesis for Indian Languages from ASCII text

    Ronanki, S., Gangireddy, S. R., Bollepalli, B. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 74-79 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  127. Median-based generation of synthetic speech durations using a non-parametric approach

    Ronanki, S., Watts, O., King, S. & Henter, G. E., 9 Feb 2017, 2016 IEEE Spoken Language Technology Workshop (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 686-692 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  128. A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis

    Ronanki, S., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 1133-1137 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  129. Framewise phone classification using support vector machines

    Salomon, J., King, S. & Osborne, M., 2002, ICSLP- 2002: Proceedings of the 7th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2645-2648 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  130. Detecting Acronyms from Capital Letter Sequences in Spanish

    San-Segundo, R., Montero, J. M., Lopez-Luden, V. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  131. Using eigenvoices and nearest-neighbours in HMM-based cross-lingual speaker adaptation with limited data

    Sarfjoo, S. S., Demiroglu, C. & King, S., Apr 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 4, p. 839-851 13 p.

    Research output: Contribution to journalArticle

  132. The Edinburgh Speech Production Facility DoubleTalk Corpus

    Scobbie, J., Turk, A., Geng, C., King, S., Lickley, R. & Richmond, K., 1 Aug 2013, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  133. Estimating detailed spectral envelopes using articulatory clustering

    Shiga, Y. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2485-2488 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  134. Estimation of voice source and vocal tract characteristics based on multi-frame analysis

    Shiga, Y. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, Vol. 3. p. 1749-1752 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  135. Source-Filter Separation for Articulation-to-Speech Synthesis

    Shiga, Y. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1913-1916 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  136. Estimating the Spectral Envelope of Voiced Speech Using Multi-frame Analysis

    Shiga, Y. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, Vol. 3. p. 1737-1740 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  137. Accurate spectral envelope estimation for articulation-to-speech synthesis

    Shiga, Y. & King, S., 1 Jun 2004, Proc. 5th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 19-24 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  138. Where are the challenges in speaker diarization?

    Sinclair, M. & King, S., 21 Oct 2013, IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013. Institute of Electrical and Electronics Engineers (IEEE), p. 7741-7745 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  139. The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate

    Stan, A., Yamagishi, J., King, S. & Aylett, M., Mar 2011, In : Speech Communication. 53, 3, p. 442-450 9 p.

    Research output: Contribution to journalArticle

  140. TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

    Stan, A., Watts, O., Mamiya, Y., Giurgiu, M., Clark, R. A. J., Yamagishi, J. & King, S., 2013, INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association: Lyon, France, August 25-29, 2013. ISCA-INST SPEECH COMMUNICATION ASSOC, p. 2331-2335 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  141. A grapheme-based method for automatic alignment of speech and text data

    Stan, A., Bell, P. & King, S., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 286-290 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  142. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., Jan 2016, In : Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journalArticle

  143. Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data

    Stan, A., Bell, P., Yamagishi, J. & King, S., 1 Aug 2013, Proc Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  144. Modelling Prominence and Emphasis Improves Unit-Selection Synthesis

    Strom, V., Nenkova, A., Clark, R., Vazquez-Alvarez, Y., Brenier, J., King, S. & Jurafsky, D., 1 Aug 2007, Interspeech 2007: 8th Annual Conference of the International Speech Communication Association. p. 1282-1285

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  145. Investigating Festival's target cost function using perceptual experiments

    Strom, V. & King, S., 2008, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  146. A Classifier-based target cost for unit selection speech synthesis trained on perceptual data

    Strom, V. & King, S., 2010, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  147. Expressive Prosody for Unit-selection Speech Synthesis

    Strom, V., Clark, R. & King, S., 2006, Interspeech 2006 - ICSLP: 9th International Conference on Spoken Language Processing. International Speech Communication Association, 1522

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  148. Impact of different speech types on listening effort

    Symantiraki, O., Cooke, M. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2267-2271

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  149. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journalEditorial

  150. Using Intonation to Constrain Language Models in Speech Recognition

    Taylor, P., King, S., Isard, S., Wright, H. & Kowtko, J., 1997, Proc. Eurospeech'97: 5th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 2763-2766 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  151. Intonation and Dialogue Context as Constraints for Speech Recognition

    Taylor, P., King, S., Isard, S. D. & Wright, H., 1998, In : Language and Speech. 41, 3, p. 493-512 20 p.

    Research output: Contribution to journalArticle

  152. Using Prosodic Information to Constrain Language Models for Spoken dialogue

    Taylor, P., Shimodaira, H., Isard, S., King, S. & Kowtko, J., Oct 1996, Proceedings of the Fourth International Conference on Spoken Language, 1996 (ICSLP `96). Vol. 1. p. 216-219 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  153. A Posterior Probability-based System Hybridisation and Combination for Spoken Term Detection

    Tejedor, J., Wang, D., King, S., Frankel, J. & Colas, J., Sep 2009, Interspeech. Citeseer, Vol. 2009. p. 2131-2134 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  154. A novel two-level architecture plus confidence measures for a keyword spotting system.

    Tejedor, J., King, S., Frankel, J., Wang, D., Colas, J. & Garrido, J., Dec 2009, Proceedings of the 5th Biennial Workshop on Speech Technology. p. 247-250 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  155. Augmented set of features for confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Bautista, M., King, S., Wang, D. & Colas, J., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  156. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In : Computer Speech and Language. 28, 5, p. 1083–1114 32 p.

    Research output: Contribution to journalArticle

  157. Discriminative Tandem Features for HMM-based EEG Classification

    Ting, C-M., King, S., Salleh, S-H. & Ariff, A. K., 1 Jul 2013, Proc. 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 13). IEEE Engineering in Medicine and Biology Society, Vol. 2013. p. 3957-3960

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  158. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  159. Cross-lingual Portability of MLP-Based Tandem Features--A Case Study for English and Hungarian

    Toth, L., Frankel, J., Gosztolya, G. & King, S., 2008, Proc. Interspeech. p. 2695-2698 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  160. The Edinburgh Speech Production Facility Dialogue Corpus

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., King, S. & Renals, S., 2010

    Research output: Non-textual formDigital or Visual Products

  161. An Edinburgh Speech Production Facility

    Turk, A., Scobbie, J., Geng, C., Dickie, C., Bard, E., Hardcastle, W., Hartinger, M., King, S., Lickley, R., Renals, S., Richmond, K., Schaeffler, S., White, K. & Wrench, A., Jul 2010, (Unpublished).

    Research output: Contribution to conferencePoster

  162. The Edinburgh Speech Production Facility's articulatory corpus of spontaneous dialogue.

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., Bard, E., Campbell, B., Dickie, C., Dubourg, E., Hardcastle, B., Hoole, P., Kanaida, E., Lickley, R., Nakai, S., Pouplier, M., King, S., Renals, S., Richmond, K., Schaeffler, S., Wiegand, R., White, K. & 1 othersWrench, A., 2010, In : Journal of the acoustical society of america. 128, 4, p. 2429-2429 1 p.

    Research output: Contribution to journalArticle

  163. Exemplar-based speech waveform generation for text-to-speech

    Valentini Botinhao, C., Watts, O., Espic Calderón, F. & King, S., 14 Feb 2019, 2018 IEEE Workshop on Spoken Language Technology (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 332-338 7 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  164. Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise

    Valentini-Botinhao, C., Wester, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Workshop on Speech Synthesis. p. 133-138 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  165. Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Maia, R., Yamagishi, J., King, S. & Zen, H., 2012, 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP): Kyoto, Japan. NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 3997-4000 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  166. Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods.

    Valentini-Botinhao, C., Godoy, E., Stylianou, Y., Sauert, B., King, S. & Yamagishi, J., May 2013, Proc. ICASSP - Vancouver, Canada.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  167. Intelligibility Enhancement of Speech in Noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., Sep 2014, Proceedings of the Institute of Acoustics 2014. Vol. 36. 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  168. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In : Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journalArticle

  169. Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5112-5115 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  170. Using an intelligibility measure to create noise robust cepstral coefficients for HMM-based speech synthesis

    Valentini-Botinhao, C., Yamagishi, J. & King, S., May 2012, Proc. LISTA Workshop: Edinburgh, UK.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  171. Can Objective Measures Predict the Intelligibility of Modified HMM-based Synthetic Speech in Noise?

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1837-1840 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  172. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Stylianou, Y., 1 Aug 2013, Interspeech 2013.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  173. Towards minimum perceptual error training for DNN-based speech synthesis

    Valentini-Botinhao, C., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 869-873 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  174. Voice Banking and Voice Reconstruction for MND patients

    Veaux, C., Yamagishi, J. & King, S., 2011, ASSETS 11: Proceedings of the 13th International ACM Sigaccess conference on computers and accessibility. New York: ASSOC COMPUTING MACHINERY, p. 305-306 2 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  175. Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction

    Veaux, C., Yamagishi, J. & King, S., 2013, SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies. ISCA, p. 107-111 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  176. Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disorders

    Veaux, C., Yamagishi, J. & King, S., Sep 2012, Proceedings of INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. p. 967-970 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  177. The voice bank corpus: Design, collection and data analysis of a large regional accent speech database

    Veaux, C., Yamagishi, J. & King, S., Nov 2013, Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference. Institute of Electrical and Electronics Engineers (IEEE), 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  178. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  179. Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis

    Vepa, J. & King, S., 1 Sep 2006, In : IEEE Transactions on Audio, Speech and Language Processing. 14, 5, p. 1763-1771 9 p.

    Research output: Contribution to journalArticle

  180. Kalman-filter based Join Cost for Unit-selection Speech Synthesis

    Vepa, J. & King, S., 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 293-296 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  181. Join Cost for Unit Selection Speech Synthesis

    Vepa, J. & King, S., 2004, Text to Speech Synthesis: New paradigms and advances. Alwan, A. & Narayanan, S. (eds.). Prentice Hall

    Research output: Chapter in Book/Report/Conference proceedingOther chapter contribution

  182. Subjective Evaluation Of Join Cost Functions Used In Unit Selection Speech Synthesis

    Vepa, J. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1181-1184 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  183. Objective Distance Measures for Spectral Discontinuities in Concatenative Speech Synthesis

    Vepa, J., King, S. & Taylor, P., 1 Sep 2002, ICSLP 2002: 7th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2605-2608 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  184. New Objective Distance Measures for Spectral Discontinuities in Concatenative Speech Synthesis

    Vepa, J., King, S. & Taylor, P., 1 Sep 2002, Proceedings of the 2002 IEEE workshop on speech synthesis. p. 223-226 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  185. Subjective evaluation of join cost and smoothing methods

    Vepa, J. & King, S., 1 Jun 2004, Proc. 5th ISCA speech synthesis workshop. International Speech Communication Association, p. 7-12 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  186. Stochastic Pronunciation Modelling and Soft Match for Out-of-vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J. & Bell, P., 1 Mar 2010, Proceedings of the 2010 IEEE International conference on Acoustic Speech and Signal Processing (ICASSP). NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 5294-5297 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  187. CRF-based Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, Interspeech 2010: 11th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1668-1671 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  188. Term-Dependent Confidence for Out-of-Vocabulary Term Detection

    Wang, D., King, S., Frankel, J. & Bell, P., 2009, In Proc. Interspeech. p. 2139-2142

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  189. Posterior-based confidence measures for spoken term detection

    Wang, D., Tejedor, J., Frankel, J., King, S. & Col'a, S. J., 2009, ICASSP09.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  190. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J., Vipperla, R., Evans, N. & Troncy, R., Aug 2012, In : ACM Transactions on Information Systems. 30, 3, p. - 34 p., 16.

    Research output: Contribution to journalArticle

  191. Handling overlaps in spoken term detection

    Wang, D., Evans, N., Troncy, R. & King, S., 1 May 2011, Proc. International Conference on Acoustics, Speech and Signal Processing. p. 5656-5659 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  192. A Posterior Approach for Microphone Array Based Speech Recognition

    Wang, D., Himawan, I., Frankel, J. & King, S., Sep 2008, Interspeech. p. 996-999 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  193. Term-dependent Confidence Normalization for Out-of-Vocabulary Spoken Term Detection

    Wang, D., Tejedor, J., King, S. & Frankel, J., Mar 2012, In : Journal of Computer Science and Technology. 27, 2, p. 358-375 17 p.

    Research output: Contribution to journalArticle

  194. Direct Posterior Confidence For Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, SSCS '10 Proceedings of the 2010 international workshop on Searching spontaneous conversational speech. ACM, p. 21-26 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  195. Letter-to-Sound Pronunciation Prediction Using Conditional Random Fields

    Wang, D. & King, S., 1 Feb 2011, In : IEEE Signal Processing Letters. 18, 2, p. 122-125 4 p.

    Research output: Contribution to journalArticle

  196. Stochastic pronunciation modelling for spoken term detection

    Wang, D., King, S. & Frankel, J., 2009, Proceedings of Interspeech 2009 Brighton. p. 2135-2138 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  197. A comparison of phone and grapheme-based spoken term detection

    Wang, D., Frankel, J., Tejedor, J. & King, S., Mar 2008, IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. Institute of Electrical and Electronics Engineers (IEEE), p. 4969-4972 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  198. A comparison of grapheme and phoneme-based units for Spanish spoken term detection

    Wang, D., Tejedor, J., Frankel, J., King, S. & Colas, J., Nov 2008, In : Speech Communication. 50, 11-12, p. 980-991 12 p.

    Research output: Contribution to journalArticle

  199. Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S. & Frankel, J., May 2011, In : IEEE Transactions on Audio, Speech and Language Processing. 19, 4, p. 688-698 11 p.

    Research output: Contribution to journalArticle

  200. Speech Waveform Reconstruction using Convolutional Neural Networks with Noise and Periodic Inputs

    Watts, O., Valentini Botinhao, C. & King, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 7045-7049 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  201. Neural net word representations for phrase-break prediction without a part of speech tagger

    Watts, O., Gangireddy, S., Yamagishi, J., King, S., Renals, S., Stan, A. & Giurgiu, M., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 2599-2603 5 p. 6854070

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  202. Unsupervised continuous-valued word features for phrase-break prediction without a part-of-speech tagger.

    Watts, O., Yamagishi, J. & King, S., Aug 2011, Proceedings of the 12th Annual Conference of the International Speech Communication Association. Cosi, P., De Mori, R., Di Fabbrizio, G. & Pieraccini, R. (eds.). ISCA, p. 2157-2160 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  203. The role of higher-level linguistic features in HMM-based speech synthesis

    Watts, O., Yamagishi, J. & King, S., 2010, Proc. Interspeech. p. 841-844

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  204. Sentence-level control vectors for deep neural network speech synthesis

    Watts, O., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2217-2221 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  205. From HMMs to DNNs: Where Do the Improvements Come From?

    Watts, O., Henter, G. E., Merritt, T., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5505-5509 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  206. Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from `found' data: evaluation and analysis

    Watts, O., Stan, A., Clark, R., Mamiya, Y., Giurgiu, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Speech Synthesis Workshop: Barcelona, Spain. ISCA-INST SPEECH COMMUNICATION ASSOC, p. 101-106 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  207. HMM adaptation and voice conversion for the synthesis of child speech: a comparison

    Watts, O., Yamagishi, J., King, S. & Berkling, K., Sep 2009, Interspeech 2009, Brighton UK. p. 2627-2630 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  208. HMM-based synthesis of child speech

    Watts, O., Yamagishi, J., Berkling, K. & King, S., 2008, Proc. of The 1st Workshop on Child, Computer and Interaction (ICMI'08 post-conference workshop).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  209. Synthesis of Child Speech With HMM Adaptation and Voice Conversion

    Watts, O., Yamagishi, J., King, S. & Berkling, K., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 1005-1016 12 p.

    Research output: Contribution to journalArticle

  210. Exemplar-based Speech Waveform Generation

    Watts, O., Valentini Botinhao, C., Espic calderón, F. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 2022-2026 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  211. Letter-based speech synthesis

    Watts, O., Yamagishi, J. & King, S., Sep 2010, Proc. Speech Synthesis Workshop 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  212. Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project

    Wester, M., Dines, J., Gibson, M., Liang, H., Wu, Y-J., Saheer, L., King, S., Oura, K., Garner, P. N., Byrne, W., Guan, Y., Hirsimaki, T., Karhila, R., Kurimo, M., Shannon, M., Shiota, S., Tian, J., Tokuda, K. & Yamagishi, J., 2010, Proc. of 7th ISCA Speech Synthesis Workshop.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  213. Asynchronous Articulatory Feature Recognition Using Dynamic Bayesian Networks

    Wester, M., Frankel, J. & King, S., 1 Dec 2004, Proc. IEICI Beyond HMM Workshop.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  214. Investigating gated recurrent neural networks for speech synthesis

    Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 1-5 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  215. A study of speaker adaptation for DNN-based speech synthesis

    Wu, Z., Swietojanski, P., Veaux, C., Renals, S. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  216. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance

    Wu, Z., De Leon, P., Demiroglu, C., Khodabakhsh, A., King, S., Ling, Z., Saito, D., Stewart, B., Toda, T., Wester, M. & Yamagishi, J., Apr 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 4, p. 768 - 783 17 p.

    Research output: Contribution to journalArticle

  217. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis.

    Wu, Z., Valentini-Botinhao, C., Watts, O. & King, S., 1 Apr 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, Australia, p. 4460-4464 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  218. SAS: A Speaker Verification Spoofing Database Containing Diverse Attacks

    Wu, Z., Khodabakhsh, A., Demiroglu, C., Yamagishi, J., Saito, D., Toda, T. & King, S., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on . Institute of Electrical and Electronics Engineers (IEEE), p. 4440-4444 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  219. Merlin: An Open Source Neural Network Speech Synthesis System

    Wu, Z., Watts, O. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop (2016). p. 202-207 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  220. Improved Average-Voice-based Speech Synthesis Using Gender-Mixed Modeling and a Parameter Generation Algorithm Considering GV

    Yamagishi, J., Kobayashi, T., Renals, S., King, S., Zen, H., Toda, T. & Tokuda, K., 1 Aug 2007, SSW6-2007: 6th ISCA Workshop on Speech Synthesis. International Speech Communication Association, p. 125-130 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  221. Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

    Yamagishi, J., Nose, T., Zen, H., Ling, Z. H., Toda, T., Tokuda, K., King, S. & Renals, S., Aug 2009, In : IEEE Transactions on Audio, Speech and Language Processing. 17, 6, p. 1208-1230 23 p.

    Research output: Contribution to journalArticle

  222. Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora

    Yamagishi, J., Usabaev, B., King, S., Watts, O., Dines, J., Tian, J., Guan, Y., Hu, R., Oura, K., Wu, Y-J., Tokuda, K., Karhila, R. & Kurimo, M., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 984-1004 21 p.

    Research output: Contribution to journalArticle

  223. Robustness of HMM-based speech synthesis

    Yamagishi, J., Ling, Z. & King, S., Sep 2008, Proc. Interspeech. p. 581-584 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  224. Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis

    Yamagishi, J., Watts, O., King, S. & Usabaev, B., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  225. Simple methods for improving speaker-similarity of HMM-based speech synthesis

    Yamagishi, J. & King, S., 2010, Proc. ICASSP 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  226. Analysis of unsupervised and noise-robust speaker-adaptive HMM-based speech synthesis systems toward a unified ASR and TTS framework

    Yamagishi, J., Lincoln, M., King, S., Dines, J., Gibson, M., Tian, J. & Guan, Y., Sep 2009, Interspeech 2009 Edinburgh..

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  227. Thousands of voices for HMM-based speech synthesis

    Yamagishi, J., Usabaev, B., King, S., Watts, O., Dines, J., Tian, J., Hu, R., Guan, Y., Oura, K., Tokuda, K., Karhila, R. & Kurimo, M., Sep 2009, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2009: 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009; Brighton, United Kingdom. p. 420-423 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  228. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction

    Yamagishi, J., Veaux, C., King, S. & Renals, S., 2012, In : Acoustical Science and Technology. 33, 1, p. 1-5 5 p.

    Research output: Contribution to journalArticle

  229. Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation

    Yang, C-Y., Brown, G., Lu, L., Yamagishi, J. & King, S., 4 Dec 2012, Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on. Institute of Electrical and Electronics Engineers (IEEE), p. 220-223 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  230. Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs

    Çetin, Ã., Magimai-Doss, M., Kantor, A., King, S., Bartels, C., Frankel, J. & Livescu, K., Dec 2007, Proceedings of the IEEE workshop on Automated Speech Recognition and Understanding, 2007 (ASRU 07). p. 36-41 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution