Edinburgh Research Explorer

Prof Simon King

Personal Chair of Speech Processing

  1. Reconstructing Voices within the Multiple-Average-Voice-Model framework

    Lanchantin, P., Veaux, C., Gales, M. J. F., King, S. & Yamagishi, J., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2232-2236 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  2. Formant-controlled HMM-based speech synthesis

    Lei, M., Yamagishi, J., Richmond, K., Ling, Z-H., King, S. & Dai, L-R., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2777-2780 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  3. Manual transcription of conversational speech at the articulatory feature level

    Livescu, K., Bezman, A., Borges, N., Yung, L., Çetin, O., Frankel, J., King, S., Magimai-Doss, M., Chi, X. & Lavoie, L., 1 Apr 2007, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007 (ICASSP 2007). Vol. 4. p. 953-956 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  4. Articulatory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU Summer Workshop

    Livescu, K., Çetin, O., Hasegawa-Johnson, M., King, S., Bartels, C., Borges, N., Kantor, A., Lal, P., Yung, L., Bezman, A., Dawson-Haggerty, S., Woods, B., Frankel, J., Magimai-Doss, M. & Saenko, K., 1 Apr 2007, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007 (ICASSP 2007). Vol. 4. p. 621-624 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  5. Simple4All proposals for the Albayzin Evaluations in Speech Synthesis

    Lorenzo-Trueba, J., Watts, O., Barra-Chicote, R., Yamagishi, J., King, S. & Montero, J. M., 2012, Proc. Iberspeech 2012.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  6. Development of a Genre-Dependent TTS System with Cross-Speaker Speaking-Style Transplantation

    Lorenzo-Trueba, J., Echeverry-Correa, J. D., Barra-Chicote, R., San-Segundo, R., Ferreiros, J., Gallardo-Antolin, A., Yamagishi, J., King, S. & Montero, J. M., 2014, 2nd International Workshop on Speech, Language and Audio in Multimedia (SLAM2014). International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  7. Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis

    Lu, H., King, S. & Watts, O., 1 Aug 2013, 8th ISCA Speech Synthesis Workshop. p. 261-265 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  8. Using Bayesian Networks to find relevant context features for HMM-based speech synthesis

    Lu, H. & King, S., 1 Sep 2012, INTERSPEECH 2012, 13th Annual Conference of the International Speech Communication Association, Portland, Oregon, USA, September 9-13, 2012. ISCA-INST SPEECH COMMUNICATION ASSOC

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  9. Factorized context modelling for Text-to-Speech synthesis

    Lu, H. & King, S., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7849-7853 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  10. Using Adaptation to Improve Speech Transcription Alignment in Noisy and Reverberant Environments

    Mamiya, Y., Stan, A., Yamagishi, J., Bell, P., Watts, O., Clark, R. & King, S., Aug 2013, Proc. 8th ISCA Speech Synthesis Workshop. p. 61-66 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  11. Lightly supervised GMM VAD to use audiobook for speech synthesiser

    Mamiya, Y., Yamagishi, J., Watts, O., Clark, R. A. J., King, S. & Stan, A., 2013, Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 7987-7991 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  12. Multidimensional Scaling of Listener Responses to Synthetic Speech

    Mayo, C., Clark, R. A. J. & King, S., 1 Sep 2005, Interspeech 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 1725-1728 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  13. Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis

    Mayo, C., Clark, R. A. J. & King, S., 1 Mar 2011, In: Speech Communication. 53, 3, p. 311-326 15 p.

    Research output: Contribution to journal > Article

  14. Nativization of foreign names in TTS for automatic reading of world news in Swahili

    Mendelson, J., Oplustil, P., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 2188-2192 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  15. Deep neural network context embeddings for model selection in rich-context HMM synthesis

    Merritt, T., Yamagishi, J., Wu, Z., Watts, O. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. Dresden: International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  16. Investigating the shortcomings of HMM synthesis

    Merritt, T. & King, S., 1 Aug 2013, Proceedings of 8th ISCA Speech Synthesis Workshop. p. 185-190 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  17. Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

    Oura, K., Tokuda, K., Yamagishi, J., Wester, M. & King, S., 2010, Proceedings of ICASSP. Vol. 1. p. 4954-4957 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  18. Unsupervised English-to-Japanese speaker adaptation for HMM-based speech synthesis.

    Oura, K., Yamagishi, J., King, S., Wester, M. & Tokuda, K., Dec 2009, Proceedings of the Acoustical Society of Japan: Autumn meeting. Vol. I 3-P-18. p. 401-402 2 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  19. Unsupervised speaker adaptation for speech-to-speech translation system.

    Oura, K., Yamagishi, J., Wester, M., King, S. & Tokuda, K., Dec 2009, Proceedings SLP 2009. 356 ed. Tokyo, Vol. 109. p. 13-18 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  20. Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

    Oura, K., Yamagishi, J., Wester, M., King, S. & Tokuda, K., 1 Jul 2012, In: Speech Communication. 54, 6, p. 703-714 12 p.

    Research output: Contribution to journal > Article

  21. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

    Poblete, V., Espic, F., King, S., Stern, R. M., Huenupan, F., Fredes, J. & Yoma, N. B., May 2015, In: Computer Speech and Language. 31, 1, p. 1-27 27 p.

    Research output: Contribution to journal > Article

  22. Voice source modelling using deep neural networks for statistical parametric speech synthesis

    Raitio, T., Lu, H., Kane, J., Suni, A., Vainio, M., King, S. & Alku, P., 1 Sep 2014, European Signal Processing Conference. European Signal Processing Conference, EUSIPCO, p. 2290-2294 5 p. 6952838

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  23. Automatic speech recognition

    Renals, S. & King, S., Feb 2010, Handbook of Phonetic Sciences. Hardcastle, W. J., Laver, J. & Gibbon, F. E. (eds.). 2nd ed. Wiley-Blackwell, Vol. 1.

    Research output: Chapter in Book/Report/Conference proceeding > Chapter (peer-reviewed)

  24. Modelling the Uncertainty in Recovering Articulation from Acoustics

    Richmond, K., King, S. & Taylor, P., Apr 2003, In: Computer Speech and Language. 17, 2-3, p. 153-172 20 p.

    Research output: Contribution to journal > Article

  25. Smooth talking: articulatory join costs for unit selection

    Richmond, K. & King, S., 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5150-5154 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  26. Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus

    Richmond, K., Hoole, P. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1505-1508 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  27. DNN-based Speech Synthesis for Indian Languages from ASCII text

    Ronanki, S., Gangireddy, S. R., Bollepalli, B. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 74-79 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  28. Median-based generation of synthetic speech durations using a non-parametric approach

    Ronanki, S., Watts, O., King, S. & Henter, G. E., 9 Feb 2017, 2016 IEEE Spoken Language Technology Workshop (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 686-692 7 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  29. A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis

    Ronanki, S., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 1133-1137 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  30. Framewise phone classification using support vector machines

    Salomon, J., King, S. & Osborne, M., 2002, ICSLP-2002: Proceedings of the 7th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2645-2648 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  31. Detecting Acronyms from Capital Letter Sequences in Spanish

    San-Segundo, R., Montero, J. M., Lopez-Luden, V. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  32. Using eigenvoices and nearest-neighbours in HMM-based cross-lingual speaker adaptation with limited data

    Sarfjoo, S. S., Demiroglu, C. & King, S., Apr 2017, In: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 4, p. 839-851 13 p.

    Research output: Contribution to journal > Article

  33. The Edinburgh Speech Production Facility DoubleTalk Corpus

    Scobbie, J., Turk, A., Geng, C., King, S., Lickley, R. & Richmond, K., 1 Aug 2013, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  34. Estimating detailed spectral envelopes using articulatory clustering

    Shiga, Y. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2485-2488 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  35. Estimation of voice source and vocal tract characteristics based on multi-frame analysis

    Shiga, Y. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, Vol. 3. p. 1749-1752 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  36. Source-Filter Separation for Articulation-to-Speech Synthesis

    Shiga, Y. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1913-1916 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  37. Estimating the Spectral Envelope of Voiced Speech Using Multi-frame Analysis

    Shiga, Y. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, Vol. 3. p. 1737-1740 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  38. Accurate spectral envelope estimation for articulation-to-speech synthesis

    Shiga, Y. & King, S., 1 Jun 2004, Proc. 5th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 19-24 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  39. Where are the challenges in speaker diarization?

    Sinclair, M. & King, S., 21 Oct 2013, IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013. Institute of Electrical and Electronics Engineers (IEEE), p. 7741-7745 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  40. The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate

    Stan, A., Yamagishi, J., King, S. & Aylett, M., Mar 2011, In: Speech Communication. 53, 3, p. 442-450 9 p.

    Research output: Contribution to journal > Article

  41. TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

    Stan, A., Watts, O., Mamiya, Y., Giurgiu, M., Clark, R. A. J., Yamagishi, J. & King, S., 2013, INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association: Lyon, France, August 25-29, 2013. ISCA-INST SPEECH COMMUNICATION ASSOC, p. 2331-2335 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  42. A grapheme-based method for automatic alignment of speech and text data

    Stan, A., Bell, P. & King, S., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 286-290 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  43. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., Jan 2016, In: Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journal > Article

  44. Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data

    Stan, A., Bell, P., Yamagishi, J. & King, S., 1 Aug 2013, Proc Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  45. Modelling Prominence and Emphasis Improves Unit-Selection Synthesis

    Strom, V., Nenkova, A., Clark, R., Vazquez-Alvarez, Y., Brenier, J., King, S. & Jurafsky, D., 1 Aug 2007, Interspeech 2007: 8th Annual Conference of the International Speech Communication Association. p. 1282-1285

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  46. Investigating Festival's target cost function using perceptual experiments

    Strom, V. & King, S., 2008, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  47. A Classifier-based target cost for unit selection speech synthesis trained on perceptual data

    Strom, V. & King, S., 2010, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  48. Expressive Prosody for Unit-selection Speech Synthesis

    Strom, V., Clark, R. & King, S., 2006, Interspeech 2006 - ICSLP: 9th International Conference on Spoken Language Processing. International Speech Communication Association, 1522

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  49. Impact of different speech types on listening effort

    Symantiraki, O., Cooke, M. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2267-2271

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  50. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In: IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journal > Editorial

  51. Using Intonation to Constrain Language Models in Speech Recognition

    Taylor, P., King, S., Isard, S., Wright, H. & Kowtko, J., 1997, Proc. Eurospeech'97: 5th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 2763-2766 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  52. Intonation and Dialogue Context as Constraints for Speech Recognition

    Taylor, P., King, S., Isard, S. D. & Wright, H., 1998, In: Language and Speech. 41, 3, p. 493-512 20 p.

    Research output: Contribution to journal > Article

  53. Using Prosodic Information to Constrain Language Models for Spoken dialogue

    Taylor, P., Shimodaira, H., Isard, S., King, S. & Kowtko, J., Oct 1996, Proceedings of the Fourth International Conference on Spoken Language, 1996 (ICSLP '96). Vol. 1. p. 216-219 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  54. A Posterior Probability-based System Hybridisation and Combination for Spoken Term Detection

    Tejedor, J., Wang, D., King, S., Frankel, J. & Colas, J., Sep 2009, Interspeech. Citeseer, Vol. 2009. p. 2131-2134 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  55. A novel two-level architecture plus confidence measures for a keyword spotting system.

    Tejedor, J., King, S., Frankel, J., Wang, D., Colas, J. & Garrido, J., Dec 2009, Proceedings of the 5th Biennial Workshop on Speech Technology. p. 247-250 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  56. Augmented set of features for confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Bautista, M., King, S., Wang, D. & Colas, J., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  57. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In: Computer Speech and Language. 28, 5, p. 1083-1114 32 p.

    Research output: Contribution to journal > Article

  58. Discriminative Tandem Features for HMM-based EEG Classification

    Ting, C-M., King, S., Salleh, S-H. & Ariff, A. K., 1 Jul 2013, Proc. 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 13). IEEE Engineering in Medicine and Biology Society, Vol. 2013. p. 3957-3960

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  59. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  60. Cross-lingual Portability of MLP-Based Tandem Features - A Case Study for English and Hungarian

    Toth, L., Frankel, J., Gosztolya, G. & King, S., 2008, Proc. Interspeech. p. 2695-2698 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  61. The Edinburgh Speech Production Facility Dialogue Corpus

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., King, S. & Renals, S., 2010

    Research output: Non-textual form > Digital or Visual Products

  62. An Edinburgh Speech Production Facility

    Turk, A., Scobbie, J., Geng, C., Dickie, C., Bard, E., Hardcastle, W., Hartinger, M., King, S., Lickley, R., Renals, S., Richmond, K., Schaeffler, S., White, K. & Wrench, A., Jul 2010, (Unpublished).

    Research output: Contribution to conference > Poster

  63. The Edinburgh Speech Production Facility's articulatory corpus of spontaneous dialogue.

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., Bard, E., Campbell, B., Dickie, C., Dubourg, E., Hardcastle, B., Hoole, P., Kanaida, E., Lickley, R., Nakai, S., Pouplier, M., King, S., Renals, S., Richmond, K., Schaeffler, S., Wiegand, R., White, K. & Wrench, A., 2010, In: Journal of the Acoustical Society of America. 128, 4, p. 2429-2429 1 p.

    Research output: Contribution to journal > Article

  64. Exemplar-based speech waveform generation for text-to-speech

    Valentini Botinhao, C., Watts, O., Espic Calderón, F. & King, S., 14 Feb 2019, 2018 IEEE Workshop on Spoken Language Technology (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 332-338 7 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  65. Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise

    Valentini-Botinhao, C., Wester, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Workshop on Speech Synthesis. p. 133-138 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  66. Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Maia, R., Yamagishi, J., King, S. & Zen, H., 2012, 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP): Kyoto, Japan. NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 3997-4000 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  67. Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods.

    Valentini-Botinhao, C., Godoy, E., Stylianou, Y., Sauert, B., King, S. & Yamagishi, J., May 2013, Proc. ICASSP - Vancouver, Canada.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  68. Intelligibility Enhancement of Speech in Noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., Sep 2014, Proceedings of the Institute of Acoustics 2014. Vol. 36. 8 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  69. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In: Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journal > Article

  70. Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5112-5115 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  71. Using an intelligibility measure to create noise robust cepstral coefficients for HMM-based speech synthesis

    Valentini-Botinhao, C., Yamagishi, J. & King, S., May 2012, Proc. LISTA Workshop: Edinburgh, UK.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  72. Can Objective Measures Predict the Intelligibility of Modified HMM-based Synthetic Speech in Noise?

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1837-1840 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  73. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Stylianou, Y., 1 Aug 2013, Interspeech 2013.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  74. Towards minimum perceptual error training for DNN-based speech synthesis

    Valentini-Botinhao, C., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 869-873 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  75. Voice Banking and Voice Reconstruction for MND patients

    Veaux, C., Yamagishi, J. & King, S., 2011, ASSETS 11: Proceedings of the 13th International ACM Sigaccess conference on computers and accessibility. New York: ASSOC COMPUTING MACHINERY, p. 305-306 2 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  76. Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction

    Veaux, C., Yamagishi, J. & King, S., 2013, SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies. ISCA, p. 107-111 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  77. Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disorders

    Veaux, C., Yamagishi, J. & King, S., Sep 2012, Proceedings of INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. p. 967-970 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  78. The voice bank corpus: Design, collection and data analysis of a large regional accent speech database

    Veaux, C., Yamagishi, J. & King, S., Nov 2013, Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference. Institute of Electrical and Electronics Engineers (IEEE), 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  79. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  80. Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis

    Vepa, J. & King, S., 1 Sep 2006, In: IEEE Transactions on Audio, Speech and Language Processing. 14, 5, p. 1763-1771 9 p.

    Research output: Contribution to journal > Article

  81. Kalman-filter based Join Cost for Unit-selection Speech Synthesis

    Vepa, J. & King, S., 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 293-296 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  82. Join Cost for Unit Selection Speech Synthesis

    Vepa, J. & King, S., 2004, Text to Speech Synthesis: New paradigms and advances. Alwan, A. & Narayanan, S. (eds.). Prentice Hall

    Research output: Chapter in Book/Report/Conference proceeding > Other chapter contribution

  83. Subjective Evaluation Of Join Cost Functions Used In Unit Selection Speech Synthesis

    Vepa, J. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1181-1184 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  84. Objective Distance Measures for Spectral Discontinuities in Concatenative Speech Synthesis

    Vepa, J., King, S. & Taylor, P., 1 Sep 2002, ICSLP 2002: 7th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2605-2608 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  85. New Objective Distance Measures for Spectral Discontinuities in Concatenative Speech Synthesis

    Vepa, J., King, S. & Taylor, P., 1 Sep 2002, Proceedings of the 2002 IEEE workshop on speech synthesis. p. 223-226 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  86. Subjective evaluation of join cost and smoothing methods

    Vepa, J. & King, S., 1 Jun 2004, Proc. 5th ISCA speech synthesis workshop. International Speech Communication Association, p. 7-12 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  87. Stochastic Pronunciation Modelling and Soft Match for Out-of-vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J. & Bell, P., 1 Mar 2010, Proceedings of the 2010 IEEE International conference on Acoustic Speech and Signal Processing (ICASSP). NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 5294-5297 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  88. Posterior-based confidence measures for spoken term detection

    Wang, D., Tejedor, J., Frankel, J., King, S. & Colás, J., 2009, ICASSP09.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  89. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J., Vipperla, R., Evans, N. & Troncy, R., Aug 2012, In: ACM Transactions on Information Systems. 30, 3, 34 p., 16.

    Research output: Contribution to journal > Article

  90. Handling overlaps in spoken term detection

    Wang, D., Evans, N., Troncy, R. & King, S., 1 May 2011, Proc. International Conference on Acoustics, Speech and Signal Processing. p. 5656-5659 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  91. A Posterior Approach for Microphone Array Based Speech Recognition

    Wang, D., Himawan, I., Frankel, J. & King, S., Sep 2008, Interspeech. p. 996-999 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  92. A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis

    Wang, X., Takaki, S., Yamagishi, J., King, S. & Tokuda, K., 10 Oct 2019, (Accepted/In press) In: IEEE Transactions on Audio, Speech and Language Processing. 13 p.

    Research output: Contribution to journal > Article

  93. Term-dependent Confidence Normalization for Out-of-Vocabulary Spoken Term Detection

    Wang, D., Tejedor, J., King, S. & Frankel, J., Mar 2012, In: Journal of Computer Science and Technology. 27, 2, p. 358-375 17 p.

    Research output: Contribution to journal > Article

  94. Direct Posterior Confidence For Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, SSCS '10 Proceedings of the 2010 international workshop on Searching spontaneous conversational speech. ACM, p. 21-26 6 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  95. Letter-to-Sound Pronunciation Prediction Using Conditional Random Fields

    Wang, D. & King, S., 1 Feb 2011, In: IEEE Signal Processing Letters. 18, 2, p. 122-125 4 p.

    Research output: Contribution to journal > Article

  96. Stochastic pronunciation modelling for spoken term detection

    Wang, D., King, S. & Frankel, J., 2009, Proceedings of Interspeech 2009 Brighton. p. 2135-2138 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  97. A comparison of phone and grapheme-based spoken term detection

    Wang, D., Frankel, J., Tejedor, J. & King, S., Mar 2008, IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. Institute of Electrical and Electronics Engineers (IEEE), p. 4969-4972 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

  98. A comparison of grapheme and phoneme-based units for Spanish spoken term detection

    Wang, D., Tejedor, J., Frankel, J., King, S. & Colas, J., Nov 2008, In: Speech Communication. 50, 11-12, p. 980-991 12 p.

    Research output: Contribution to journal > Article

  99. Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S. & Frankel, J., May 2011, In: IEEE Transactions on Audio, Speech and Language Processing. 19, 4, p. 688-698 11 p.

    Research output: Contribution to journal > Article