Edinburgh Research Explorer

Prof Simon King

Personal Chair of Speech Processing

  1. Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs

    Çetin, Ö., Magimai-Doss, M., Kantor, A., King, S., Bartels, C., Frankel, J. & Livescu, K., Dec 2007, Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, 2007 (ASRU 07). p. 36-41 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  2. Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation

    Yang, C-Y., Brown, G., Lu, L., Yamagishi, J. & King, S., 4 Dec 2012, Chinese Spoken Language Processing (ISCSLP), 2012 8th International Symposium on. Institute of Electrical and Electronics Engineers (IEEE), p. 220-223 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  3. Improved Average-Voice-based Speech Synthesis Using Gender-Mixed Modeling and a Parameter Generation Algorithm Considering GV

    Yamagishi, J., Kobayashi, T., Renals, S., King, S., Zen, H., Toda, T. & Tokuda, K., 1 Aug 2007, SSW6-2007: 6th ISCA Workshop on Speech Synthesis. International Speech Communication Association, p. 125-130 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  4. Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

    Yamagishi, J., Nose, T., Zen, H., Ling, Z. H., Toda, T., Tokuda, K., King, S. & Renals, S., Aug 2009, In : IEEE Transactions on Audio, Speech and Language Processing. 17, 6, p. 1208-1230 23 p.

    Research output: Contribution to journal › Article

  5. Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora

    Yamagishi, J., Usabaev, B., King, S., Watts, O., Dines, J., Tian, J., Guan, Y., Hu, R., Oura, K., Wu, Y-J., Tokuda, K., Karhila, R. & Kurimo, M., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 984-1004 21 p.

    Research output: Contribution to journal › Article

  6. Robustness of HMM-based speech synthesis

    Yamagishi, J., Ling, Z. & King, S., Sep 2008, Proc. Interspeech. p. 581-584 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  7. Roles of the Average Voice in Speaker-adaptive HMM-based Speech Synthesis

    Yamagishi, J., Watts, O., King, S. & Usabaev, B., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  8. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction

    Yamagishi, J., Veaux, C., King, S. & Renals, S., 2012, In : Acoustical Science and Technology. 33, 1, p. 1-5 5 p.

    Research output: Contribution to journal › Article

  9. Thousands of voices for HMM-based speech synthesis

    Yamagishi, J., Usabaev, B., King, S., Watts, O., Dines, J., Tian, J., Hu, R., Guan, Y., Oura, K., Tokuda, K., Karhila, R. & Kurimo, M., Sep 2009, Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2009: 10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009; Brighton, United Kingdom. p. 420-423 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  10. Analysis of unsupervised and noise-robust speaker-adaptive HMM-based speech synthesis systems toward a unified ASR and TTS framework

    Yamagishi, J., Lincoln, M., King, S., Dines, J., Gibson, M., Tian, J. & Guan, Y., Sep 2009, Interspeech 2009 Edinburgh.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  11. Simple methods for improving speaker-similarity of HMM-based speech synthesis

    Yamagishi, J. & King, S., 2010, Proc. ICASSP 2010.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  12. Investigating gated recurrent neural networks for speech synthesis

    Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 1-5 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  13. A study of speaker adaptation for DNN-based speech synthesis

    Wu, Z., Swietojanski, P., Veaux, C., Renals, S. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  14. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance

    Wu, Z., De Leon, P., Demiroglu, C., Khodabakhsh, A., King, S., Ling, Z., Saito, D., Stewart, B., Toda, T., Wester, M. & Yamagishi, J., Apr 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 4, p. 768-783 17 p.

    Research output: Contribution to journal › Article

  15. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis.

    Wu, Z., Valentini-Botinhao, C., Watts, O. & King, S., 1 Apr 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, Australia, p. 4460-4464 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  16. SAS: A Speaker Verification Spoofing Database Containing Diverse Attacks

    Wu, Z., Khodabakhsh, A., Demiroglu, C., Yamagishi, J., Saito, D., Toda, T. & King, S., 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Institute of Electrical and Electronics Engineers (IEEE), p. 4440-4444 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  17. Merlin: An Open Source Neural Network Speech Synthesis System

    Wu, Z., Watts, O. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop (2016). p. 202-207 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  18. Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project

    Wester, M., Dines, J., Gibson, M., Liang, H., Wu, Y-J., Saheer, L., King, S., Oura, K., Garner, P. N., Byrne, W., Guan, Y., Hirsimaki, T., Karhila, R., Kurimo, M., Shannon, M., Shiota, S., Tian, J., Tokuda, K. & Yamagishi, J., 2010, Proc. of 7th ISCA Speech Synthesis Workshop.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  19. Asynchronous Articulatory Feature Recognition Using Dynamic Bayesian Networks

    Wester, M., Frankel, J. & King, S., 1 Dec 2004, Proc. IEICE Beyond HMM Workshop.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  20. Speech Waveform Reconstruction using Convolutional Neural Networks with Noise and Periodic Inputs

    Watts, O., Valentini Botinhao, C. & King, S., 17 Apr 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Brighton, United Kingdom: Institute of Electrical and Electronics Engineers (IEEE), p. 7045-7049 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  21. Neural net word representations for phrase-break prediction without a part of speech tagger

    Watts, O., Gangireddy, S., Yamagishi, J., King, S., Renals, S., Stan, A. & Giurgiu, M., 4 May 2014, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 2599-2603 5 p. 6854070

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  22. Unsupervised continuous-valued word features for phrase-break prediction without a part-of-speech tagger.

    Watts, O., Yamagishi, J. & King, S., Aug 2011, Proceedings of the 12th Annual Conference of the International Speech Communication Association. Cosi, P., De Mori, R., Di Fabbrizio, G. & Pieraccini, R. (eds.). ISCA, p. 2157-2160 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  23. The role of higher-level linguistic features in HMM-based speech synthesis

    Watts, O., Yamagishi, J. & King, S., 2010, Proc. Interspeech. p. 841-844

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  24. Sentence-level control vectors for deep neural network speech synthesis

    Watts, O., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 2217-2221 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  25. From HMMs to DNNs: Where Do the Improvements Come From?

    Watts, O., Henter, G. E., Merritt, T., Wu, Z. & King, S., Mar 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Institute of Electrical and Electronics Engineers (IEEE), p. 5505-5509 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  26. Unsupervised and lightly-supervised learning for rapid construction of TTS systems in multiple languages from 'found' data: evaluation and analysis

    Watts, O., Stan, A., Clark, R., Mamiya, Y., Giurgiu, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Speech Synthesis Workshop: Barcelona, Spain. ISCA-INST SPEECH COMMUNICATION ASSOC, p. 101-106 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  27. HMM adaptation and voice conversion for the synthesis of child speech: a comparison

    Watts, O., Yamagishi, J., King, S. & Berkling, K., Sep 2009, Interspeech 2009, Brighton UK. p. 2627-2630 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  28. HMM-based synthesis of child speech

    Watts, O., Yamagishi, J., Berkling, K. & King, S., 2008, Proc. of The 1st Workshop on Child, Computer and Interaction (ICMI'08 post-conference workshop).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  29. Synthesis of Child Speech With HMM Adaptation and Voice Conversion

    Watts, O., Yamagishi, J., King, S. & Berkling, K., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 1005-1016 12 p.

    Research output: Contribution to journal › Article

  30. Exemplar-based Speech Waveform Generation

    Watts, O., Valentini Botinhao, C., Espic Calderón, F. & King, S., 6 Sep 2018, Interspeech 2018. Hyderabad, India, p. 2022-2026 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  31. Letter-based speech synthesis

    Watts, O., Yamagishi, J. & King, S., Sep 2010, Proc. Speech Synthesis Workshop 2010.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  32. Stochastic Pronunciation Modelling and Soft Match for Out-of-vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J. & Bell, P., 1 Mar 2010, Proceedings of the 2010 IEEE International conference on Acoustic Speech and Signal Processing (ICASSP). NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 5294-5297 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  33. CRF-based Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, Interspeech 2010: 11th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1668-1671 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  34. Term-Dependent Confidence for Out-of-Vocabulary Term Detection

    Wang, D., King, S., Frankel, J. & Bell, P., 2009, In Proc. Interspeech. p. 2139-2142

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  35. Posterior-based confidence measures for spoken term detection

    Wang, D., Tejedor, J., Frankel, J., King, S. & Colás, J., 2009, ICASSP09.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  36. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J., Vipperla, R., Evans, N. & Troncy, R., Aug 2012, In : ACM Transactions on Information Systems. 30, 3, 34 p., 16.

    Research output: Contribution to journal › Article

  37. Handling overlaps in spoken term detection

    Wang, D., Evans, N., Troncy, R. & King, S., 1 May 2011, Proc. International Conference on Acoustics, Speech and Signal Processing. p. 5656-5659 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  38. A Posterior Approach for Microphone Array Based Speech Recognition

    Wang, D., Himawan, I., Frankel, J. & King, S., Sep 2008, Interspeech. p. 996-999 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  39. A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis

    Wang, X., Takaki, S., Yamagishi, J., King, S. & Tokuda, K., 10 Oct 2019, (Accepted/In press) In : IEEE Transactions on Audio, Speech and Language Processing. 13 p.

    Research output: Contribution to journal › Article

  40. Term-dependent Confidence Normalization for Out-of-Vocabulary Spoken Term Detection

    Wang, D., Tejedor, J., King, S. & Frankel, J., Mar 2012, In : Journal of Computer Science and Technology. 27, 2, p. 358-375 17 p.

    Research output: Contribution to journal › Article

  41. Direct Posterior Confidence For Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, SSCS '10 Proceedings of the 2010 international workshop on Searching spontaneous conversational speech. ACM, p. 21-26 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  42. Letter-to-Sound Pronunciation Prediction Using Conditional Random Fields

    Wang, D. & King, S., 1 Feb 2011, In : IEEE Signal Processing Letters. 18, 2, p. 122-125 4 p.

    Research output: Contribution to journal › Article

  43. Stochastic pronunciation modelling for spoken term detection

    Wang, D., King, S. & Frankel, J., 2009, Proceedings of Interspeech 2009 Brighton. p. 2135-2138 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  44. A comparison of phone and grapheme-based spoken term detection

    Wang, D., Frankel, J., Tejedor, J. & King, S., Mar 2008, IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. Institute of Electrical and Electronics Engineers (IEEE), p. 4969-4972 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  45. A comparison of grapheme and phoneme-based units for Spanish spoken term detection

    Wang, D., Tejedor, J., Frankel, J., King, S. & Colas, J., Nov 2008, In : Speech Communication. 50, 11-12, p. 980-991 12 p.

    Research output: Contribution to journal › Article

  46. Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S. & Frankel, J., May 2011, In : IEEE Transactions on Audio, Speech and Language Processing. 19, 4, p. 688-698 11 p.

    Research output: Contribution to journal › Article

  47. Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis

    Vepa, J. & King, S., 1 Sep 2006, In : IEEE Transactions on Audio, Speech and Language Processing. 14, 5, p. 1763-1771 9 p.

    Research output: Contribution to journal › Article

  48. Kalman-filter based Join Cost for Unit-selection Speech Synthesis

    Vepa, J. & King, S., 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 293-296 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  49. Join Cost for Unit Selection Speech Synthesis

    Vepa, J. & King, S., 2004, Text to Speech Synthesis: New paradigms and advances. Alwan, A. & Narayanan, S. (eds.). Prentice Hall

    Research output: Chapter in Book/Report/Conference proceeding › Other chapter contribution

  50. Subjective Evaluation Of Join Cost Functions Used In Unit Selection Speech Synthesis

    Vepa, J. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1181-1184 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  51. Objective Distance Measures for Spectral Discontinuities in Concatenative Speech Synthesis

    Vepa, J., King, S. & Taylor, P., 1 Sep 2002, ICSLP 2002: 7th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2605-2608 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  52. New Objective Distance Measures for Spectral Discontinuities in Concatenative Speech Synthesis

    Vepa, J., King, S. & Taylor, P., 1 Sep 2002, Proceedings of the 2002 IEEE workshop on speech synthesis. p. 223-226 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  53. Subjective evaluation of join cost and smoothing methods

    Vepa, J. & King, S., 1 Jun 2004, Proc. 5th ISCA speech synthesis workshop. International Speech Communication Association, p. 7-12 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  54. Voice Banking and Voice Reconstruction for MND patients

    Veaux, C., Yamagishi, J. & King, S., 2011, ASSETS 11: Proceedings of the 13th International ACM Sigaccess conference on computers and accessibility. New York: ASSOC COMPUTING MACHINERY, p. 305-306 2 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  55. Towards Personalized Synthesized Voices for Individuals with Vocal Disabilities: Voice Banking and Reconstruction

    Veaux, C., Yamagishi, J. & King, S., 2013, SLPAT 2013, 4th Workshop on Speech and Language Processing for Assistive Technologies. ISCA, p. 107-111 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  56. Using HMM-based speech synthesis to reconstruct the voice of individuals with degenerative speech disorders

    Veaux, C., Yamagishi, J. & King, S., Sep 2012, Proceedings of INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association. p. 967-970 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  57. The voice bank corpus: Design, collection and data analysis of a large regional accent speech database

    Veaux, C., Yamagishi, J. & King, S., Nov 2013, Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference. Institute of Electrical and Electronics Engineers (IEEE), 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  58. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  59. Using neighbourhood density and selective SNR boosting to increase the intelligibility of synthetic speech in noise

    Valentini-Botinhao, C., Wester, M., Yamagishi, J. & King, S., Aug 2013, 8th ISCA Workshop on Speech Synthesis. p. 133-138 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  60. Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Maia, R., Yamagishi, J., King, S. & Zen, H., 2012, 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP): Kyoto, Japan. NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 3997-4000 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  61. Improving intelligibility in noise of HMM-generated speech via noise-dependent and -independent methods.

    Valentini-Botinhao, C., Godoy, E., Stylianou, Y., Sauert, B., King, S. & Yamagishi, J., May 2013, Proc. ICASSP - Vancouver, Canada.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  62. Intelligibility Enhancement of Speech in Noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., Sep 2014, Proceedings of the Institute of Acoustics 2014. Vol. 36. 8 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  63. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In : Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journal › Article

  64. Evaluation of objective measures for intelligibility prediction of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5112-5115 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  65. Using an intelligibility measure to create noise robust cepstral coefficients for HMM-based speech synthesis

    Valentini-Botinhao, C., Yamagishi, J. & King, S., May 2012, Proc. LISTA Workshop: Edinburgh, UK.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  66. Can Objective Measures Predict the Intelligibility of Modified HMM-based Synthetic Speech in Noise?

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1837-1840 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  67. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Stylianou, Y., 1 Aug 2013, Interspeech 2013.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  68. Towards minimum perceptual error training for DNN-based speech synthesis

    Valentini-Botinhao, C., Wu, Z. & King, S., Sep 2015, INTERSPEECH 2015 16th Annual Conference of the International Speech Communication Association. Dresden: International Speech Communication Association, p. 869-873 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  69. Exemplar-based speech waveform generation for text-to-speech

    Valentini Botinhao, C., Watts, O., Espic Calderón, F. & King, S., 14 Feb 2019, 2018 IEEE Workshop on Spoken Language Technology (SLT). Institute of Electrical and Electronics Engineers (IEEE), p. 332-338 7 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  70. The Edinburgh Speech Production Facility Dialogue Corpus

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., King, S. & Renals, S., 2010

    Research output: Non-textual form › Digital or Visual Products

  71. An Edinburgh Speech Production Facility

    Turk, A., Scobbie, J., Geng, C., Dickie, C., Bard, E., Hardcastle, W., Hartinger, M., King, S., Lickley, R., Renals, S., Richmond, K., Schaeffler, S., White, K. & Wrench, A., Jul 2010, (Unpublished).

    Research output: Contribution to conference › Poster

  72. The Edinburgh Speech Production Facility's articulatory corpus of spontaneous dialogue.

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., Bard, E., Campbell, B., Dickie, C., Dubourg, E., Hardcastle, B., Hoole, P., Kanaida, E., Lickley, R., Nakai, S., Pouplier, M., King, S., Renals, S., Richmond, K., Schaeffler, S., Wiegand, R., White, K. & Wrench, A., 2010, In : Journal of the Acoustical Society of America. 128, 4, p. 2429-2429 1 p.

    Research output: Contribution to journal › Article

  73. Cross-lingual Portability of MLP-Based Tandem Features--A Case Study for English and Hungarian

    Toth, L., Frankel, J., Gosztolya, G. & King, S., 2008, Proc. Interspeech. p. 2695-2698 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  74. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  75. Discriminative Tandem Features for HMM-based EEG Classification

    Ting, C-M., King, S., Salleh, S-H. & Ariff, A. K., 1 Jul 2013, Proc. 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 13). IEEE Engineering in Medicine and Biology Society, Vol. 2013. p. 3957-3960

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  76. A Posterior Probability-based System Hybridisation and Combination for Spoken Term Detection

    Tejedor, J., Wang, D., King, S., Frankel, J. & Colas, J., Sep 2009, Interspeech. Citeseer, Vol. 2009. p. 2131-2134 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  77. A novel two-level architecture plus confidence measures for a keyword spotting system.

    Tejedor, J., King, S., Frankel, J., Wang, D., Colas, J. & Garrido, J., Dec 2009, Proceedings of the 5th Biennial Workshop on Speech Technology. p. 247-250 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  78. Augmented set of features for confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Bautista, M., King, S., Wang, D. & Colas, J., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  79. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In : Computer Speech and Language. 28, 5, p. 1083–1114 32 p.

    Research output: Contribution to journal › Article

  80. Using Intonation to Constrain Language Models in Speech Recognition

    Taylor, P., King, S., Isard, S., Wright, H. & Kowtko, J., 1997, Proc. Eurospeech'97: 5th European Conference on Speech Communication and Technology. International Speech Communication Association, p. 2763-2766 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  81. Intonation and Dialogue Context as Constraints for Speech Recognition

    Taylor, P., King, S., Isard, S. D. & Wright, H., 1998, In : Language and Speech. 41, 3, p. 493-512 20 p.

    Research output: Contribution to journal › Article

  82. Using Prosodic Information to Constrain Language Models for Spoken dialogue

    Taylor, P., Shimodaira, H., Isard, S., King, S. & Kowtko, J., Oct 1996, Proceedings of the Fourth International Conference on Spoken Language, 1996 (ICSLP '96). Vol. 1. p. 216-219 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  83. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journal › Editorial

  84. Impact of different speech types on listening effort

    Symantiraki, O., Cooke, M. & King, S., 6 Sep 2018, 19th Annual Conference of the International Speech Communication Association, INTERSPEECH 2018. Sekhar, CC., Rao, P., Ghosh, PK., Murthy, HA., Yegnanarayana, B., Umesh, S., Alku, P., Prasanna, SRM. & Narayanan, S. (eds.). International Speech Communication Association, p. 2267-2271

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  85. Modelling Prominence and Emphasis Improves Unit-Selection Synthesis

    Strom, V., Nenkova, A., Clark, R., Vazquez-Alvarez, Y., Brenier, J., King, S. & Jurafsky, D., 1 Aug 2007, Interspeech 2007: 8th Annual Conference of the International Speech Communication Association. p. 1282-1285

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  86. Investigating Festival's target cost function using perceptual experiments

    Strom, V. & King, S., 2008, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  87. A Classifier-based target cost for unit selection speech synthesis trained on perceptual data

    Strom, V. & King, S., 2010, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  88. Expressive Prosody for Unit-selection Speech Synthesis

    Strom, V., Clark, R. & King, S., 2006, Interspeech 2006 - ICSLP: 9th International Conference on Spoken Language Processing. International Speech Communication Association, 1522

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  89. The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate

    Stan, A., Yamagishi, J., King, S. & Aylett, M., Mar 2011, In : Speech Communication. 53, 3, p. 442-450 9 p.

    Research output: Contribution to journal › Article

  90. TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

    Stan, A., Watts, O., Mamiya, Y., Giurgiu, M., Clark, R. A. J., Yamagishi, J. & King, S., 2013, INTERSPEECH 2013, 14th Annual Conference of the International Speech Communication Association: Lyon, France, August 25-29, 2013. ISCA-INST SPEECH COMMUNICATION ASSOC, p. 2331-2335 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  91. A grapheme-based method for automatic alignment of speech and text data

    Stan, A., Bell, P. & King, S., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 286-290 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  92. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., Jan 2016, In : Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journal › Article

  93. Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data

    Stan, A., Bell, P., Yamagishi, J. & King, S., 1 Aug 2013, Proc Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  94. Where are the challenges in speaker diarization?

    Sinclair, M. & King, S., 21 Oct 2013, IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26-31, 2013. Institute of Electrical and Electronics Engineers (IEEE), p. 7741-7745 5 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  95. Estimating detailed spectral envelopes using articulatory clustering

    Shiga, Y. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 2485-2488 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  96. Estimation of voice source and vocal tract characteristics based on multi-frame analysis

    Shiga, Y. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, Vol. 3. p. 1749-1752 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  97. Source-Filter Separation for Articulation-to-Speech Synthesis

    Shiga, Y. & King, S., 1 Oct 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1913-1916 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  98. Estimating the Spectral Envelope of Voiced Speech Using Multi-frame Analysis

    Shiga, Y. & King, S., 1 Sep 2003, Eurospeech 2003 - Interspeech 2003: 8th European Conference on Speech Communication and Technology. International Speech Communication Association, Vol. 3. p. 1737-1740 4 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

  99. Accurate spectral envelope estimation for articulation-to-speech synthesis

    Shiga, Y. & King, S., 1 Jun 2004, Proc. 5th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 19-24 6 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
