Edinburgh Research Explorer

Prof Simon King

Personal Chair of Speech Processing

  1. Chapter (peer-reviewed) › Research › Not peer-reviewed
  2. Automatic speech recognition

    Renals, S. & King, S., Feb 2010, Handbook of Phonetic Sciences. Hardcastle, W. J., Laver, J. & Gibbon, F. E. (eds.). 2nd ed. Wiley-Blackwell, Vol. 1.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  3. Speech Synthesis

    King, S., Sep 2011, Speech and Audio Signal Processing. Gold, B., Morgan, N. & Ellis, D. (eds.). 2nd ed. Wiley, p. 431-454 23 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)

  4. Paper › Research › Peer-reviewed
  5. GlottDNN - A full-band glottal vocoder for statistical parametric speech synthesis

    Airaksinen, M., Bollepalli, B., Juvela, L., Wu, Z., King, S. & Alku, P., 8 Sep 2016.

    Research output: Contribution to conferencePaper

  6. The Blizzard Challenge 2009

    King, S. & Karaiskos, V., 2009.

    Research output: Contribution to conferencePaper

  7. Poster › Research › Not peer-reviewed
  8. An Edinburgh Speech Production Facility

    Turk, A., Scobbie, J., Geng, C., Dickie, C., Bard, E., Hardcastle, W., Hartinger, M., King, S., Lickley, R., Renals, S., Richmond, K., Schaeffler, S., White, K. & Wrench, A., Jul 2010, (Unpublished).

    Research output: Contribution to conferencePoster

  9. Article › Research › Peer-reviewed
  10. A Vector Quantized Variational Autoencoder (VQ-VAE) Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis

    Wang, X., Takaki, S., Yamagishi, J., King, S. & Tokuda, K., 28 Oct 2019, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. p. 1-13 13 p.

    Research output: Contribution to journalArticle

  11. A comparison of grapheme and phoneme-based units for Spanish spoken term detection

    Wang, D., Tejedor, J., Frankel, J., King, S. & Colas, J., Nov 2008, In : Speech Communication. 50, 11-12, p. 980-991 12 p.

    Research output: Contribution to journalArticle

  12. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

    Poblete, V., Espic, F., King, S., Stem, R. M., Huenupan, F., Fredes, J. & Yoma, N. B., May 2015, In : Computer Speech and Language. 31, 1, p. 1-27 27 p.

    Research output: Contribution to journalArticle

  13. ALISA: An automatic lightly supervised speech segmentation and alignment tool

    Stan, A., Mamiya, Y., Yamagishi, J., Bell, P., Watts, O., Clark, R. A. J. & King, S., Jan 2016, In : Computer Speech and Language. 35, p. 116-133 18 p.

    Research output: Contribution to journalArticle

  14. An introduction to statistical parametric speech synthesis

    King, S., Oct 2011, In : Sadhana-Academy proceedings in engineering sciences. 36, 5, p. 837-852 16 p.

    Research output: Contribution to journalArticle

  15. Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech

    Barra-Chicote, R., Yamagishi, J., King, S., Montero, J. M. & Macias-Guarasa, J., 1 May 2010, In : Speech Communication. 52, 5, p. 394-404 11 p.

    Research output: Contribution to journalArticle

  16. Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping

    Oura, K., Yamagishi, J., Wester, M., King, S. & Tokuda, K., 1 Jul 2012, In : Speech Communication. 54, 6, p. 703-714 12 p.

    Research output: Contribution to journalArticle

  17. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance

    Wu, Z., De Leon, P., Demiroglu, C., Khodabakhsh, A., King, S., Ling, Z., Saito, D., Stewart, B., Toda, T., Wester, M. & Yamagishi, J., Apr 2016, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24, 4, p. 768 - 783 17 p.

    Research output: Contribution to journalArticle

  18. Articulatory feature recognition using dynamic Bayesian networks

    Frankel, J., Wester, M. & King, S., 1 Oct 2007, In : Computer Speech and Language. 21, 4, p. 620-640 21 p.

    Research output: Contribution to journalArticle

  19. Bayesian networks for phone duration prediction

    Goubanova, O. & King, S., Apr 2008, In : Speech Communication. 50, 4, p. 301-311 11 p.

    Research output: Contribution to journalArticle

  20. Combining Lightly-supervised Learning and User Feedback to Construct Andimprove a Statistical Parametric Speech Synthesizer for Malay

    Chee Yong, L., Watts, O. & King, S., 15 Dec 2015, In : Research Journal of Applied Sciences, Engineering and Technology. 11, 11, p. 1227-1232 6 p.

    Research output: Contribution to journalArticle

  21. Comunicaci ́on enriquecida a lo largo de la vida

    Cooke, M., King, S., Hazan, V., Stylianou, Y., Janse, E., Baskent, D., Hohmann, V., Winneke, A. & Hernaez, I., 2019, In : Procesamiento del Lenguaje Natural. 63, p. 175-178

    Research output: Contribution to journalArticle

  22. Cross-Lingual Automatic Speech Recognition Using Tandem Features

    Lal, P. & King, S., Dec 2013, In : IEEE Transactions on Audio, Speech and Language Processing. 21, 12, p. 2506-2515 10 p.

    Research output: Contribution to journalArticle

  23. Dependence and independence in automatic speech recognition and synthesis

    King, S., 2003, In : Journal of Phonetics. 31, 3-4, p. 407-411 5 p.

    Research output: Contribution to journalArticle

  24. Detection of Phonological Features in Continuous Speech using Neural Networks

    King, S. & Taylor, P., 2000, In : Computer Speech and Language. 14, 4, p. 333-353 21 p.

    Research output: Contribution to journalArticle

  25. Direct Posterior Confidence for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Frankel, J., Vipperla, R., Evans, N. & Troncy, R., Aug 2012, In : ACM Transactions on Information Systems. 30, 3, p. - 34 p., 16.

    Research output: Contribution to journalArticle

  26. Enhancing electron affinity and tuning band gap in donor-acceptor organic semiconductors by benzothiadiazole directed C-H borylation

    Crossley, D. L., Cade, I. A., Clark, E. R., Escande, A., Humphries, M. J., King, S. M., Vitorica-Yrezabal, I., Ingleson, M. J. & Turner, M. L., 6 Jun 2015, In : Chemical Science. 6, 9, p. 5144-5151 8 p.

    Research output: Contribution to journalArticle

  27. Factoring Gaussian Precision Matrices for Linear Dynamic Models

    Frankel, J. & King, S., 1 Dec 2007, In : Pattern Recognition Letters. 28, 16, p. 2264-2272 9 p.

    Research output: Contribution to journalArticle

  28. Feature analysis for discriminative confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Wang, D., King, S. & Colas, J., Sep 2014, In : Computer Speech and Language. 28, 5, p. 1083–1114 32 p.

    Research output: Contribution to journalArticle

  29. Impacts of machine translation and speech synthesis on speech-to-speech translation

    Hashimoto, K., Yamagishi, J., Byrne, W., King, S. & Tokuda, K., 2012, In : Speech Communication. 54, 7, p. 857-866 10 p.

    Research output: Contribution to journalArticle

  30. Intelligibility enhancement of HMM-generated speech in additive noise by modifying Mel cepstral coefficients to increase the glimpse proportion

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Maia, R., Mar 2014, In : Computer Speech and Language. 28, 2, p. 665-686 22 p.

    Research output: Contribution to journalArticle

  31. Intonation and Dialogue Context as Constraints for Speech Recognition

    Taylor, P., King, S., Isard, S. D. & Wright, H., 1998, In : Language and Speech. 41, 3, p. 493-512 20 p.

    Research output: Contribution to journalArticle

  32. Introduction to the Special Issue on The listening talker: Context-dependent speech production and perception

    Cooke, M., King, S., Kleijn, W. B. & Stylianou, Y., Mar 2014, In : Computer Speech and Language. 28, 2, p. 540-542

    Research output: Contribution to journalArticle

  33. Letter-to-Sound Pronunciation Prediction Using Conditional Random Fields

    Wang, D. & King, S., 1 Feb 2011, In : IEEE Signal Processing Letters. 18, 2, p. 122-125 4 p.

    Research output: Contribution to journalArticle

  34. Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis

    Mayo, C., Clark, R. A. J. & King, S., 1 Mar 2011, In : Speech Communication. 53, 3, p. 311-326 15 p.

    Research output: Contribution to journalArticle

  35. Measuring a decade of progress in Text-to-Speech

    King, S., Jan 2014, In : Loquens. 1, 1, e006.

    Research output: Contribution to journalArticle

  36. Measuring the Gap Between HMM-Based ASR and TTS

    Dines, J., Yamagishi, J. & King, S., 1 Dec 2010, In : IEEE Journal of Selected Topics in Signal Processing. 4, 6, p. 1046-1058 13 p.

    Research output: Contribution to journalArticle

  37. Modelling the Uncertainty in Recovering Articulation from Acoustics

    Richmond, K., King, S. & Taylor, P., Apr 2003, In : Computer Speech and Language. 17, 2-3, p. 153-172 20 p.

    Research output: Contribution to journalArticle

  38. Multisyn: Open-domain unit selection for the Festival speech synthesis system

    Clark, R. A. J., Richmond, K. & King, S., 2007, In : Speech Communication. 49, 4, p. 317-330 14 p.

    Research output: Contribution to journalArticle

  39. Observation Process Adaptation for Linear Dynamic Models

    Frankel, J. & King, S., 1 Sep 2006, In : Speech Communication. 48, 9, p. 1192-1199 8 p.

    Research output: Contribution to journalArticle

  40. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

    Dines, J., Liang, H., Saheer, L., Gibson, M., Byrne, W., Oura, K., Tokuda, K., Yamagishi, J., King, S., Wester, M., Hirsimäki, T., Karhila, R. & Kurimo, M., Feb 2013, In : Computer Speech and Language. 27, 2, p. 420-437 18 p.

    Research output: Contribution to journalArticle

  41. Post-polymerization C-H Borylation of Donor-Acceptor Materials Gives Highly Efficient Solid State Near-Infrared Emitters for Near-IR-OLEDs and Effective Biological Imaging

    Crossley, D. L., Urbano, L., Neumann, R., Bourke, S., Jones, J., Dailey, L. A., Green, M., Humphries, M. J., King, S. M., Turner, M. L. & Ingleson, M. J., 30 Aug 2017, In : ACS Applied Materials and Interfaces. 9, 34, p. 28243-28249 7 p.

    Research output: Contribution to journalArticle

  42. Recording speech articulation in dialogue: Evaluating a synchronized double electromagnetic articulography setup

    Geng, C., Turk, A., Scobbie, J. M., Macmartin, C., Hoole, P., Richmond, K., Wrench, A., Pouplier, M., Bard, E., Campbell, Z., Dickie, C., Dubourg, E., Hardcastle, W., Kainada, E., King, S., Lickley, R., Nakai, S., Renals, S., White, K. & Wiegand, R., Nov 2013, In : Journal of Phonetics. 41, 6, p. 421-431 11 p.

    Research output: Contribution to journalArticle

  43. Robust Speaker-Adaptive HMM-Based Text-to-Speech Synthesis

    Yamagishi, J., Nose, T., Zen, H., Ling, Z. H., Toda, T., Tokuda, K., King, S. & Renals, S., Aug 2009, In : IEEE Transactions on Audio, Speech and Language Processing. 17, 6, p. 1208-1230 23 p.

    Research output: Contribution to journalArticle

  44. Soft context clustering for F0 modeling in HMM-based speech synthesis

    Khorram, S., Sameti, H. & King, S., 9 Jan 2015, In : EURASIP Journal on Advances in Signal Processing. 2015, 1

    Research output: Contribution to journalArticle

  45. Speech Recognition using Linear Dynamic Models

    Frankel, J. & King, S., 1 Jan 2007, In : IEEE Transactions on Audio, Speech and Language Processing. 15, 1, p. 246-256 11 p.

    Research output: Contribution to journalArticle

  46. Speech production knowledge in automatic speech recognition

    King, S., Frankel, J., Livescu, K., McDermott, E., Richmond, K. & Wester, M., 1 Feb 2007, In : Journal of the acoustical society of america. 121, 2, p. 723-742 20 p.

    Research output: Contribution to journalArticle

  47. Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction

    Yamagishi, J., Veaux, C., King, S. & Renals, S., 2012, In : Acoustical Science and Technology. 33, 1, p. 1-5 5 p.

    Research output: Contribution to journalArticle

  48. Statistical parametric speech synthesis for Ibibio

    Ekpenyong, M., Urua, E-A., Watts, O., King, S. & Yamagishi, J., Jan 2014, In : Speech Communication. 56, p. 243-251 9 p.

    Research output: Contribution to journalArticle

  49. Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S. & Frankel, J., May 2011, In : IEEE Transactions on Audio, Speech and Language Processing. 19, 4, p. 688-698 11 p.

    Research output: Contribution to journalArticle

  50. Subjective Evaluation of Join Cost and Smoothing Methods for Unit Selection Speech Synthesis

    Vepa, J. & King, S., 1 Sep 2006, In : IEEE Transactions on Audio, Speech and Language Processing. 14, 5, p. 1763-1771 9 p.

    Research output: Contribution to journalArticle

  51. Synthesis of Child Speech With HMM Adaptation and Voice Conversion

    Watts, O., Yamagishi, J., King, S. & Berkling, K., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 1005-1016 12 p.

    Research output: Contribution to journalArticle

  52. Term-dependent Confidence Normalization for Out-of-Vocabulary Spoken Term Detection

    Wang, D., Tejedor, J., King, S. & Frankel, J., Mar 2012, In : Journal of Computer Science and Technology. 27, 2, p. 358-375 17 p.

    Research output: Contribution to journalArticle

  53. The Edinburgh Speech Production Facility's articulatory corpus of spontaneous dialogue.

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., Bard, E., Campbell, B., Dickie, C., Dubourg, E., Hardcastle, B., Hoole, P., Kanaida, E., Lickley, R., Nakai, S., Pouplier, M., King, S., Renals, S., Richmond, K., Schaeffler, S., Wiegand, R., White, K. & 1 others, Wrench, A., 2010, In : Journal of the acoustical society of america. 128, 4, p. 2429-2429 1 p.

    Research output: Contribution to journalArticle

  54. The Romanian speech synthesis (RSS) corpus: Building a high quality HMM-based speech synthesis system using a high sampling rate

    Stan, A., Yamagishi, J., King, S. & Aylett, M., Mar 2011, In : Speech Communication. 53, 3, p. 442-450 9 p.

    Research output: Contribution to journalArticle

  55. Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora

    Yamagishi, J., Usabaev, B., King, S., Watts, O., Dines, J., Tian, J., Guan, Y., Hu, R., Oura, K., Wu, Y-J., Tokuda, K., Karhila, R. & Kurimo, M., Jul 2010, In : IEEE Transactions on Audio, Speech and Language Processing. 18, 5, p. 984-1004 21 p.

    Research output: Contribution to journalArticle

  56. Using eigenvoices and nearest-neighbours in HMM-based cross-lingual speaker adaptation with limited data

    Sarfjoo, S. S., Demiroglu, C. & King, S., Apr 2017, In : IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25, 4, p. 839-851 13 p.

    Research output: Contribution to journalArticle

  57. Literature review › Research › Peer-reviewed
  58. The listening talker: A review of human and algorithmic context-induced modifications of speech

    Cooke, M., King, S., Garnier, M. & Aubanel, V., Mar 2014, In : Computer Speech and Language. 28, 2, p. 543-571 29 p.

    Research output: Contribution to journalLiterature review

  59. Editorial › Research › Peer-reviewed
  60. Introduction to the Issue on Statistical Parametric Speech Synthesis

    Tao, J., Hirose, K., Tokuda, K., Black, A. W. & King, S., Apr 2014, In : IEEE Journal of Selected Topics in Signal Processing. 8, 2, p. 170-172 3 p.

    Research output: Contribution to journalEditorial

  61. Digital or Visual Products › Research
  62. The Edinburgh Speech Production Facility Dialogue Corpus

    Turk, A., Scobbie, J., Geng, C., Macmartin, C., King, S. & Renals, S., 2010

    Research output: Non-textual formDigital or Visual Products

  63. Working paper › Research
  64. Final report for Verbmobil Teilprojekt 4.4

    King, S., 1 Jan 1997.

    Research output: Working paper

  65. Inventory design for Verbmobil Teilprojekt 4.4

    King, S., 1 Oct 1996.

    Research output: Working paper

  66. Book › Research
  67. Users Manual for Verbmobil Teilprojekt 4.4

    King, S., 1 Oct 1996, Rheinische Friedrich-Wilhelms-Universität Bonn.

    Research output: Book/ReportBook

  68. Chapter › Research
  69. Speech Synthesis

    King, S., Ellis, D. & Morgan, N., 2011, Speech and Audio Signal Processing. Gold, B., Morgan, N. & Ellis, D. (eds.). 2nd ed. Wiley, p. 431-454 24 p.

    Research output: Chapter in Book/Report/Conference proceedingChapter

  70. Conference contribution › Research
  71. A Classifier-based target cost for unit selection speech synthesis trained on perceptual data

    Strom, V. & King, S., 2010, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  72. A Comparison of Manual and Automatic Voice Repair for Individual with Vocal Disabilities

    Veaux, C., Yamagishi, J. & King, S., Sep 2015, SLPAT 2015, 6th Workshop on Speech and Language Processing for Assistive Technologies. Association for Computational Linguistics (ACL), p. 130-133 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  73. A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis

    Ronanki, S., Watts, O. & King, S., 24 Aug 2017, Proceedings Interspeech 2017. p. 1133-1137 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  74. A Hybrid ANN/DBN Approach to Articulatory Feature Recognition

    Frankel, J. & King, S., 1 Sep 2005, Interspeech 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 3045-3048 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  75. A Lattice-based Approach to Automatic Filled Pause Insertion

    Tomalin, M., Wester, M., Dall, R., Byrne, B. & King, S., 10 Aug 2015, Proc. of DiSS 2015, The 7th Workshop on Disfluencies in Spontaneous Speech. Edinburgh, 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  76. A Posterior Approach for Microphone Array Based Speech Recognition

    Wang, D., Himawan, I., Frankel, J. & King, S., Sep 2008, Interspeech. p. 996-999 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  77. A Posterior Probability-based System Hybridisation and Combination for Spoken Term Detection

    Tejedor, J., Wang, D., King, S., Frankel, J. & Colas, J., Sep 2009, Interspeech. Citeseer, Vol. 2009. p. 2131-2134 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  78. A Shrinkage Estimator for Speech Recognition with Full Covariance HMMs

    Bell, P. & King, S., Sep 2008, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  79. A comparison of phone and grapheme-based spoken term detection

    Wang, D., Frankel, J., Tejedor, J. & King, S., Mar 2008, IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. Institute of Electrical and Electronics Engineers (IEEE), p. 4969-4972 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  80. A grapheme-based method for automatic alignment of speech and text data

    Stan, A., Bell, P. & King, S., 2012, Spoken Language Technology Workshop (SLT), 2012 IEEE. Institute of Electrical and Electronics Engineers (IEEE), p. 286-290 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  81. A novel two-level architecture plus confidence measures for a keyword spotting system.

    Tejedor, J., King, S., Frankel, J., Wang, D., Colas, J. & Garrido, J., Dec 2009, Proceedings of the 5th Biennial Workshop on Speech Technology. p. 247-250 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  82. A reading list of recent advances in speech synthesis

    King, S., 10 Aug 2015, Proc. 18th International Congress of Phonetic Sciences (ICPhS). T. S. C. F. ICP. . (ed.). Glasgow, UK: University of Glasgow

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  83. A study of speaker adaptation for DNN-based speech synthesis

    Wu, Z., Swietojanski, P., Veaux, C., Renals, S. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  84. A tutorial on HMM speech synthesis (Invited paper)

    King, S., 2010, Sadhana -- Academy Proceedings in Engineering Sciences, Indian Institute of Sciences.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  85. ASR - Articulatory Speech Recognition

    Frankel, J. & King, S., 1 Sep 2001, Eurospeech 2001: 7th European Conference on Speech Communication and Technology . International Speech Communication Association, p. 599-602 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  86. Accurate spectral envelope estimation for articulation-to-speech synthesis

    Shiga, Y. & King, S., 1 Jun 2004, Proc. 5th ISCA Speech Synthesis Workshop. International Speech Communication Association, p. 19-24 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  87. An Accent-Independent Lexicon for Automatic Speech Recognition

    Bael, C. V. & King, S., 2003, Proc. ICPhS. p. 1165-1168 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  88. An analysis of machine translation and speech synthesis in speech-to-speech translation system

    Hashimoto, K., Yamagishi, J., Byrne, W., King, S. & Tokuda, K., May 2011, Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. p. 5108-5111 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  89. An articulatory feature-based tandem approach and factored observation modeling

    Cetin, O., Kantor, A., King, S., Bartels, C., Magimai-Doss, M., Frankel, J. & Livescu, K., 1 Apr 2007, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007 (ICASSP 2007). Vol. 4. p. 645-648 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  90. An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces

    Frankel, J., Richmond, K., King, S. & Taylor, P., Oct 2000, Sixth International Conference on Spoken Language Processing (ICSLP 2000). International Speech Communication Association, p. 254-257 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  91. Analysis of Speaker Clustering Strategies for HMM-Based Speech Synthesis

    Dall, R., Veaux, C., Yamagishi, J. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  92. Analysis of unsupervised and noise-robust speaker-adaptive HMM-based speech synthesis systems toward a unified ASR and TTS framework

    Yamagishi, J., Lincoln, M., King, S., Dines, J., Gibson, M., Tian, J. & Guan, Y., Sep 2009, Interspeech 2009 Edinburgh..

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  93. Announcing the Electromagnetic Articulography (Day 1) Subset of the mngu0 Articulatory Corpus

    Richmond, K., Hoole, P. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1505-1508 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  94. Articulatory Feature Classifiers Trained on 2000 hours of Telephone Speech

    Frankel, J., Magimai-Doss, M., King, S., Livescu, K. & Çetin, Ã., 1 Aug 2007, Interspeech 2007: 8th Annual Conference of the International Speech Communication Association. p. 2485-2488 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  95. Articulatory feature recognition using dynamic Bayesian networks

    Frankel, J., Wester, M. & King, S., 1 Sep 2004, Interspeech 2004 - ICSLP: 8th International Conference on Spoken Language Processing. International Speech Communication Association, p. 1477-1480 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  96. Articulatory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU Summer Workshop

    Livescu, K., C‡etin, O., Hasegawa-Johnson, M., King, S., Bartels, C., Borges, N., Kantor, A., Lal, P., Yung, L., Bezman Dawson-Haggerty, S., Woods, B., Frankel, J., Magimai-Doss, M. & Saenko, K., 1 Apr 2007, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2007 (ICASSP 2007). Vol. 4. p. 621-621 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  97. Asynchronous Articulatory Feature Recognition Using Dynamic Bayesian Networks

    Wester, M., Frankel, J. & King, S., 1 Dec 2004, Proc. IEICI Beyond HMM Workshop.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  98. Attentive filtering networks for audio replay attack detection

    Lai, C-I., Abad, A., Richmond, K., Yamagishi, J., Dehak, N. & King, S., 17 Apr 2019, 2019 IEEE International Conference on Acoustics, Speech and Signal Processing. p. 6316-6320

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  99. Augmented set of features for confidence estimation in spoken term detection

    Tejedor, J., Toledano, D. T., Bautista, M., King, S., Wang, D. & Colas, J., 2010, Proc. Interspeech 2010.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  100. CRF-based Stochastic Pronunciation Modelling for Out-of-Vocabulary Spoken Term Detection

    Wang, D., King, S., Evans, N. & Troncy, R., 1 Sep 2010, Interspeech 2010: 11th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1668-1671 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  101. Can Objective Measures Predict the Intelligibility of Modified HMM-based Synthetic Speech in Noise?

    Valentini-Botinhao, C., Yamagishi, J. & King, S., 1 Aug 2011, Interspeech 2011: 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association, p. 1837-1840 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  102. Cepstral analysis based on the Glimpse proportion measure for improving the intelligibility of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Maia, R., Yamagishi, J., King, S. & Zen, H., 2012, 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP): Kyoto, Japan. NEW YORK: Institute of Electrical and Electronics Engineers (IEEE), p. 3997-4000 4 p. (IEEE International Conference on Acoustics Speech and Signal Processing).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  103. Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis

    Lu, H., King, S. & Watts, O., 1 Aug 2013, 8th ISCA Speech Synthesis Workshop. p. 261-265 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  104. Combining in-domain and out-of-domain speech data for automatic recognition of disordered speech

    Christensen, H., Aniol, M., Bell, P., Green, P., Hain, T., King, S. & Swietojanski, P., 1 Aug 2013, Proc. Interspeech 2013. ISCA

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  105. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise

    Valentini-Botinhao, C., Yamagishi, J., King, S. & Stylianou, Y., 1 Aug 2013, Interspeech 2013.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  106. Covariance Updates for Discriminative Training by Constrained Line Search

    Bell, P. & King, S., Sep 2008, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  107. Cross-lingual Portability of MLP-Based Tandem Features--A Case Study for English and Hungarian

    Toth, L., Frankel, J., Gosztolya, G. & King, S., 2008, Proc. Interspeech. p. 2695-2698 4 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  108. DNN-based Speech Synthesis for Indian Languages from ASCII text

    Ronanki, S., Gangireddy, S. R., Bollepalli, B. & King, S., 15 Sep 2016, 9th ISCA Speech Synthesis Workshop. p. 74-79 6 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  109. Deep neural network context embeddings for model selection in rich-context HMM synthesis

    Merritt, T., Yamagishi, J., Wu, Z., Watts, O. & King, S., 6 Sep 2015, Proceedings of Interspeech 2015. Dresden: International Speech Communication Association

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  110. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis.

    Wu, Z., Valentini-Botinhao, C., Watts, O. & King, S., 1 Apr 2015, Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane, Australia, p. 4460-4464 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  111. Detecting Acronyms from Capital Letter Sequences in Spanish

    San-Segundo, R., Montero, J. M., Lopez-Luden, V. & King, S., 1 Sep 2012, Proc. Interspeech.

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

Previous 1 2 3 Next