Grapheme and multilingual posterior features for under-resourced speech recognition: a study on Scottish Gaelic

Ramya Rasipuram, Peter Bell, Mathew Magimai-Doss

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Standard automatic speech recognition (ASR) systems use phonemes as subword units. Thus, one of the primary resources required to build a good ASR system is a well developed phoneme pronunciation lexicon. However, under-resourced languages typically lack such lexical resources. In this paper, we investigate recently proposed grapheme-based ASR in the framework of Kullback-Leibler divergence based hidden Markov model (KL-HMM) for under-resourced languages, particularly Scottish Gaelic which has no lexical resources. More specifically, we study the use of grapheme and multilingual phoneme class conditional probabilities (posterior features) as feature observations in the KL-HMM. ASR studies conducted show that the proposed approach yields better system compared to the conventional HMM/GMM approach using cepstral features. Furthermore, grapheme posterior features estimated using both auxiliary data and Gaelic data yield the best system.
Original languageEnglish
Title of host publicationAcoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
PublisherInstitute of Electrical and Electronics Engineers
Pages7334-7338
Number of pages5
ISBN (Print)978-1-4799-0356-6
DOIs
Publication statusPublished - 1 May 2013

Fingerprint

Dive into the research topics of 'Grapheme and multilingual posterior features for under-resourced speech recognition: a study on Scottish Gaelic'. Together they form a unique fingerprint.
  • Natural Speech Technology

    Renals, S. (Principal Investigator) & King, S. (Co-investigator)

    EPSRC

    1/05/1131/07/16

    Project: Research

Cite this