Investigation of Maxout Networks for Speech Recognition

P. Swietojanski, J. Li, J-T Huang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We explore the use of maxout neurons in various aspects of acoustic modelling for large-vocabulary speech recognition systems, including low-resource scenarios and multilingual knowledge transfer. In experiments on voice search and short message dictation datasets, we found that maxout networks are around three times faster to train and offer lower or comparable word error rates on several tasks, compared with networks using a logistic nonlinearity. We also present a detailed study of the internal behaviour of maxout units, which suggests using different nonlinearities in different layers.
Original language: English
Title of host publication: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Pages: 7649-7653
Number of pages: 5
Publication status: Published - 2014
