Projects per year
Abstract
The usual attention mechanisms used for encoder-decoder models do not constrain the relationship between input and output sequences to be monotonic. To address this we explore windowed attention mechanisms which restrict attention to a block of source hidden states. Rule-based windowing restricts attention to a (typically large) fixed-length window. The performance of such methods is poor if the window size is small. In this paper, we propose a fully-trainable windowed attention and provide a detailed analysis on the factors which affect the performance of such an attention mechanism. Compared to the rule-based window methods, the learned window size is significantly smaller yet the model's performance is competitive. On the TIMIT corpus this approach has resulted in a 17% (relative) performance improvement over the traditional attention model. Our model also yields comparable accuracies to the joint CTC-attention model on the Wall Street Journal corpus.
Original language | English |
---|---|
Title of host publication | ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Place of Publication | Brighton, United Kingdom |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Pages | 7100-7104 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-4799-8131-1 |
ISBN (Print) | 978-1-4799-8132-8 |
DOIs | |
Publication status | E-pub ahead of print - 17 Apr 2019 |
Event | 44th International Conference on Acoustics, Speech, and Signal Processing: Signal Processing: Empowering Science and Technology for Humankind - Brighton , United Kingdom Duration: 12 May 2019 → 17 May 2019 Conference number: 44 https://2019.ieeeicassp.org/ |
Publication series
Name | |
---|---|
Publisher | IEEE |
ISSN (Print) | 1520-6149 |
ISSN (Electronic) | 2379-190X |
Conference
Conference | 44th International Conference on Acoustics, Speech, and Signal Processing |
---|---|
Abbreviated title | ICASSP 2019 |
Country/Territory | United Kingdom |
City | Brighton |
Period | 12/05/19 → 17/05/19 |
Internet address |
Keywords
- End-to-end
- Speech recognition
- Attention
Fingerprint
Dive into the research topics of 'Windowed Attention Mechanisms for Speech Recognition'. Together they form a unique fingerprint.Projects
- 2 Finished
-
-
Distant speech recognition of overlapping speech
UK industry, commerce and public corporations
1/10/17 → 30/09/21
Project: Research