Projects per year
Abstract
The cognitive mechanisms needed to account for the English past tense have long been a subject of debate in linguistics and cognitive science. Neural network models were proposed early on, but were shown to have clear flaws. Recently, however, Kirov and Cotterell (2018) showed that modern encoder-decoder (ED) models overcome many of these flaws. They also presented evidence that ED models demonstrate humanlike performance in a nonce-word task. Here, we look more closely at the behaviour of their model in this task. We find that (1) the model exhibits instability across multiple simulations in terms of its correlation with human data, and (2) even when results are aggregated across simulations (treating each simulation as an individual human participant), the fit to the human data is not strong—worse than an older rule-based model. These findings hold up through several alternative training regimes and evaluation measures. Although other neural architectures might do better, we conclude that there is still insufficient evidence to claim that neural nets are a good cognitive model for this task.
Original language | English |
---|---|
Title of host publication | Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics |
Editors | Anna Korhonen, David Traum, Lluís Màrquez |
Place of Publication | Florence, Italy |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 3868–3877 |
Number of pages | 10 |
Publication status | Published - 2 Aug 2019 |
Event | 57th Annual Meeting of the Association for Computational Linguistics - Fortezza da Basso, Florence, Italy Duration: 28 Jul 2019 → 2 Aug 2019 Conference number: 57 http://www.acl2019.org/EN/index.xhtml |
Conference
Conference | 57th Annual Meeting of the Association for Computational Linguistics |
---|---|
Abbreviated title | ACL 2019 |
Country/Territory | Italy |
City | Florence |
Period | 28/07/19 → 2/08/19 |
Internet address |
Fingerprint
Dive into the research topics of 'Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection'. Together they form a unique fingerprint.Projects
- 1 Finished