Skip to main navigation Skip to search Skip to main content

Scan Patterns Predict Sentence Production in the Cross-Modal Processing of Visual Scenes

Moreno Coco, Frank Keller

Research output: Contribution to journalArticlepeer-review

Abstract

Most everyday tasks involve multiple modalities, which raises the question of how the processing of these modalities is coordinated by the cognitive system. In this paper, we focus on the coordination of visual attention and linguistic processing during speaking. Previous research has shown that objects in a visual scene are fixated before they are mentioned, leading us to hypothesize that the scan pattern of a participant can be used to predict what they will say. We test this hypothesis using a data set of cued scene descriptions of photo-realistic scenes. We demonstrate that similar scan patterns are correlated with similar sentences, within and between visual scenes; and that this correlation holds for three phases of the language production process (target identification, sentence planning, and speaking). We also present a simple algorithm that uses scan patterns to accurately predict associated sentences by utilizing similarity-based retrieval.
Keywords: Scan patterns; eye-movements; language production; scene understanding; cross-model processing; similarity measures
Original languageEnglish
Pages (from-to)1204-1223
Number of pages9
JournalCognitive Science: A Multidisciplinary Journal
Volume36
Issue number7
DOIs
Publication statusPublished - Sept 2012

Fingerprint

Dive into the research topics of 'Scan Patterns Predict Sentence Production in the Cross-Modal Processing of Visual Scenes'. Together they form a unique fingerprint.

Cite this