Capturing the objects of vision with neural networks

Benjamin Peters*, Nikolaus Kriegeskorte*

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

Abstract / Description of output

Human visual perception carves a scene at its physical joints, decomposing the world into objects, which are selectively attended, tracked and predicted as we engage our surroundings. Object representations emancipate perception from the sensory input, enabling us to keep in mind that which is out of sight and to use perceptual content as a basis for action and symbolic cognition. Human behavioural studies have documented how object representations emerge through grouping, amodal completion, proto-objects and object files. By contrast, deep neural network models of visual object recognition remain largely tethered to sensory input, despite achieving human-level performance at labelling objects. Here, we review related work in both fields and examine how these fields can help each other. The cognitive literature provides a starting point for the development of new experimental tasks that reveal mechanisms of human object perception and serve as benchmarks driving the development of deep neural network models that will put the object into object recognition.
Original languageEnglish
Pages (from-to)1127-1144
Number of pages18
JournalNature Human Behaviour
Volume5
DOIs
Publication statusPublished - 20 Sept 2021

Fingerprint

Dive into the research topics of 'Capturing the objects of vision with neural networks'. Together they form a unique fingerprint.

Cite this