TY - JOUR
T1 - Capturing the objects of vision with neural networks
AU - Peters, Benjamin
AU - Kriegeskorte, Nikolaus
PY - 2021/9/20
Y1 - 2021/9/20
N2 - Human visual perception carves a scene at its physical joints, decomposing the world into objects, which are selectively attended, tracked and predicted as we engage our surroundings. Object representations emancipate perception from the sensory input, enabling us to keep in mind that which is out of sight and to use perceptual content as a basis for action and symbolic cognition. Human behavioural studies have documented how object representations emerge through grouping, amodal completion, proto-objects and object files. By contrast, deep neural network models of visual object recognition remain largely tethered to sensory input, despite achieving human-level performance at labelling objects. Here, we review related work in both fields and examine how these fields can help each other. The cognitive literature provides a starting point for the development of new experimental tasks that reveal mechanisms of human object perception and serve as benchmarks driving the development of deep neural network models that will put the object into object recognition.
AB - Human visual perception carves a scene at its physical joints, decomposing the world into objects, which are selectively attended, tracked and predicted as we engage our surroundings. Object representations emancipate perception from the sensory input, enabling us to keep in mind that which is out of sight and to use perceptual content as a basis for action and symbolic cognition. Human behavioural studies have documented how object representations emerge through grouping, amodal completion, proto-objects and object files. By contrast, deep neural network models of visual object recognition remain largely tethered to sensory input, despite achieving human-level performance at labelling objects. Here, we review related work in both fields and examine how these fields can help each other. The cognitive literature provides a starting point for the development of new experimental tasks that reveal mechanisms of human object perception and serve as benchmarks driving the development of deep neural network models that will put the object into object recognition.
UR - http://www.scopus.com/inward/record.url?scp=85115265810&partnerID=8YFLogxK
U2 - 10.1038/s41562-021-01194-6
DO - 10.1038/s41562-021-01194-6
M3 - Review article
C2 - 34545237
AN - SCOPUS:85115265810
SN - 2397-3374
VL - 5
SP - 1127
EP - 1144
JO - Nature Human Behaviour
JF - Nature Human Behaviour
ER -