Abstract
Theories of visual object recognition must solve the problem of recognizing 3D objects given that perceivers only receive 2D patterns of light on their retinae. Recent findings from human psychophysics, neurophysiology and machine vision provide converging evidence for 'image-based' models in which objects are represented as collections of viewpoint-specific local features. This approach is contrasted with 'structural-description' models in which objects are represented as configurations of 3D volumes or parts. We then review recent behavioral results that address the biological plausibility of both approaches, as well as some of their computational advantages and limitations. We conclude that, although the image-based approach holds great promise, it has potential pitfalls that may be best overcome by including structural information. Thus, the most viable model of object recognition may be one that incorporates the most appealing aspects of both image-based and structural-description theories.
Original language | English |
---|---|
Pages (from-to) | 1-20 |
Number of pages | 20 |
Journal | Cognition |
Volume | 67 |
Issue number | 1-2 |
DOIs | |
Publication status | Published - 1998 Jul 17 |
Keywords
- Image-based model
- Object recognition
- Structural description
ASJC Scopus subject areas
- Language and Linguistics
- Experimental and Cognitive Psychology
- Developmental and Educational Psychology
- Linguistics and Language
- Cognitive Neuroscience