View-dependent object recognition by monkeys

N. K. Logothetis, J. Pauls, Heinrich Bulthoff, T. Poggio

Research output: Contribution to journalArticle

218 Citations (Scopus)

Abstract

Background: How do we recognize visually perceived three-dimensional objects, particularly when they are seen from novel view-points? Recent psychophysical studies have suggested that the human visual system may store a relatively small number of two-dimensional views of a three-dimensional object, recognizing novel views of the object by interpolation between the stored sample views. In order to investigate the neural mechanisms underlying this process, physiological experiments are required and, as a prelude to such experiments, we have been interested to know whether the observations made with human observers extend to monkeys. Results: We trained monkeys to recognize computer-generated images of objects presented from an arbitrarily chosen training view and containing sufficient three-dimensional information to specify the object's structure. We subsequently tested the trained monkeys' ability to generalize recognition of the object to views generated by rotation of the target object around any arbitrary axis. The monkeys recognized as the target only those two-dimensional views that were close to the familiar, training view. Recognition became increasingly difficult for the monkeys as the stimulus was rotated away from the experienced viewpoint, and failed for views farther than about 40° from the training view. This suggests that, in the early stages of learning to recognize a previously unfamiliar object, the monkeys build two-dimensional, viewer-centered object representations, rather than a three-dimensional model of the object. When the animals were trained with as few as three views of the object, 120° apart, they could often recognize all the views of the object resulting from rotations around the same axis. Conclusion: Our experiments show that recognition of three-dimensional novel objects is a function of the object's retinal projection. This suggests that non-human primates, like humans, may accomplish view-invariant recognition of familiar objects by a viewer-centered system that interpolates between a small number of stored views. The measures of recognition performance can be simulated by a regularization network that stores a few familiar views, and is endowed with the ability to interpolate between these views. Our results provide the basis for physiological studies of object-recognition by monkeys and suggest that the insights gained from such studies should apply also to humans.

Original languageEnglish
Pages (from-to)401-414
Number of pages14
JournalCurrent Biology
Volume4
Issue number5
Publication statusPublished - 1994 Dec 1
Externally publishedYes

Fingerprint

Object recognition
Haplorhini
monkeys
Experiments
Aptitude
Interpolation
Animals
Physiological Phenomena
Recognition (Psychology)
Primates
learning
Learning
animals

ASJC Scopus subject areas

  • Agricultural and Biological Sciences(all)

Cite this

Logothetis, N. K., Pauls, J., Bulthoff, H., & Poggio, T. (1994). View-dependent object recognition by monkeys. Current Biology, 4(5), 401-414.

View-dependent object recognition by monkeys. / Logothetis, N. K.; Pauls, J.; Bulthoff, Heinrich; Poggio, T.

In: Current Biology, Vol. 4, No. 5, 01.12.1994, p. 401-414.

Research output: Contribution to journalArticle

Logothetis, NK, Pauls, J, Bulthoff, H & Poggio, T 1994, 'View-dependent object recognition by monkeys', Current Biology, vol. 4, no. 5, pp. 401-414.
Logothetis NK, Pauls J, Bulthoff H, Poggio T. View-dependent object recognition by monkeys. Current Biology. 1994 Dec 1;4(5):401-414.
Logothetis, N. K. ; Pauls, J. ; Bulthoff, Heinrich ; Poggio, T. / View-dependent object recognition by monkeys. In: Current Biology. 1994 ; Vol. 4, No. 5. pp. 401-414.
@article{17bf90da293f493aad9b9a4d92fcf129,
title = "View-dependent object recognition by monkeys",
abstract = "Background: How do we recognize visually perceived three-dimensional objects, particularly when they are seen from novel view-points? Recent psychophysical studies have suggested that the human visual system may store a relatively small number of two-dimensional views of a three-dimensional object, recognizing novel views of the object by interpolation between the stored sample views. In order to investigate the neural mechanisms underlying this process, physiological experiments are required and, as a prelude to such experiments, we have been interested to know whether the observations made with human observers extend to monkeys. Results: We trained monkeys to recognize computer-generated images of objects presented from an arbitrarily chosen training view and containing sufficient three-dimensional information to specify the object's structure. We subsequently tested the trained monkeys' ability to generalize recognition of the object to views generated by rotation of the target object around any arbitrary axis. The monkeys recognized as the target only those two-dimensional views that were close to the familiar, training view. Recognition became increasingly difficult for the monkeys as the stimulus was rotated away from the experienced viewpoint, and failed for views farther than about 40° from the training view. This suggests that, in the early stages of learning to recognize a previously unfamiliar object, the monkeys build two-dimensional, viewer-centered object representations, rather than a three-dimensional model of the object. When the animals were trained with as few as three views of the object, 120° apart, they could often recognize all the views of the object resulting from rotations around the same axis. Conclusion: Our experiments show that recognition of three-dimensional novel objects is a function of the object's retinal projection. This suggests that non-human primates, like humans, may accomplish view-invariant recognition of familiar objects by a viewer-centered system that interpolates between a small number of stored views. The measures of recognition performance can be simulated by a regularization network that stores a few familiar views, and is endowed with the ability to interpolate between these views. Our results provide the basis for physiological studies of object-recognition by monkeys and suggest that the insights gained from such studies should apply also to humans.",
author = "Logothetis, {N. K.} and J. Pauls and Heinrich Bulthoff and T. Poggio",
year = "1994",
month = "12",
day = "1",
language = "English",
volume = "4",
pages = "401--414",
journal = "Current Biology",
issn = "0960-9822",
publisher = "Cell Press",
number = "5",

}

TY - JOUR

T1 - View-dependent object recognition by monkeys

AU - Logothetis, N. K.

AU - Pauls, J.

AU - Bulthoff, Heinrich

AU - Poggio, T.

PY - 1994/12/1

Y1 - 1994/12/1

N2 - Background: How do we recognize visually perceived three-dimensional objects, particularly when they are seen from novel view-points? Recent psychophysical studies have suggested that the human visual system may store a relatively small number of two-dimensional views of a three-dimensional object, recognizing novel views of the object by interpolation between the stored sample views. In order to investigate the neural mechanisms underlying this process, physiological experiments are required and, as a prelude to such experiments, we have been interested to know whether the observations made with human observers extend to monkeys. Results: We trained monkeys to recognize computer-generated images of objects presented from an arbitrarily chosen training view and containing sufficient three-dimensional information to specify the object's structure. We subsequently tested the trained monkeys' ability to generalize recognition of the object to views generated by rotation of the target object around any arbitrary axis. The monkeys recognized as the target only those two-dimensional views that were close to the familiar, training view. Recognition became increasingly difficult for the monkeys as the stimulus was rotated away from the experienced viewpoint, and failed for views farther than about 40° from the training view. This suggests that, in the early stages of learning to recognize a previously unfamiliar object, the monkeys build two-dimensional, viewer-centered object representations, rather than a three-dimensional model of the object. When the animals were trained with as few as three views of the object, 120° apart, they could often recognize all the views of the object resulting from rotations around the same axis. Conclusion: Our experiments show that recognition of three-dimensional novel objects is a function of the object's retinal projection. This suggests that non-human primates, like humans, may accomplish view-invariant recognition of familiar objects by a viewer-centered system that interpolates between a small number of stored views. The measures of recognition performance can be simulated by a regularization network that stores a few familiar views, and is endowed with the ability to interpolate between these views. Our results provide the basis for physiological studies of object-recognition by monkeys and suggest that the insights gained from such studies should apply also to humans.

AB - Background: How do we recognize visually perceived three-dimensional objects, particularly when they are seen from novel view-points? Recent psychophysical studies have suggested that the human visual system may store a relatively small number of two-dimensional views of a three-dimensional object, recognizing novel views of the object by interpolation between the stored sample views. In order to investigate the neural mechanisms underlying this process, physiological experiments are required and, as a prelude to such experiments, we have been interested to know whether the observations made with human observers extend to monkeys. Results: We trained monkeys to recognize computer-generated images of objects presented from an arbitrarily chosen training view and containing sufficient three-dimensional information to specify the object's structure. We subsequently tested the trained monkeys' ability to generalize recognition of the object to views generated by rotation of the target object around any arbitrary axis. The monkeys recognized as the target only those two-dimensional views that were close to the familiar, training view. Recognition became increasingly difficult for the monkeys as the stimulus was rotated away from the experienced viewpoint, and failed for views farther than about 40° from the training view. This suggests that, in the early stages of learning to recognize a previously unfamiliar object, the monkeys build two-dimensional, viewer-centered object representations, rather than a three-dimensional model of the object. When the animals were trained with as few as three views of the object, 120° apart, they could often recognize all the views of the object resulting from rotations around the same axis. Conclusion: Our experiments show that recognition of three-dimensional novel objects is a function of the object's retinal projection. This suggests that non-human primates, like humans, may accomplish view-invariant recognition of familiar objects by a viewer-centered system that interpolates between a small number of stored views. The measures of recognition performance can be simulated by a regularization network that stores a few familiar views, and is endowed with the ability to interpolate between these views. Our results provide the basis for physiological studies of object-recognition by monkeys and suggest that the insights gained from such studies should apply also to humans.

UR - http://www.scopus.com/inward/record.url?scp=0028429937&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0028429937&partnerID=8YFLogxK

M3 - Article

C2 - 7922354

AN - SCOPUS:0028429937

VL - 4

SP - 401

EP - 414

JO - Current Biology

JF - Current Biology

SN - 0960-9822

IS - 5

ER -