Volume 7, Number 9, Abstract 798, Page 798a doi:10.1167/7.9.798 http://journalofvision.org/7/9/798/ ISSN 1534-7362
Unsupervised learning of higher order statistics of visual features: evidence for relational encoding
Elan Barenholtz
Brown University Dept. of Cognitive and Linguistic Sciences
Michael J. Tarr
Brown University Dept. of Cognitive and Linguistic Sciences
Abstract

A number of important theories of visual recognition assume that objects are represented on the basis of parts and their explicitly defined relations (e.g., Biederman, 1987). Much of the evidence pertaining to the encoding of relations between features (as opposed to encoding of the features themselves) comes from so-called ‘configural effects’, such as the advantage in recognizing one part of a face when other parts are present (Tanaka & Sengco, 1997). However, many of these findings might be explained by invoking ‘larger’ features that incorporate multiple smaller features, e.g., using a single ‘eye-and-nose’ feature rather than separate ‘eye’ and ‘nose’ features in a particular spatial relationship. The current research aims to dissociate the roles of features and their relations: subjects viewed patterns composed of multiple, distinct polygonal shapes in which the spatial relations between spatially non-contiguous features were controlled, so that specific pairs of features (‘base-pairs’) always appeared together, and in the same spatial relation to one another, across multiple patterns. Across three experiments, we tested whether subjects had learned the statistics of the patterns in terms of the joint and conditional probability of the positions of base-pair features. Our results showed that subjects could learn the statistical properties of non-contiguous features while discounting the properties of features located between them. These results are inconsistent with a plausible ‘larger feature’ hypothesis, which would necessarily include the spatially intermediate features, and provide direct support for the explicit encoding of relations between features in unsupervised learning.
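As a rough illustration of the statistics mentioned above, the following sketch computes the joint and conditional probability that two shapes appear in a fixed spatial relation across a set of patterns. Everything here is an assumption made for illustration: the grid representation, the shape labels, the toy patterns, and the function name `relation_stats` are hypothetical and are not taken from the experiments themselves.

```python
from typing import Dict, List, Tuple

Position = Tuple[int, int]      # (row, column) on a hypothetical grid
Pattern = Dict[Position, str]   # grid position -> shape label


def relation_stats(
    patterns: List[Pattern], first: str, second: str, offset: Position
) -> Tuple[float, float]:
    """Return (joint, conditional) probabilities over a set of patterns.

    joint:       fraction of all patterns in which `second` sits at
                 `offset` from some occurrence of `first`.
    conditional: the same count, divided by the number of patterns
                 that contain `first` at all.
    """
    if not patterns:
        raise ValueError("at least one pattern is required")
    n_first = 0
    n_joint = 0
    for pattern in patterns:
        first_positions = [p for p, shape in pattern.items() if shape == first]
        if not first_positions:
            continue
        n_first += 1
        # Look for `second` at the fixed offset from any occurrence of `first`.
        if any(
            pattern.get((r + offset[0], c + offset[1])) == second
            for r, c in first_positions
        ):
            n_joint += 1
    joint = n_joint / len(patterns)
    conditional = n_joint / n_first if n_first else 0.0
    return joint, conditional


# Toy patterns (assumed): "A" and "B" form a base-pair two columns apart,
# while the shape between them varies from pattern to pattern.
patterns: List[Pattern] = [
    {(0, 0): "A", (0, 1): "X", (0, 2): "B"},
    {(1, 0): "A", (1, 1): "Y", (1, 2): "B"},
    {(0, 0): "C", (0, 1): "X", (0, 2): "D"},
    {(2, 0): "A", (2, 1): "Z", (2, 2): "B"},
]

base_pair = relation_stats(patterns, "A", "B", (0, 2))
intermediate = relation_stats(patterns, "A", "X", (0, 1))
```

In this toy set the non-contiguous pair is perfectly predictable (the conditional probability of "B" given "A" is 1), whereas the intermediate shape is not, which is the kind of contrast a relational account would rely on.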
References:
Biederman, I. (1987). Recognition by components: A theory of human image understanding. Psychological Review, 94, 115-147.
Tanaka, J. W., & Sengco, J. A. (1997). Features and their configuration in face recognition. Memory & Cognition, 25, 583-592.
Both authors funded by NGA Award #HM1582-04-C-0051.

History
Received April 27, 2007; published June 30, 2007
Citation
Barenholtz, E., & Tarr, M. J. (2007). Unsupervised learning of higher order statistics of visual features: evidence for relational encoding [Abstract]. Journal of Vision, 7(9):798, 798a, http://journalofvision.org/7/9/798/, doi:10.1167/7.9.798.