 |
| Volume 2, Number 1, Introduction i, Page i |
doi:10.1167/2.1.i |
http://journalofvision.org/2/1/i/ |
ISSN 1534-7362 |
Special Issue Introduction
Classification images: A tool to analyze visual strategies
Miguel P. Eckstein |
Department of Psychology, University of California, Santa Barbara, CA, USA |
|
Albert J. Ahumada, Jr. |
NASA Ames Research Center, Moffett Field, CA, USA |
|
Introduction
Every year experimental techniques appear or resurface
that provide tools and theoretical frameworks with which to study the mechanisms
and processes involved in human visual perception. Occasionally, one of these
techniques will resonate and trigger a wave of excitement among a group of
researchers in the field, and will lead to a large number of studies using the
method. This was so in the case of the technique known as
classification
images which was first presented at the
European Conference on Visual Perception
( Ahumada, 1996).
The technique has still earlier roots in auditory work
( Ahumada & Lovell, 1971), wherein
multiple regression analysis was applied to the problem of auditory tone
detection in noise, to estimate the contribution of auditory stimulus features
to the observer’s decision variable. The regression weights plotted as a
function of feature temporal frequency could be called classification plots. The
central concept of the technique is the correlation of observer decisions with
noisy stimulus features over sets of stimuli. From the correlation of the
features with the decisions and the inter-correlations among the features, the
investigator can then estimate how the observer is weighting the stimulus
features to reach a decision.
In the 1980´s and 1990´s a related technique,
reverse correlation, was used to
estimate the features of receptive fields of the visual cortex (e.g.,
Jones & Palmer, 1987;
Ohzawa, DeAngelis, & Freeman, 1996;
Ringach, Hawken, & Shapley 1997).
Ahumada and Beard applied the technique in visual
psychophysics to study vernier acuity tasks
( Ahumada, 1996;
Beard & Ahumada, 1998). They used image
pixel intensity as the stimulus features, so that the stimulus feature weights
did form an actual classification image. The study revealed that human
observers weighted the visual information of the stimuli differently from the
optimum (ideal) observer. However, perhaps more importantly, it provided a first
glance at the technique for other scientists, and sparked their imagination
about the many questions that might be addressed using the technique. Could it
be used to study stereo, illusory contours, letter discrimination and perceptual
learning? Could it be used with tasks other than the yes/no task? What would
be the best (most efficient) algorithm to estimate the classification
images?
Why do classification images appeal to vision
scientists? Is it the visual nature of the classification image, or is it its
exploratory nature that resembles a sort of archeology of perceptual processes?
For whatever reason, soon after 1996 a number of researchers started using the
technique to study a variety of problems in human visual perception including
letter discrimination
( Watson & Rosenholtz, 1997),
perceptual learning
( Knoblauch, Thomas, & D'Zmura, 1999),
illusory contours
( Gold, Murray, Bennett, & Sekuler, 2000),
foveal versus peripheral vernier performance
( Beard & Ahumada, 1999), stereo
( Neri, Parker, & Blakemore, 1999), and
off-frequency looking in non-white noise
( Abbey & Eckstein, 2000).
On the other hand, some researchers have watched with
skepticism from the sidelines the development and use of the classification
image technique. Can one really know whether an empirical classification image
is meaningful or simply noise? Can the experimenter´s interpretation of the
classification image be biased so that he/she simply sees confirmatory evidence
of a hypothesis? How can one reach any meaningful conclusions when comparing
two classification images? What are the underlying assumptions of the technique
and how meaningful is the technique when the assumptions are violated? All of
these questions should have clear answers for a mature scientific method, and
should oblige investigators to use scientific rigor in the use of the technique.
These questions have indeed motivated investigators to develop sound procedures
to estimate the classification image and to test different hypotheses about the
obtained classification images (e.g. Is the classification image statistically
significantly different from noise? Is it significantly different from the
classification image of the ideal observer?)
The papers in this special issue of the
Journal of Vision describe methods to
estimate classification images and to test hypotheses about them. The papers
also provide examples of the wide variety of topics and questions in visual
perception that can be addressed using the technique. As guest editors for this
special issue, we hope it will provide a useful starting point for researchers
interested in embarking in projects using this novel and exciting
technique.
References
Abbey, C. K., & Eckstein,
M. P. (2000). Estimates of human-observer templates for simple detection tasks
in correlated noise. Proceedings of the SPIE,
3981, 70-77.
Ahumada, A. J., Jr. (1996).
Perceptual classification images from Vernier acuity masked by noise [Abstract].
Perception, 26,
18.[ Link]
Ahumada, A. J., Jr., &
Lovell, J. (1971). Stimulus features in signal detection.
Journal of the Acoustical Society of America,
49, 1751-1756.
Beard, B. L., & Ahumada,
A. J., Jr. (1998). A technique to extract relevant image features for visual
tasks. SPIE Proceedings, 3299,
79-85.
Beard, B. L., & Ahumada,
A. J., Jr. (1999). Detection in fixed and random noise in foveal and parafoveal
vision explained by template learning. Journal
of the Optical Society of America A, 16(3), 755-763.
[ PubMed]
Gold, J. M., Murray, R. F.,
Bennett, P. J., & Sekuler, A. B. (2000). Deriving behavioural receptive
fields for visually completed contours.
Current Biology, 10,
663-666..[ PubMed]
Jones, J. P., & Palmer, L.
A. (1987). The two-dimensional spatial structure of simple receptive fields in
cat striate cortex. Journal of
Neurophysiology, 58, 1187-1211.
[ PubMed]
Knoblauch, K., Thomas, J.
P., & D'Zmura, M. (1999). Feedback temporal frequency and stimulus
classification [Abstract]. Investigative
Ophthalmology and Visual Science, 40(4), 4171.
Neri, P., Parker, A. J., &
Blakemore, C. (1999). Probing the human stereoscopic system with reverse
correlation. Nature, 401,
695-698.[ PubMed]
Ohzawa, I., DeAngelis, G. C.,
& Freeman, R. D. (1996). Encoding of binocular disparity by simple cells in
the cat's visual cortex. J Neurophysiol,
75(5), 1779-1805.
[ PubMed]
Ringach,
D. L., Hawken, M. J., & Shapley, R. (1997). Dynamics of orientation tuning
in macaque primary visual cortex. Nature,
387(6630), 281-284.
[ PubMed]
Watson, A. B., & Rosenholtz, R. (1997). A
Rorschach test for visual classification strategies [Abstract].
Investigative Ophthalmology & Visual
Science, 38(4), 2.
|
|