[Editors] MIT reports new insights in visual recognition
MIT News Office
newsoffice at MIT.EDU
Thu Apr 1 14:26:31 EST 2004
MIT News Office
Massachusetts Institute of Technology
Room 11-400
77 Massachusetts Avenue
Cambridge, MA 02139-4307
Phone: 617-253-2700
http://web.mit.edu/newsoffice/www
======================================
MIT team reports new insights in visual recognition
======================================
For Immediate Release
THURSDAY, APR. 1, 2004
Contact: Elizabeth A. Thomson, MIT News Office
Phone: 617-258-5402
Email: thomson at mit.edu
--PHOTOS AVAILABLE--
CAMBRIDGE, Mass.--MIT scientists are reporting new insights into how
the human brain recognizes objects, especially faces, in work that
could lead to improved machine vision systems, diagnostics for certain
neurological conditions and more.
Look at a photo of people running a marathon. The lead runners' faces
are quite distinct, but we can also make out the faces of those farther
in the distance.
Zoom in on that distant runner, however, "and you'll see that there's
very little intrinsic face-related information, such as eyes and a
nose. It's just a diffuse blob. Yet somehow we can classify that blob
as a face," said Pawan Sinha, an assistant professor in the Department
of Brain and Cognitive Sciences (BCS). In contrast, performing this
task reliably is beyond even the most advanced computer-recognition
systems.
In the April 2 issue of Science, Sinha and colleagues show that a
specific brain region known to be activated by clear images of faces is
also strongly activated by very blurred images, just so long as
surrounding contextual cues (such as a body) are present. "In other
words, the neural circuitry in the human brain can use context to
compensate for extreme levels of image degradations," Sinha said.
Past studies of human behavior and the work of many artists have
suggested that context plays a role in recognition. "What is novel
about this work is that it provides direct evidence of contextual cues
eliciting object-specific neural responses in the brain," Sinha said.
The team used functional magnetic resonance imaging to map neuronal
responses of the brain's fusiform face area (FFA) to a variety of
images. These included clear faces, blurred faces attached to bodies,
blurred faces alone, bodies alone, and a blurred face placed in the
wrong context (below the torso, for example).
Only the clear faces and blurred faces with proper contextual cues
elicited strong FFA responses. "These data support the idea that facial
representations underlying FFA activity are based not only on intrinsic
facial cues, but rather incorporate contextual information as well,"
wrote BCS graduate student David Cox, BCS technical assistant Ethan
Meyers, and Sinha.
"One of the reasons that reports of such contextual influences on
object-specific neural responses have been lacking in the literature so
far is that researchers have tended to 'simplify' images by presenting
objects in isolation. Using such images precludes consideration of
contextual influences," Cox said.
The findings not only add to scientists' understanding of the brain and
vision, but also "open up some very interesting issues from the
perspective of developmental neuroscience," Sinha said. For example,
how does the brain acquire the ability to use contextual cues? Are we
born with this ability, or is it learned over time? Sinha is exploring
these questions through Project Prakash, a scientific and humanitarian
effort to look at how individuals who are born blind but later gain
some vision perceive objects and faces.
POTENTIAL APPLICATIONS
Computer recognition systems work reasonably well when images are
clear, but they break down catastrophically when images are degraded.
"A human's ability is so far beyond what the computer can do," Meyers
emphasized. "The new work could aid the development of better systems
by changing our concept of the kind of image information useful for
determining what an object is."
There could also be clinical applications. For example, said Sinha,
"contextually evoked neural activity, or the lack of it, could
potentially be used as an early diagnostic marker for specific
neurological conditions like autism, which are believed to be
correlated with impairments in information integration. We hope to
address such questions as part of the Brain Development and Disorders
Project, a collaboration between MIT and Children’s Hospital" in
Boston.
This work was supported by the Athinoula A. Martinos Center for
Biomedical Imaging, the National Center for Research Resources and the
Mental Illness and Neuroscience Discovery (MIND) Institute, as well as
the National Defense Science and Engineering Graduate Fellowship, the
Alfred P. Sloan Foundation and the John Merck Scholars Award.
--END--
More information about the Editors
mailing list