
Visual features for scene recognition and reorientation

Thesis (Ph. D. in Cognitive Science)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2013. / Cataloged from PDF version of thesis. / Includes bibliographical references (p. 75-78). / In this thesis, I investigate how scenes are represented by the human visual system and how observers use visual information to reorient themselves within a space. Scenes, like objects, are three-dimensional spaces that are experienced through two-dimensional views and must be recognized from many different angles. Just as people show a preference for canonical views of objects, which best show the object's surfaces and shape, people also show a preference for canonical views of scenes, which show as much of the surrounding scene layout as possible. Unlike objects, scenes are spaces that envelop the observer, and thus a large portion of scene processing must take place in peripheral vision. People are able to perform many scene perception tasks, such as determining whether a scene contains an animal, quickly and easily in peripheral vision. This is somewhat surprising because many perceptual tasks with simpler stimuli, such as spotting a randomly rotated T among randomly rotated Ls, are not easily performed in the periphery and seem to require focal attention. However, a statistical summary model of peripheral vision, which assumes that the visual system sees a crowded, texture-like representation of the world in the periphery, predicts human performance on scene perception tasks as well as on peripheral tasks with letter stimuli. This peripheral visual representation of a scene may actually be critical for an observer to understand the spatial geometry of their environment. People's ability to reorient by the shape of an environment is impaired when they explore the space with central vision alone, but not when they explore the space with only peripheral vision. This result suggests that peripheral vision is well designed for navigation: the representation in peripheral vision is compressed, but this compression preserves the scene layout information that is needed for understanding the three-dimensional geometry of a space. / by Krista Anne Ehinger. / Ph.D. in Cognitive Science

Identifier: oai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/81578
Date: January 2013
Creators: Ehinger, Krista Anne
Contributors: Ruth Rosenholtz; Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences.
Publisher: Massachusetts Institute of Technology
Source Sets: M.I.T. Theses and Dissertation
Language: English
Detected Language: English
Type: Thesis
Format: 78 p., application/pdf
Rights: M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission: http://dspace.mit.edu/handle/1721.1/7582
