Invariant object recognition is maybe the most basic and fundamental property of our visual system. It is the basis of many other cognitive tasks, like motor actions and social interactions. Hence, the theoretical understanding and modeling of invariant object recognition is one of the central problems in computational neuroscience.
Indeed, object recognition consists of two different tasks: classification and identification.
The focus of this thesis is on object identification under the basic geometrical
transformations shift, scaling, and rotation. The visual system can
perform shift, size, and rotation invariant object identification.
This thesis consists of two parts. In the first part, we present and investigate the VisNet model proposed by Rolls. The generalization problems of VisNet triggered our development of a new neural network model for invariant object identification. Starting point for an improved generalization behavior is the search for an operation that extracts images features that are invariant under shifts, rotations, and scalings. Extracting invariant features guarantees that an object seen once in a specific pose can be identified in any pose.
We present and investigate our model in the second part of this thesis.
Identifer | oai:union.ndltd.org:DRESDEN/oai:qucosa:de:qucosa:11065 |
Date | 28 October 2010 |
Creators | Wilhelm, Hedwig |
Contributors | Jost, Juergen, Wiskott, Laurenz, Universität Leipzig |
Source Sets | Hochschulschriftenserver (HSSS) der SLUB Dresden |
Language | English |
Detected Language | English |
Type | doc-type:doctoralThesis, info:eu-repo/semantics/doctoralThesis, doc-type:Text |
Rights | info:eu-repo/semantics/openAccess |
Page generated in 0.0023 seconds