Object recognition in the visual cortex is based on a hierarchical architecture, in which specialized brain regions along the ventral pathway extract object features of increasing levels of complexity, accompanied by greater invariance in stimulus size, position, and orientation. Recent theoretical studies postulate a non-linear pooling function, such as the maximum (MAX) operation could be fundamental in achieving such invariance. In this paper, we are concerned with neurally plausible mechanisms that may be involved in realizing the MAX operation. Four canonical circuits are proposed, each based on neural mechanisms that have been previously discussed in the context of cortical processing. Through simulations and mathematical analysis, we examine the relative performance and robustness of these mechanisms. We derive experimentally verifiable predictions for each circuit and discuss their respective physiological considerations.
Identifer | oai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/7240 |
Date | 01 September 2001 |
Creators | Yu, Angela J., Giese, Martin A., Poggio, Tomaso A. |
Source Sets | M.I.T. Theses and Dissertation |
Language | en_US |
Detected Language | English |
Format | 28 p., 2197042 bytes, 930880 bytes, application/postscript, application/pdf |
Relation | AIM-2001-022, CBCL-207 |
Page generated in 0.0033 seconds