
Towards trainable man-machine interfaces : combining top-down constraints with bottom-up learning in facial analysis / Towards man-machine interfaces : combining top-down constraints with bottom-up learning

Thesis (Ph.D. in Computational Cognitive Science)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2002. / Includes bibliographical references (leaves 72-[77]). / This thesis proposes a methodology for the design of man-machine interfaces by combining top-down and bottom-up processes in vision. From a computational perspective, we propose that the scientific-cognitive question of combining top-down and bottom-up knowledge is similar to the engineering question of labeling a training set in a supervised learning problem. We investigate these questions in the realm of facial analysis. We propose the use of a linear morphable model (LMM) for representing top-down structure and use it to model various facial variations such as mouth shapes and expression, the pose of faces, and visual speech (visemes). We apply a supervised learning method based on support vector machine (SVM) regression for estimating the parameters of LMMs directly from pixel-based representations of faces. We combine these methods for designing new, more self-contained systems for recognizing facial expressions, estimating facial pose, and recognizing visemes. / by Vinay P. Kumar.

Identifier oai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/29243
Date January 2002
Creators Kumar, Vinay P. (Vinay Prasanna), 1972-
Contributors Tomaso Poggio, Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences.
Publisher Massachusetts Institute of Technology
Source Sets M.I.T. Theses and Dissertation
Language English
Detected Language English
Type Thesis
Format72, [5] leaves, 3717108 bytes, 3716915 bytes, application/pdf, application/pdf, application/pdf
Rights M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission., http://dspace.mit.edu/handle/1721.1/7582
