Resource data
Towards Man-Machine Interfaces: Combining Top-down Constraints with Bottom-up Learning in Facial Analysis
Kumar, Vinay P.
Location:
AITR-2002-008
CBCL-221
http://hdl.handle.net/1721.1/5569
This thesis proposes a methodology for the design of man-machine interfaces by combining top-down and bottom-up processes in vision. From a computational perspective, we propose that the scientific-cognitive question of combining top-down and bottom-up knowledge is similar to the engineering question of labeling a training set in a supervised learning problem. We investigate these questions in the realm of facial analysis. We propose the use of a linear morphable model (LMM) for representing top-down structure and use it to model various facial variations such as mouth shapes and expression, the pose of faces and visual speech (visemes). We apply a supervised learning method based on support vector machine (SVM) regression for estimating the parameters of LMMs directly from pixel-based representations of faces. We combine these methods for designing new, more self-contained systems for recognizing facial expressions, estimating facial pose and for recognizing visemes.
Belongs to: DSpace at MIT
Descargar SCORM
¡Sea el primero en solicitar este recurso!
Para poder solicitar este recurso debe identificarse como usuario de la biblioteca
Users rating
No hay ninguna valoración para este recurso. Sea el primero en
valorar este recurso.
Detalles del recurso
|
Towards Man-Machine Interfaces: Combining Top-down Constraints with Bottom-up Learning in Facial Analysis
|
| Id. |
25090 |
| Idioma |
inglés (Estados Unidos)
|
| Titulo |
Towards Man-Machine Interfaces: Combining Top-down Constraints with Bottom-up Learning in Facial Analysis |
| Autor(es) |
Kumar, Vinay P. |
| Location |
AITR-2002-008
CBCL-221
http://hdl.handle.net/1721.1/5569
|
| Versión |
1.0 |
| Estado |
Final
|
| Descripción |
This thesis proposes a methodology for the design of man-machine interfaces by combining top-down and bottom-up processes in vision. From a computational perspective, we propose that the scientific-cognitive question of combining top-down and bottom-up knowledge is similar to the engineering question of labeling a training set in a supervised learning problem. We investigate these questions in the realm of facial analysis. We propose the use of a linear morphable model (LMM) for representing top-down structure and use it to model various facial variations such as mouth shapes and expression, the pose of faces and visual speech (visemes). We apply a supervised learning method based on support vector machine (SVM) regression for estimating the parameters of LMMs directly from pixel-based representations of faces. We combine these methods for designing new, more self-contained systems for recognizing facial expressions, estimating facial pose and for recognizing visemes. |
| Tipo |
68 p. 21293042 bytes 2473001 bytes application/postscript application/pdf |
| Palabras clave |
AI |
| Tipo de Interactividad |
Expositivo
|
| Nivel de Interactividad |
muy bajo
|
| Audiencia |
Estudiante
Profesor
Autor
|
| Estructura |
Atomic |
| Coste |
no
|
| Copyright |
sí
|
| Formatos |
68 p. 21293042 bytes 2473001 bytes application/postscript application/pdf |
| Requerimientos técnicos |
Browser: Any |
| Relación |
[References] AITR-2002-008
[References] CBCL-221
|
| Fecha de contribución |
07-may-2008 |
| Contacto |
|
|