Learning Robust Objective Functions with Application to Face Model Fitting
by M Wimmer, S Pietzsch, F Stulp and B Radig
Abstract:
Model-based image interpretation extracts high-level information from images using a priori knowledge about the object of interest. The computational challenge is to determine the model parameters that best match a given image by searching for the global optimum of the involved objective function. Unfortunately, this function is usually designed manually, based on implicit and domain-dependent knowledge, which prevents the fitting task from yielding accurate results. In this paper, we demonstrate how to improve model fitting by learning objective functions from annotated training images. Our approach automates many critical decisions and the remaining manual steps hardly require domain-dependent knowledge. This yields more robust objective functions that are able to achieve the accurate model fit. Our evaluation uses a publicly available image database and compares the obtained results to a recent state-of-the-art approach.
Reference:
Learning Robust Objective Functions with Application to Face Model Fitting (M Wimmer, S Pietzsch, F Stulp and B Radig), In Proceedings of the 29th DAGM Symposium, volume 1, 2007. 
Bibtex Entry:
@inproceedings{wimmer_learning_2007,
 author = {M Wimmer and S Pietzsch and F Stulp and B Radig},
 title = {Learning Robust Objective Functions with Application to Face Model
	Fitting},
 booktitle = {Proceedings of the 29th {DAGM} Symposium},
 year = {2007},
 volume = {1},
 pages = {486--496},
 address = {Heidelberg, Germany},
 month = {sep},
 abstract = {Model-based image interpretation extracts high-level information from
	images using a priori knowledge about the object of interest. The
	computational challenge is to determine the model parameters that
	best match a given image by searching for the global optimum of the
	involved objective function. Unfortunately, this function is usually
	designed manually, based on implicit and domain-dependent knowledge,
	which prevents the fitting task from yielding accurate results. In
	this paper, we demonstrate how to improve model fitting by learning
	objective functions from annotated training images. Our approach
	automates many critical decisions and the remaining manual steps
	hardly require domain-dependent knowledge. This yields more robust
	objective functions that are able to achieve the accurate model fit.
	Our evaluation uses a publicly available image database and compares
	the obtained results to a recent state-of-the-art approach.},
 keywords = {facial expressions},
}

Analysis of Facial Expressions

As robots emerge from their classical domain, the factory, and enter everyday life, they need abilities beyond those required in manufacturing. They must not only support humans but also socialize with their users, which enhances the interaction experience and allows for social bonding. Recent progress in Computer Vision enables intuitive interaction between humans and technical systems, and current research aims at letting machines use communication channels that are natural to human beings, such as gestures and facial expressions. Humans interpret emotion from video and audio information and rely heavily on it during everyday communication. Knowledge about human behavior, intention, and emotion is therefore necessary to construct convenient human-machine interaction mechanisms. The human face provides much of the information that is passed between humans in everyday communication. Although most of this information is passed on a subconscious level, we still rely on the interaction partner's facial expression to determine emotional state or attention and to predict his or her reaction.

Project details

This project aims at determining facial expressions from camera images in real time. Model-based image interpretation techniques have proven successful at extracting such high-level information from single images and image sequences. We rely on a model-based technique to determine the exact location of facial components such as the eyes or eyebrows in the image. Geometric models form an abstraction of real-world objects and contain knowledge about their properties, such as position, shape, or texture. This representation of the image content facilitates and accelerates the subsequent interpretation task. To extract high-level information, the model parameters that best describe the face within a given image have to be estimated. Correctly estimated model parameters also form the basis of further applications such as gaze detection or gender estimation.
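The parameter estimation described above can be illustrated with a minimal sketch. It is not the fitting procedure used in this project: the five-point shape, the three parameters (translation and scale), the squared-distance objective, and the simple pattern search are all assumptions chosen to keep the example small. It only shows the general idea of searching for the model parameters that minimize an objective function.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]

# Hypothetical mean shape: a few 2D points standing in for facial components.
MEAN_SHAPE: List[Point] = [(-1.0, 1.0), (1.0, 1.0), (0.0, 0.0), (-0.5, -1.0), (0.5, -1.0)]


def project(params: Sequence[float]) -> List[Point]:
    """Place the mean shape in the image using the parameters (tx, ty, scale)."""
    tx, ty, s = params
    return [(tx + s * x, ty + s * y) for x, y in MEAN_SHAPE]


def objective(params: Sequence[float], observed: Sequence[Point]) -> float:
    """Sum of squared distances between model points and observed points."""
    return sum((mx - ox) ** 2 + (my - oy) ** 2
               for (mx, my), (ox, oy) in zip(project(params), observed))


def fit(observed: Sequence[Point], start: Sequence[float],
        step: float = 1.0, tol: float = 1e-6) -> List[float]:
    """Pattern search: try +/- step on each parameter, halve the step when stuck."""
    params = list(start)
    best = objective(params, observed)
    while step > tol:
        improved = False
        for i in range(len(params)):
            for delta in (step, -step):
                candidate = params.copy()
                candidate[i] += delta
                value = objective(candidate, observed)
                if value < best:
                    params, best, improved = candidate, value, True
        if not improved:
            step /= 2.0
    return params


# Synthetic "image observations" generated from known parameters.
observed_points = project([3.0, -2.0, 1.5])
print(fit(observed_points, start=[0.0, 0.0, 1.0]))
```

In this toy setting the objective is a smooth bowl with a single minimum, so a local search suffices. Objective functions computed from real image data are typically far less well behaved, which is why their design matters for the fitting result.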

Our demonstrator for facial expression recognition has been presented at several events with political audiences and on TV. The face is detected and a 3D face model is fitted in real time to extract the facial expression currently visible. We integrate the publicly available Candide-III face model and also rely on publicly available databases to train and evaluate classifiers for facial expression recognition. This makes our approach comparable with the work of other research groups. Ekman and Friesen identified six universal facial expressions that are expressed and interpreted in the same way all over the world, independent of cultural background, age, or country of origin. The Facial Action Coding System (FACS) precisely describes the muscle activity within a human face that appears during the display of facial expressions. The Candide-III face model integrates FACS in its model parameters.
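As a rough illustration of the classification step, the sketch below assigns an expression label to a vector of fitted model parameters with a nearest-centroid rule. The classifier type, the three parameter names, and all numbers are assumptions made up for this example; the source does not state which classifiers or features the demonstrator uses.

```python
from collections import defaultdict
from math import dist
from typing import Dict, List, Sequence, Tuple

Sample = Tuple[Sequence[float], str]


def train_centroids(samples: Sequence[Sample]) -> Dict[str, List[float]]:
    """Average the parameter vectors of each expression label."""
    grouped: Dict[str, List[Sequence[float]]] = defaultdict(list)
    for vector, label in samples:
        grouped[label].append(vector)
    return {label: [sum(column) / len(vectors) for column in zip(*vectors)]
            for label, vectors in grouped.items()}


def classify(vector: Sequence[float], centroids: Dict[str, List[float]]) -> str:
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: dist(vector, centroids[label]))


# Made-up training data: (jaw drop, lip corner pull, brow raise) -> label.
# These parameter names are hypothetical stand-ins for FACS-style model parameters.
TRAINING: List[Sample] = [
    ((0.1, 0.9, 0.2), "happiness"),
    ((0.2, 0.8, 0.1), "happiness"),
    ((0.9, 0.1, 0.9), "surprise"),
    ((0.8, 0.2, 0.8), "surprise"),
    ((0.1, 0.1, 0.0), "sadness"),
    ((0.0, 0.2, 0.1), "sadness"),
]

centroids = train_centroids(TRAINING)
print(classify((0.85, 0.15, 0.9), centroids))
```

A real system would train on annotated image databases and would likely use a stronger classifier. The point here is only that expression recognition operates on the fitted model parameters rather than on raw pixels.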

Evidence suggests that feeling empathy for others is connected to the mirror neuron system, and that emotional empathy, which is triggered by deriving the emotional state from facial expressions, involves neural activity in the thalamus and in the cortical areas responsible for the face. Perception and display of facial expressions form a closed loop in human-human communication: perceiving the interaction partner's facial expression influences the display of one's own. To study this loop at the human-machine interface as well, we integrate our demonstrator into the Multi-Joint Action Scenario in the CoTeSys Central Robotics Lab. It is combined with the robot head EDDIE, provided by the Institute of Automatic Control Engineering, to form a closed-loop human-machine interaction scenario based on facial expression analysis and synthesis. In its current, preliminary state, the robot merely mirrors the facial expression, but future plans involve integrating a more complex emotional model on the robotic side.