A Survey of Affect Recognition Methods: Audio, Visual, and Spontaneous Expressions
2.5K
TL;DR: In this paper, the authors discuss human emotion perception from a psychological perspective, examine available approaches to solving the problem of machine understanding of human affective behavior, and discuss important issues like the collection and availability of training and test data.
read more
Abstract: Automated analysis of human affective behavior has attracted increasing attention from researchers in psychology, computer science, linguistics, neuroscience, and related disciplines. However, the existing methods typically handle only deliberately displayed and exaggerated expressions of prototypical emotions despite the fact that deliberate behaviour differs in visual appearance, audio profile, and timing from spontaneously occurring behaviour. To address this problem, efforts to develop algorithms that can process naturally occurring human affective behaviour have recently emerged. Moreover, an increasing number of efforts are reported toward multimodal fusion for human affect analysis including audiovisual fusion, linguistic and paralinguistic fusion, and multi-cue visual fusion based on facial expressions, head movements, and body gestures. This paper introduces and surveys these recent advances. We first discuss human emotion perception from a psychological perspective. Next we examine available approaches to solving the problem of machine understanding of human affective behavior, and discuss important issues like the collection and availability of training and test data. We finally outline some of the scientific and engineering challenges to advancing human affect sensing technology.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Short Papers Spatiotemporal Directional Number Transitional Graph for Dynamic Texture Recognition
Oksam Chae
- 01 Jan 2015
TL;DR: In this article, a spatiotemporal directional number transitional graph (DNG) descriptor was proposed to capture the direction of natural flow in the temporal domain, and the transition of such directions between frames.
51
A three-component framework for empathic technologies to augment human interaction
TL;DR: A three component framework is presented based on psychology and neuroscience, consisting of cognitive empathy, emotional convergence, and empathic responding, which can be situated in affective computing and social signal processing and pose different opportunities for empathic technologies.
Face Expression Recognition by Cross Modal Data Association
Ashish Tawari,Mohan M. Trivedi +1 more
TL;DR: The framework can improve the recognition performance while significantly reducing the computational cost by avoiding redundant or insignificant frame processing by incorporating auditory information and design a single good image representation of image sequence by weighted sums of registered face images where the weights are derived using auditory features.
51
An efficient unconstrained facial expression recognition algorithm based on Stack Binarized Auto-encoders and Binarized Neural Networks
Wenyun Sun,Haitao Zhao,Zhong Jin +2 more
TL;DR: BAEs and Stacked Binarized Auto-encoders are proposed to learn a kind of domain knowledge from a large-scale unlabeled facial dataset to improve the performance of the BNNs on Static Facial Expressions in the Wild benchmark.
50
Automatic facial expression recognition based on spatiotemporal descriptors
Yi Ji,Khalid Idrissi +1 more
TL;DR: Two new textons are proposed, VTB and moments on spatiotemporal plane, to describe the transformation of human face during facial expressions, which aim at catching both general shape changes and motion texture details.
50
References
Development and validation of brief measures of positive and negative affect: The PANAS scales.
TL;DR: Two 10-item mood scales that comprise the Positive and Negative Affect Schedule (PANAS) are developed and are shown to be highly internally consistent, largely uncorrelated, and stable at appropriate levels over a 2-month time period.
•Book
Affective Computing
Rosalind W. Picard
- 01 Jan 1997
TL;DR: Key issues in affective computing, " computing that relates to, arises from, or influences emotions", are presented and new applications are presented for computer-assisted learning, perceptual information retrieval, arts and entertainment, and human health and interaction.
5.7K
Combining Pattern Classifiers
Ludmila I. Kuncheva
- 02 Jul 2004
TL;DR: This combining pattern classifiers methods and algorithms helps people to enjoy a good book with a cup of coffee in the afternoon, instead they cope with some harmful virus inside their computer.
3K
Comprehensive database for facial expression analysis
Takeo Kanade,Jeffrey F. Cohn,Yingli Tian +2 more
- 26 Mar 2000
TL;DR: The problem space for facial expression analysis is described, which includes level of description, transitions among expressions, eliciting conditions, reliability and validity of training and test data, individual differences in subjects, head orientation and scene complexity image characteristics, and relation to non-verbal behavior.