Journal Article10.48550/arXiv.2203.14466
Multi-model Ensemble Learning Method for Human Expression Recognition
7
TL;DR: The multi-fold ensemble method is introduced to train and ensemble several models with the same architecture but different data distributions to enhance the performance of the solution based on the ensemble learning method.
read more
Abstract: Analysis of human affect plays a vital role in human-computer interaction (HCI) systems. Due to the difficulty in capturing large amounts of real-life data, most of the current methods have mainly focused on controlled environments, which limit their application scenarios. To tackle this problem, we propose our solution based on the ensemble learning method. Specifically, we formulate the problem as a classification task, and then train several expression classification models with different types of backbones--ResNet, EfficientNet and InceptionNet. After that, the outputs of several models are fused via model ensemble method to predict the final results. Moreover, we introduce the multi-fold ensemble method to train and ensemble several models with the same architecture but different data distributions to enhance the performance of our solution. We conduct many experiments on the AffWild2 dataset of the ABAW2022 Challenge, and the results demonstrate the effectiveness of our solution.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
ABAW: Learning from Synthetic Data & Multi-Task Learning Challenges
Dimitrios Kollias
- 03 Jul 2022
TL;DR: This paper presents the two Challenges, along with the utilized corpora, then the evaluation metrics and finally the baseline systems per Challenge, as well as their derived results.
91
The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition
Dimitrios Kollias,Panagiotis Tzirakis,Alan Cowen,Stefanos Zafeiriou,I. Kotsia,Alice Baird,Chris Gagne,Chunchang Shao,Guanyu Hu +8 more
TL;DR: The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition addresses human emotion and behavior understanding, comprising five sub-challenges: valence-arousal estimation, expression recognition, action unit detection, compound expression recognition, and emotional mimicry intensity estimation.
33
7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition
Dimitrios Kollias,Stefanos Zafeiriou,Irene Kotsia,Abhinav Dhall,Shreya Ghosh,Chunchang Shao,Guanyu Hu +6 more
- 04 Jul 2024
TL;DR: The 7th ABAW Competition addresses human expression and behavior understanding, comprising two sub-challenges: Multi-Task Learning and Compound Expression Recognition, utilizing s-Aff-Wild2 and C-EXPR-DB datasets for evaluation.
An Assessment of In-the-Wild Datasets for Multimodal Emotion Recognition
Ana Aguilera,Diego Mellado +1 more
TL;DR: In this article , a set of in-the-wild datasets are evaluated to show their strengths and weaknesses for multimodal emotion recognition, and they recommend a combination of multiple datasets in order to obtain better results when new samples are being processed and a good balance in the number of samples by class.
AFFDEX 2.0: A Real-Time Facial Expression Analysis Toolkit
Mina Bishay,Kenneth Preston,Matthew Strafuss,Graham Page,Jay Turcot,Mohammad Mavadati +5 more
- 24 Feb 2022
TL;DR: AFFDEX 2.0 is an enhanced version of the previous toolkit, capable of tracking efficiently faces at more challenging conditions, detecting more accurately facial expressions, and recognizing new emotional states (sentimentality and confusion).
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Posted Content
Deep Residual Learning for Image Recognition
TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.
117.9K
Going deeper with convolutions
Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermanet,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich +8 more
- 07 Jun 2015
TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
•Journal Article
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky,Jia Deng,Hao Su,Jonathan Krause,Sanjeev Satheesh,Sean Ma,Zhiheng Huang,Andrej Karpathy,Michael S. Bernstein,Li Fei-Fei,Alexander C. Berg,Aditya Khosla +11 more
TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) has been running annually for five years (since 2010) and has become the standard benchmark for large-scale object recognition.
23.9K
Focal Loss for Dense Object Detection
Tsung-Yi Lin,Priya Goyal,Ross Girshick,Kaiming He,Piotr Dollár +4 more
- 07 Aug 2017
TL;DR: This paper proposes to address the extreme foreground-background class imbalance encountered during training of dense detectors by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples, and develops a novel Focal Loss, which focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training.