Enhancing Human Behaviour Analysis through Multi-Embedded Learning for Emotion Recognition in Images

doi:10.1109/iciccs56967.2023.10142747

Proceedings Article•10.1109/iciccs56967.2023.10142747

17 May 2023

12

TL;DR: The authors propose a multi-embedded learning approach that enhances human behaviour analysis by combining multiple emotion recognition models through stacking, and show that stacking improves emotion recognition in images compared to the individual base models.

Abstract: Human behaviour analysis is an active area of research in computer vision and artificial intelligence. Recognising emotions in images is difficult because emotions are subjective and can be expressed in many different ways. Multi-Embedded Learning has been proposed as a promising approach to this problem, combining the outputs of multiple models. This study aimed to enhance human behaviour analysis by applying Multi-Embedded Learning with stacking, a specific ensemble learning algorithm, to emotion recognition in images, and to demonstrate the benefit of stacking for recognition performance. Multiple base models were trained using different architectures and techniques, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and traditional machine learning algorithms like Support Vector Machines (SVMs) and k-Nearest Neighbors (k-NN). The outputs of these base models were used as inputs to a meta-model, which was a Convolutional Neural Network. The dataset utilised in this study was AffectNet, a large collection of over 400,000 face images, each labelled with one of seven emotions. To facilitate model training and evaluation, the dataset was partitioned into separate training, validation, and test subsets. The results showed that stacking improved the performance of emotion recognition in images compared to the individual base models. The stacked model reached an accuracy of 85%, higher than that of any individual base model.
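The stacking idea described in the abstract (base-model outputs become the inputs of a meta-model) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the two base models are hypothetical toy functions, the meta-model is a nearest-centroid classifier standing in for the CNN meta-model mentioned in the abstract, and the data is a made-up two-class toy set rather than AffectNet.

```python
# Minimal stacking sketch (standard library only). All models here are toy
# stand-ins, not the CNN/RNN/SVM/k-NN base models or the CNN meta-model
# described in the abstract.
from collections import defaultdict

def base_threshold(x):
    """Hypothetical base model 1: class scores from the first feature only."""
    p = 1.0 if x[0] > 0.5 else 0.0
    return [1.0 - p, p]

def base_mean(x):
    """Hypothetical base model 2: class scores from the mean of the features."""
    p = min(max(sum(x) / len(x), 0.0), 1.0)
    return [1.0 - p, p]

BASE_MODELS = [base_threshold, base_mean]

def meta_features(x):
    """Concatenate the outputs of every base model into one vector."""
    feats = []
    for model in BASE_MODELS:
        feats.extend(model(x))
    return feats

def fit_meta(X, y):
    """Train a nearest-centroid meta-model on the stacked base outputs."""
    sums, counts = {}, defaultdict(int)
    for x, label in zip(X, y):
        f = meta_features(x)
        acc = sums.setdefault(label, [0.0] * len(f))
        for i, v in enumerate(f):
            acc[i] += v
        counts[label] += 1
    return {lab: [v / counts[lab] for v in acc] for lab, acc in sums.items()}

def predict(centroids, x):
    """Return the label whose centroid is closest in meta-feature space."""
    f = meta_features(x)

    def dist(c):
        return sum((a - b) ** 2 for a, b in zip(f, c))

    return min(centroids, key=lambda lab: dist(centroids[lab]))

# Toy two-class data. The meta-model is usually fit on held-out (validation)
# predictions so that it does not inherit base-model overfitting.
X_val = [[0.1, 0.2], [0.2, 0.1], [0.9, 0.8], [0.8, 0.9]]
y_val = [0, 0, 1, 1]
centroids = fit_meta(X_val, y_val)
print(predict(centroids, [0.15, 0.1]), predict(centroids, [0.95, 0.7]))
```

In a realistic setting each base model would emit a probability vector over the emotion classes, and the meta-model would be trained on those concatenated vectors from the validation split before being evaluated on the test split.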

Citations

Journal Article•10.3233/idt-230218

EDSR: Empowering super-resolution algorithms with high-quality DIV2K images

Jenefa A, +3 more

- 12 Sep 2023

- Intelligent Decision Technologies

TL;DR: The empirical findings validate that EDSR is highly efficient in enhancing image quality and preserving fine details and empowers super-resolution algorithms by leveraging high-quality DIV2K images.

10

Journal Article•10.37965/jait.2023.0192

Gender Classification from Fingerprint Using Hybrid CNN-SVM

Vidhya Keren T, +4 more

- 02 Aug 2023

- Journal of artificial intelligence and t...

TL;DR: A CNN-SVM hybrid framework for gender classification from fingerprints is proposed, where preprocessing, feature extraction, and classification are the three main components.

5

Proceedings Article•10.1109/icacrs58579.2023.10404525

PQC Secure: Strategies for Defending Against Quantum Threats

A. Jenefa, +5 more

- 11 Dec 2023

TL;DR: An evaluation of PQC approaches, encompassing lattice-based, code-based, and isogeny-based cryptography, on metrics such as encryption duration and key length underscores PQC’s potential to deliver robust security, albeit with variations in performance, guiding secure communication choices.

3

Journal Article•10.5935/jetia.v9i43.904

ABM-OCD: Advancing ovarian cancer diagnosis with attention-based models and 3D CNNs

A. Jenefa, +3 more

- Journal of Engineering and Technology fo...

TL;DR: This research seeks to enhance the accuracy and efficiency of ovarian cancer diagnosis, particularly in distinguishing between serous, mucinous, and endometrioid subtypes by introducing Attention-Based Models (ABMs) in combination with 3D Convolutional Neural Networks (CNNs).

3

Proceedings Article•10.1109/iceca58529.2023.10395723

Utilizing RL and Web-Enhanced Commuting for Traffic Congestion Mitigation and Public Transportation Enhancement

Jenefa A, +5 more

- 22 Nov 2023

TL;DR: This integration of cutting-edge technology and commuter-focused strategies presents an avenue for reshaping urban mobility paradigms, mitigating traffic congestion, and optimizing public transportation systems, thereby paving the way for a more sustainable urban future.

2

References

Book Chapter•10.1007/978-3-030-17795-9_10

Deep Learning vs. Traditional Computer Vision

Niall O'Mahony, +7 more

- 25 Apr 2019

TL;DR: The aim of this paper is to promote a discussion on whether knowledge of classical computer vision techniques should be maintained and how the two sides of computer vision can be combined.

823

Journal Article•10.1016/J.INFFUS.2020.01.011

Emotion recognition using multi-modal data and machine learning techniques: A tutorial and review

Jianhua Zhang, +3 more

- 01 Jul 2020

- Information Fusion

TL;DR: The emotion recognition methods based on multi-channel EEG signals as well as multi-modal physiological signals are reviewed and the correlation between different brain areas and emotions is discussed.

588

Proceedings Article•10.1109/CVPR.2016.374

Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis

Zheng Zhang, +12 more

- 27 Jun 2016

TL;DR: A well-annotated, multimodal, multidimensional spontaneous emotion corpus of 140 participants, which includes derived features from 3D, 2D, and IR (infrared) sensors and baseline results for facial expression and action unit detection is presented.

489

Journal Article•10.1109/TMM.2010.2060716

A Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference

Shangfei Wang, +7 more

- 01 Nov 2010

- IEEE Transactions on Multimedia

TL;DR: A natural visible and infrared facial expression database, which contains both spontaneous and posed expressions of more than 100 subjects, recorded simultaneously by a visible and an infrared thermal camera, with illumination provided from three different directions is proposed.

433

Journal Article•10.1007/S00371-021-02166-7

A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets.

Khaled Bayoudh, +3 more

- 10 Jun 2021

- The Visual Computer

TL;DR: In this paper, the authors summarize the current literature on deep multimodal learning and provide insights and directions for future research, and present a collection of benchmark datasets for solving problems in various vision domains.

214