Proceedings Article10.1109/iciccs56967.2023.10142747
Enhancing Human Behaviour Analysis through Multi-Embedded Learning for Emotion Recognition in Images
17 May 2023
12
TL;DR: In this paper , the authors proposed a multi-embedded learning approach to enhance human behaviour analysis by combining multiple models for emotion recognition in images using stacking, which showed that stacking improved the performance of emotion recognition compared to individual base models.
read more
Abstract: Human behaviour analysis has been an active area of research in computer vision and artificial intelligence. The recognition of emotions in images has been a difficult task, as emotions are subjective and can be expressed in many different ways. Multi-Embedded Learning has been proposed as a promising approach to tackle this problem by combining the outputs of multiple models. In this study, we aimed to enhance human behaviour analysis through Multi-Embedded Learning for emotion recognition in images using stacking. The objective of this study was to enhance human behaviour analysis by combining multiple models for emotion recognition in images. The study aimed to demonstrate the benefits of stacking, a specific ensemble learning algorithm, in improving the performance of emotion recognition. Multiple base models were trained using different architectures and techniques, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and traditional machine learning algorithms like Support Vector Machines (SVMs) and k-Nearest Neighbors (k-NNs). The outputs of these base models were used as inputs to a meta-model, which was a Convolutional Neural Network. The dataset utilised in this study was the AffectNet dataset, a large collection of images depicting facial expressions. The dataset consists of over 400,000 images of faces labeled with one of the seven emotions mentioned above. To facilitate model training and evaluation, the dataset was partitioned into separate subsets for training, validation, and testing. The results showed that stacking improved the performance of emotion recognition in images compared to the individual base models. The accuracy of the stacked model was 85%, which was greater than the accuracy of any of the basic models.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
EDSR: Empowering super-resolution algorithms with high-quality DIV2K images
Jenefa A,Bessy M. Kuriakose,Edward Naveen V,Lincy A +3 more
TL;DR: The empirical findings validate that EDSR is highly efficient in enhancing image quality and preserving fine details and empowers super-resolution algorithms by leveraging high-quality DIV2K images.
10
Gender Classification from Fingerprint Using Hybrid CNN-SVM
Vidhya Keren T,Serin J,Mary Ivy Deepa I S,V.Ebenezer,A.Jenefa +4 more
TL;DR: A CNN-SVM hybrid framework for gender classification from fingerprints is proposed, where preprocessing, feature extraction, and classification are the three main components.
5
PQC Secure: Strategies for Defending Against Quantum Threats
A. Jenefa,F. T. Josh,Antony Taurshia,K. R. Kumar,S. Kowsega,Edward Naveen +5 more
- 11 Dec 2023
TL;DR: Evaluating PQC approaches, encompassing lattice-based, code-based, and isogeny-based cryptography, with assessments based on metrics like encryption duration, key length, and key length, underscores PQC’s potential in delivering robust security, albeit with variations in performance metrics, guiding secure communication choices.
3
ABM-OCD: Advancing ovarian cancer diagnosis with attention-based models and 3D CNNs
A. Jenefa,Naveen V. Edward,Veemaraj Ebenezer,A. Lincy +3 more
TL;DR: This research seeks to enhance the accuracy and efficiency of ovarian cancer diagnosis, particularly in distinguishing between serous, mucinous, and endometrioid subtypes by introducing Attention-Based Models (ABMs) in combination with 3D Convolutional Neural Networks (CNNs).
3
Utilizing RL and Web-Enhanced Commuting for Traffic Congestion Mitigation and Public Transportation Enhancement
Jenefa A,Jerusha Miraclin Dulcie B,Carolin Joanna Sheryl K,Esswari S,Sandra Jeslena W,Bessy M. Kuriakose +5 more
- 22 Nov 2023
TL;DR: This integration of cutting-edge technology and commuter-focused strategies presents an avenue for reshaping urban mobility paradigms, mitigating traffic congestion, and optimizing public transportation systems, thereby paving the way for a more sustainable urban future.
2
References
Deep Learning vs. Traditional Computer Vision
Niall O'Mahony,Sean Campbell,Anderson Carvalho,Suman Harapanahalli,Gustavo Velasco Hernandez,Lenka Krpalkova,Daniel Riordan,Joseph Walsh +7 more
- 25 Apr 2019
TL;DR: The aim of this paper is to promote a discussion on whether knowledge of classical computer vision techniques should be maintained and how the two sides of computer vision can be combined.
823
Emotion recognition using multi-modal data and machine learning techniques: A tutorial and review
TL;DR: The emotion recognition methods based on multi-channel EEG signals as well as multi-modal physiological signals are reviewed and the correlation between different brain areas and emotions is discussed.
588
Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis
Zheng Zhang,Jeffrey M. Girard,Yue Wu,Xing Zhang,Peng Liu,Umur Aybars Ciftci,Shaun Canavan,Michael Reale,Andrew Horowitz,Huiyuan Yang,Jeffrey F. Cohn,Qiang Ji,Lijun Yin +12 more
- 27 Jun 2016
TL;DR: A well-annotated, multimodal, multidimensional spontaneous emotion corpus of 140 participants, which includes derived features from 3D, 2D, and IR (infrared) sensors and baseline results for facial expression and action unit detection is presented.
A Natural Visible and Infrared Facial Expression Database for Expression Recognition and Emotion Inference
TL;DR: A natural visible and infrared facial expression database, which contains both spontaneous and posed expressions of more than 100 subjects, recorded simultaneously by a visible and an infrared thermal camera, with illumination provided from three different directions is proposed.
433
A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets.
TL;DR: In this paper, the authors summarize the current literature on deep multimodal learning and provide insights and directions for future research, and present a collection of benchmark datasets for solving problems in various vision domains.