Proceedings Article
Sentiment Knowledge Enhanced Self-supervised Learning for Multimodal Sentiment Analysis
Fan Qian,Jiqing Han,Yongjun He,Tieran Zheng,Guibin Zheng +4 more
pp 12966-12978
TL;DR: Sentiment Knowledge Enhanced Self-supervised Learning (SKESL) is proposed to capture common sentimental patterns in unlabeled videos, which facilitates further learning on limited labeled data and achieves new State-Of-The-Art (SOTA) results.
read more
Abstract: Multimodal Sentiment Analysis (MSA) has made great progress that benefits from extraordinary fusion scheme. However, there is a lack of labeled data, resulting in severe over-fitting and poor generalization for supervised models applied in this field. In this paper, we propose Sentiment Knowledge Enhanced Self-supervised Learning (SKESL) to capture common sentimental patterns in unlabeled videos, which facilitates further learning on limited labeled data. Specifically, with the help of sentiment knowledge and non-verbal behavior, SKESL conducts sentiment word masking and predicts fine-grained word sentiment intensity, so as to embed sentiment information at the word level into pre-trained multimodal representation. In addition, a non-verbal injection method is also proposed to integrate non-verbal information into the word semantics. Experiments on two standard benchmarks of MSA clearly show that SKESL significantly outperforms the baseline, and achieves new State-Of-The-Art (SOTA) results.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Generalizing sentiment analysis: a review of progress, challenges, and emerging directions
Khaled Alahmadi,Sultan Alharbi,Juan Chen,Xianzhi Wang +3 more
4
Capturing High-Level Semantic Correlations via Graph for Multimodal Sentiment Analysis
Fan Qian,Jiqing Han,Yadong Guan,Wenjie Song,Yongjun He +4 more
TL;DR: C capsule networks are introduced to construct high-level semantic nodes in a graph, uncovering deep sentimental structures in multimodal sentiment analysis and the learnable adjacency matrices are employed to construct edges of graph, thus adaptively learning the relations between nodes.
4
Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective
Guimin Hu,Yi Xin,Weimin Lyu,Haojian Huang,Chang Sun,Zhihong Zhu,Lin Gui,Ruichu Cai +7 more
- 11 Sep 2024
TL;DR: This survey presents recent trends in multimodal affective computing from an NLP perspective, covering four tasks, formalizing tasks, and discussing technical approaches, challenges, and future directions in analyzing human behaviors and intentions through text and multimodal data.
A Multi-Grained Perception Model for Sentiment Analysis with Perceived Contrastive Focal Loss
Jin Wei,Jiajie Lin,Zhenguo Yang,Haoran Xie,Fuqiang Yu,Xiaoping Li +5 more
- 30 Jun 2025
TL;DR: This study proposes MGSA1, a multi-grained perception model for multimodal sentiment analysis, addressing imbalanced modality discrimination and sample distributions with a novel MCP module and Perceived Contrastive Focal loss, outperforming baselines on MOSI and MOSEI datasets.
AMTN: Attention-Enhanced Multimodal Temporal Network for Humor Detection
Yangyang Xu,Peng Zou,Rui Wang,Qi Li,Chengpeng Xu,Zhuoer Zhao,Xun Yang,Xiao Sun,Meng Wang +8 more
- 28 Oct 2024
References
•Posted Content
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
81.7K
•Posted Content
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke,Sam Gross,Francisco Massa,Adam Lerer,James Bradbury,Gregory Chanan,Trevor Killeen,Zeming Lin,Natalia Gimelshein,Luca Antiga,Alban Desmaison,Andreas Kopf,Edward Z. Yang,Zachary DeVito,Martin Raison,Alykhan Tejani,Sasank Chilamkurthy,Benoit Steiner,Lu Fang,Junjie Bai,Soumith Chintala +20 more
TL;DR: PyTorch as discussed by the authors is a machine learning library that provides an imperative and Pythonic programming style that makes debugging easy and is consistent with other popular scientific computing libraries, while remaining efficient and supporting hardware accelerators such as GPUs.
25.9K
•Posted Content
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
TL;DR: This work proposes a method to pre-train a smaller general-purpose language representation model, called DistilBERT, which can be fine-tuned with good performances on a wide range of tasks like its larger counterparts, and introduces a triple loss combining language modeling, distillation and cosine-distance losses.
7.3K
librosa: Audio and Music Signal Analysis in Python
Brian McFee,Colin Raffel,Dawen Liang,Daniel P. W. Ellis,Matt McVicar,Eric Battenberg,Oriol Nieto +6 more
- 01 Jan 2015
TL;DR: A brief overview of the librosa library's functionality is provided, along with explanations of the design goals, software development practices, and notational conventions.
ERNIE: Enhanced Language Representation with Informative Entities
Zhengyan Zhang,Xu Han,Zhiyuan Liu,Xin Jiang,Maosong Sun,Qun Liu +5 more
- 17 May 2019
TL;DR: This paper utilizes both large-scale textual corpora and KGs to train an enhanced language representation model (ERNIE) which can take full advantage of lexical, syntactic, and knowledge information simultaneously, and is comparable with the state-of-the-art model BERT on other common NLP tasks.
1.8K