Multimodal Language Analysis in the Wild: CMU-MOSEI Dataset and Interpretable Dynamic Fusion Graph

doi:10.18653/V1/P18-1208

Open Access•Proceedings Article•10.18653/V1/P18-1208


AmirAli Bagher Zadeh, +4 more

- 01 Jul 2018

- Vol. 1, pp 2236-2246

1.1K

TL;DR: This paper introduces CMU Multimodal Opinion Sentiment and Emotion Intensity (CMU-MOSEI), the largest dataset for sentiment analysis and emotion recognition to date, and uses a novel multimodal fusion technique called the Dynamic Fusion Graph (DFG), which is highly interpretable and achieves competitive performance compared with the previous state of the art.


Citations

Preprint•10.48550/arxiv.2405.00543

New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis

Quy T. Nguyen, +2 more

- 01 May 2024

TL;DR: New benchmark dataset and fine-grained cross-modal fusion framework for Vietnamese multimodal aspect-category sentiment analysis. The dataset includes 4,876 text-image pairs with fine-grained annotations for text and image. The framework effectively learns intra- and inter-modality interactions and achieves the highest F1 score of 79.73%.


Proceedings Article•10.1145/3652583.3658004

Modality-specific and -shared Contrastive Learning for Sentiment Analysis

Jiuxiang You, +3 more

- 30 May 2024

TL;DR: MMCL achieves state-of-the-art performance for sentiment analysis using modality-specific and -shared contrastive learning.


Journal Article•10.1109/TNNLS.2023.3294810

Sentiment Analysis: Comprehensive Reviews, Recent Advances, and Open Challenges.

Qiang Lu, +2 more

- 21 Jul 2023

- IEEE transactions on neural networks and...

TL;DR: In this paper, a taxonomy of multimodal sentiment analysis (MSA) methods is proposed to provide a comprehensive understanding of relevant advances in the field of sentiment analysis.


Proceedings Article•10.1109/chilecon60335.2023.10418672

Multimodal Emotion Recognition Dataset in the Wild (MERDWild)

Facundo Martínez, +2 more

- 05 Dec 2023

TL;DR: MERDWild, a multimodal emotion recognition dataset, addresses the challenge of unifying, cleaning, and transforming three datasets collected in uncontrolled environments with the aim of integrating and standardizing a database that encompasses three modalities: facial images, audio, and text.


Book Chapter•10.1145/3502398.3502402

Machine Learning Approaches for Applied Affective Computing

25 Jan 2022

Authors: Leimin Tian (Monash University), Sharon Oviatt (Monash University, Australia), Michal Muszynski (Carnegie Mellon University and University of Geneva), Brent C. Chamberlain (Utah State University), Jennifer Healey (Adobe Research, San Jose).


References

Journal Article•10.1023/A:1010933404324

Random Forests

Leo Breiman

- 01 Oct 2001

TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.


113.1K

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.


99K

Journal Article•10.1023/A:1022627411411

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995

- Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated, and the performance of the support-vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.


42K

Proceedings Article•10.3115/V1/D14-1162

GloVe: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

- 01 Oct 2014

TL;DR: A new global log-bilinear regression model is proposed that combines the advantages of the two major model families in the literature, global matrix factorization and local context window methods, and produces a vector space with meaningful substructure.


41.6K