Using Sparse Semantic Embeddings Learned from Multimodal Text and Image Data to Model Human Conceptual Knowledge

Open AccessPosted Content

Using Sparse Semantic Embeddings Learned from Multimodal Text and Image Data to Model Human Conceptual Knowledge

- 07 Sep 2018

11

TL;DR: This paper combines multimodal information from both text and image-based representations derived from state-of-the-art distributional models to produce sparse, interpretable vectors using Joint Non-Negative Sparse Embedding and demonstrates their ability to predict interpretable linguistic descriptions of human ground-truth semantic knowledge.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1016/j.tics.2022.12.006

Decoding semantic representations in mind and brain

Saskia L. Frisby, +4 more

- 01 Jan 2023

- Trends in Cognitive Sciences

TL;DR: This article reviewed cognitive theories of semantic representation and their neural instantiation, and then considered contemporary approaches to neural decoding and assessed which types of representation each can possibly detect, and identified crucial links between cognitive theory, data collection, and analysis that can help to better connect neuroimaging to mechanistic theory of semantic cognition.

...read moreread less

33

•Journal Article•10.1017/S1351324919000196

Learning semantic sentence representations from visually grounded language without lexical knowledge.

Danny Merkx, +1 more

- 27 Mar 2019

- arXiv: Computation and Language

TL;DR: This article used a multimodal sentence encoder trained on a corpus of images with matching text captions to produce visually grounded sentence embeddings, which achieved state-of-the-art results on two image-caption retrieval benchmark data sets.

...read moreread less

14

•Proceedings Article•10.18653/V1/2020.COLING-MAIN.291

Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations

Simone Conia, +1 more

- 01 Dec 2020

TL;DR: This paper proposes Conception, a novel technique for building language-independent vector representations of concepts which places multilinguality at its core while retaining explicit relationships between concepts, and results in high-coverage representations that outperform the state of the art in multilingual and cross-lingual Semantic Word Similarity and Word Sense Disambiguation.

...read moreread less

14

•Journal Article•10.1017/S1351324919000196

Learning semantic sentence representations from visually grounded language without lexical knowledge

Danny Merkx, +1 more

- 01 Jul 2019

- Natural Language Engineering

TL;DR: The authors used a multimodal sentence encoder trained on a corpus of images with matching text captions to produce visually grounded sentence embeddings, which achieved state-of-the-art results on two image-caption retrieval benchmark datasets.

...read moreread less

11

Proceedings Article•10.48550/arXiv.2205.00756

VICE: Variational Interpretable Concept Embeddings

Lukas Muttenthaler, +5 more

- 02 May 2022

TL;DR: A PAC learning bound is introduced for VICE that can be used to estimate generalization performance or determine a sufﬁcient sample size for different experimental designs and rivals or outperforms its predecessor, SPoSE, at predicting human behavior in a triplet task.

...read moreread less

8

References

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

Proceedings Article•10.3115/V1/D14-1162

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

- 01 Oct 2014

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

41.6K

•Posted Content

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 16 Oct 2013

- arXiv: Computation and Language

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.

...read moreread less

22.9K

Journal Article•10.1145/219717.219748

WordNet: a lexical database for English

George A. Miller

- 01 Nov 1995

- Communications of The ACM

TL;DR: WordNet1 provides a more effective combination of traditional lexicographic information and modern computing, and is an online lexical database designed for use under program control.

...read moreread less

16.9K

•Posted Content

Visualizing and Understanding Convolutional Networks

Matthew D. Zeiler, +1 more

- 12 Nov 2013

- arXiv: Computer Vision and Pattern Recog...

TL;DR: In this article, the authors introduce a novel visualization technique that gives insight into the function of intermediate feature layers and the operation of the classifier, and perform an ablation study to discover the performance contribution from different model layers.

...read moreread less

9.7K

...

Expand

Using Sparse Semantic Embeddings Learned from Multimodal Text and Image Data to Model Human Conceptual Knowledge

Chat with Paper

AI Agents for this Paper

Citations

Decoding semantic representations in mind and brain

Learning semantic sentence representations from visually grounded language without lexical knowledge.

Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations

Learning semantic sentence representations from visually grounded language without lexical knowledge

VICE: Variational Interpretable Concept Embeddings

References

ImageNet: A large-scale hierarchical image database

Glove: Global Vectors for Word Representation

Distributed Representations of Words and Phrases and their Compositionality

WordNet: a lexical database for English

Visualizing and Understanding Convolutional Networks

Related Papers (5)

Understanding the Semantic Content of Sparse Word Embeddings Using a Commonsense Knowledge Base.

Multimodal distributional semantics

Neural embeddings: accurate and readable inferences based on semantic kernels

Distributional Semantic Models

Correlating image-based distributional semantic models with neural representations of concepts