From frequency to meaning: vector space models of semantics

doi:10.1613/JAIR.2934

Open AccessJournal Article10.1613/JAIR.2934

From frequency to meaning: vector space models of semantics

Peter D. Turney, +1 more

- 01 Jan 2010

- Journal of Artificial Intelligence Resea...

- Vol. 37, Iss: 1, pp 141-188

3.2K

TL;DR: The goal in this survey is to show the breadth of applications of VSMs for semantics, to provide a new perspective on VSMs, and to provide pointers into the literature for those who are less familiar with the field.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 05 Dec 2013

TL;DR: This paper presents a simple method for finding phrases in text, and shows that learning good vector representations for millions of phrases is possible and describes a simple alternative to the hierarchical softmax called negative sampling.

...read moreread less

24.1K

•Posted Content

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 16 Oct 2013

- arXiv: Computation and Language

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.

...read moreread less

22.9K

Journal Article•10.1145/242224.242229

Machine learning

Thomas G. Dietterich

- 01 Dec 1996

- ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

14K

•Journal Article•10.1162/TACL_A_00051

Enriching Word Vectors with Subword Information

Piotr Bojanowski, +3 more

- 12 Jun 2017

- Transactions of the Association for Comp...

TL;DR: This paper proposed a new approach based on skip-gram model, where each word is represented as a bag of character n-grams, words being represented as the sum of these representations, allowing to train models on large corpora quickly and allowing to compute word representations for words that did not appear in the training data.

...read moreread less

10.3K

•Proceedings Article

Distributed Representations of Sentences and Documents

Quoc V. Le, +1 more

- 21 Jun 2014

TL;DR: Paragraph Vector is an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents, and its construction gives the algorithm the potential to overcome the weaknesses of bag-of-words models.

...read moreread less

8.9K

...

Expand

References

Foundations of the PARAFAC procedure: Models and conditions for an "explanatory" multi-model factor analysis

Richard A. Harshman

- 01 Jan 1970

TL;DR: It is shown that an extension of Cattell's principle of rotation to Proportional Profiles (PP) offers a basis for determining explanatory factors for three-way or higher order multi-mode data.

...read moreread less

3.3K

•Book

The SMART Retrieval System—Experiments in Automatic Document Processing

Gerard Salton

- 01 Jan 1971

3.3K

Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy

Jay J. Jiang, +1 more

- 01 Aug 1997

TL;DR: This paper presents a new approach for measuring semantic similarity/distance between words and concepts that combines a lexical taxonomy structure with corpus statistical information so that the semantic distance between nodes in the semantic space constructed by the taxonomy can be better quantified with the computational evidence derived from a distributional analysis of corpus data.

...read moreread less

3.2K

•Proceedings Article

Using information content to evaluate semantic similarity in a taxonomy

Philip Resnik

- 20 Aug 1995

TL;DR: This paper presents a new measure of semantic similarity in an IS-A taxonomy, based on the notion of information content, which performs encouragingly well and is significantly better than the traditional edge counting approach.

...read moreread less

3.2K

Journal Article•10.1023/A:1011424425034

Foundations of Statistical Natural Language Processing

Paul B. Kantor

- 01 Apr 2001

- Information Retrieval

TL;DR: This book is already in probability information theory and linguistic found it should be well grounded and indeed it is, this foundational text in human language applications who want to create the way.

...read moreread less

3K

...