From frequency to meaning: vector space models of semantics

doi:10.1613/JAIR.2934

Open Access•Journal Article•10.1613/JAIR.2934

Peter D. Turney, +1 more

- 01 Jan 2010

- Journal of Artificial Intelligence Resea...

- Vol. 37, Iss: 1, pp 141-188

3.2K

TL;DR: The goal of this survey is to show the breadth of applications of VSMs for semantics, to provide a new perspective on VSMs, and to provide pointers into the literature for those who are less familiar with the field.

Citations

•Proceedings Article

Medical synonym extraction with concept space models

Chang Wang, +2 more

- 25 Jul 2015

TL;DR: Proposes a novel approach that integrates term embeddings with medical domain knowledge for healthcare applications, and shows that it outperforms the baseline approaches by a large margin.

41

•Journal Article•10.1109/ACCESS.2019.2900462

Automatic Classification Method for Software Vulnerability Based on Deep Neural Network

Guoyan Huang, +5 more

- 28 Feb 2019

- IEEE Access

TL;DR: Compared to SVM, Naive Bayes, and KNN, the TFI-DNN model achieves better performance on multiple evaluation metrics, including accuracy, recall, precision, and F1-score.

41

•Proceedings Article•10.1145/3183713.3196930

Sketching Linear Classifiers over Data Streams

Kai Sheng Tai, +3 more

- 27 May 2018

TL;DR: The Weight-Median Sketch adopts the core data structure used in the Count-Sketch, but instead of sketching counts, it captures sketched gradient updates to the model parameters.

41

•Journal Article•10.1371/journal.pone.0266325

Topic modeling revisited: New evidence on algorithm performance and quality metrics

Matthias Rüdiger, +3 more

- 28 Apr 2022

- PLOS ONE

TL;DR: This study compares all commonly used, non-application-specific topic modeling algorithms, assesses their relative performance, and analyzes the relationship between existing metrics and the known clustering to objectively determine under what conditions these algorithms may be utilized effectively.

41

UCCA: A Semantics-based Grammatical Annotation Scheme

Omri Abend, +1 more

- 01 Mar 2013

TL;DR: Proposes UCCA (Universal Conceptual Cognitive Annotation), a simple semantic annotation scheme that covers many of the most important elements and relations present in linguistic utterances, including verb-argument structure, optional adjuncts such as adverbials, clause embeddings, and the linkage between them.

41

References

•Journal Article•10.1002/J.1538-7305.1948.TB01338.X

A mathematical theory of communication

Claude E. Shannon

- 01 Jul 1948

- Bell System Technical Journal

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.

74.4K

•Journal Article

The mathematical theory of communication

Claude E. Shannon, +1 more

- 01 Jan 1949

- IEEE Transactions on Instrumentation and...

TL;DR: The Mathematical Theory of Communication (MTOC) was originally published as a paper on communication theory more than fifty years ago and has since gone through four hardcover and sixteen paperback printings.

36.2K

•Journal Article•10.5555/944919.944937

Latent Dirichlet allocation

David M. Blei, +2 more

- 01 Mar 2003

- Journal of Machine Learning Research

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.

36.2K

•Proceedings Article

Latent Dirichlet Allocation

David M. Blei, +2 more

- 03 Jan 2001

TL;DR: This paper proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).

25.5K

•Journal Article•10.21276/IJRE.2018.5.5.4

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 06 Dec 2004

TL;DR: This paper presents MapReduce, a programming model and an associated implementation for processing and generating large data sets, which runs on a large cluster of commodity machines and is highly scalable.

22.7K