Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Clickthrough-based latent semantic models for web search
Jianfeng Gao,Kristina Toutanova,Wen-tau Yih +2 more
- 24 Jul 2011
TL;DR: Two new document ranking models for Web search based upon the methods of semantic representation and the statistical translation-based approach to information retrieval (IR) are presented.
Computational historiography: Data mining in a century of classics journals
TL;DR: Computational methods for identifying patterns and testing hypotheses about Classics as a field can help organize large collections, introduce younger scholars to the history of the field, and act as a “survey,” identifying anomalies that can be explored using more traditional methods.
Mapping the Topic Landscape of JPIM , 1984-2013: In Search of Hidden Structures and Development Trajectories
TL;DR: This work uses a topic modeling algorithm to extract 57 distinct topics and the corresponding most common words, terms, and phrases from the entire full-text corpus of 1008 JPIM articles published between 1984 and 2013, and maps these topics onto the PDMA Body of Knowledge categories.
104
•Proceedings Article
Word Epoch Disambiguation: Finding How Words Change Over Time
Rada Mihalcea,Vivi Nastase +1 more
- 08 Jul 2012
TL;DR: The novel task of "word epoch disambiguation," defined as the problem of identifying changes in word usage over time, is introduced and it is shown that the task is feasible, and significant differences can be observed between occurrences of words in different periods of time.
103
•Posted Content
Deep Exponential Families
TL;DR: This extensive study shows that going beyond one layer improves predictions for DEFs, and demonstrates that DEFs find interesting exploratory structure in large data sets, and give better predictive performance than state-of-the-art models.
References
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Latent dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Inference of population structure using multilocus genotype data
TL;DR: Pritch et al. as discussed by the authors proposed a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations, which can be applied to most of the commonly used genetic markers, provided that they are not closely linked.
•Book
Monte Carlo Statistical Methods
Christian P. Robert,George Casella +1 more
- 01 Jan 1999
TL;DR: This new edition contains five completely new chapters covering new developments and has sold 4300 copies worldwide of the first edition (1999).
Finding scientific topics
TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.