Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Proceedings Article
On Estimation and Selection for Topic Models
Matt Taddy
- 21 Mar 2012
TL;DR: In this paper, the authors describe posterior maximization for topic models, identifying computational and conceptual gains from inference under a non-standard parametrization, and show that fitted parameters can be used as the basis for a novel approach to marginal likelihood estimation, via block-diagonal approximation to the information matrix, that facilitates choosing the number of latent topics.
•Posted Content
Measuring Polarization in High-Dimensional Data: Method and Application to Congressional Speech
TL;DR: The authors study trends in the partisanship of Congressional speech from 1873 to 2009 and find that partisanship is far greater today than at any point in the past, suggesting innovation in political persuasion beginning with the Contract with America, possibly reinforced by changes in the media environment.
Mapping the scattered field of research on higher education. A correlated topic model of 17,000 articles, 1991–2018
TL;DR: In this paper, the abstracts of 16,928 articles on higher education between 1991 and 2018 were analyzed using topic models, which are a collection of automatic content analysis methods that allow to map the structure of large text data.
Dimensionality Reduction and Topic Modeling: From Latent Semantic Indexing to Latent Dirichlet Allocation and Beyond
Steven P. Crain,Ke Zhou,Shuang-Hong Yang,Hongyuan Zha +3 more
- 01 Jan 2012
TL;DR: This chapter surveys two influential forms of dimension reduction, including probabilistic latent semantic indexing and latent Dirichlet allocation, and describes the basic technologies in detail and exposes the underlying mechanism.
131
The relative importance of service quality dimensions in E-commerce experiences
Biagio Palese,Antonio Usai +1 more
TL;DR: This study shows that evaluation systems designed considering the knowledge extracted directly from customers review lead to a service quality measurement that not only is theory-based, but also more accurate.
125
References
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
Latent dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Inference of population structure using multilocus genotype data
TL;DR: Pritch et al. as discussed by the authors proposed a model-based clustering method for using multilocus genotype data to infer population structure and assign individuals to populations, which can be applied to most of the commonly used genetic markers, provided that they are not closely linked.
•Book
Monte Carlo Statistical Methods
Christian P. Robert,George Casella +1 more
- 01 Jan 1999
TL;DR: This new edition contains five completely new chapters covering new developments and has sold 4300 copies worldwide of the first edition (1999).
Finding scientific topics
TL;DR: A generative model for documents is described, introduced by Blei, Ng, and Jordan, and a Markov chain Monte Carlo algorithm is presented for inference in this model, which is used to analyze abstracts from PNAS by using Bayesian model selection to establish the number of topics.