Studying software evolution using topic models
TL;DR: This paper takes a first step towards evaluating topic models in the analysis of software evolution by performing a detailed manual analysis of the source code histories of two well-known and well-documented systems, JHotDraw and jEdit.
About: This article was published in Science of Computer Programming on 01 Feb 2014 and is currently open access. It focuses on the topics: Topic model & Software evolution.
Citations
What are developers talking about? An analysis of topics and trends in Stack Overflow
TL;DR: This article uses latent Dirichlet allocation (LDA), a statistical topic modeling technique, to automatically discover the main topics present in developer discussions of Stack Overflow and analyzes these discovered topics, as well as their relationships and trends over time, to gain insights into the development community.
What is wrong with topic modeling? And how to fix it using search-based software engineering
TL;DR: LDADE, a search-based software engineering tool that uses Differential Evolution (DE) to tune LDA's parameters, is used to make the distributions generated by LDA more stable and therefore usable for further analysis.
258
Source code metrics
Alberto S. Nuñez-Varela, Héctor G. Pérez-Gonzalez, Francisco E. Martínez-Perez, Carlos Soubervielle-Montalvo
TL;DR: There is a current need for more studies on aspect and feature oriented metrics, especially for the current interest in programming concerns and software product lines.
159
Topic Modeling Using Latent Dirichlet allocation: A Survey
Uttam Chauhan, Apurva Shah
TL;DR: This paper introduces the preliminaries of topic modeling techniques and reviews their extensions and variations, such as topic modeling over various domains, hierarchical topic modeling, word-embedded topic models, and topic models from multilingual perspectives.
156
An exploratory analysis of mobile development issues using stack overflow
Mario Linares-Vasquez, Bogdan Dit, Denys Poshyvanyk
- 18 May 2013
TL;DR: This paper uses topic modeling techniques to extract hot topics from mobile-development-related questions. The results suggest that most of the questions include topics related to general questions and compatibility issues, and that the most specific topics are present in a reduced set of questions.
References
A mathematical theory of communication
TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.
74.4K
•Book
Elements of information theory
Thomas M. Cover, Joy A. Thomas
- 01 Jan 1991
TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
Latent Dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
•Proceedings Article
Latent Dirichlet Allocation
David M. Blei, Andrew Y. Ng, Michael I. Jordan
- 03 Jan 2001
TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
•Journal Article
A Mathematical Theory of Communication
TL;DR: Scientific knowledge grows at a phenomenal pace--but few books have had as lasting an impact or played as important a role in our modern world as The Mathematical Theory of Communication, published originally as a paper on communication theory more than fifty years ago.
18.4K