Journal Article10.1016/J.JSS.2015.12.019
Automatically classifying software changes via discriminative topic model
59
TL;DR: A discriminative Probability Latent Semantic Analysis model with a novel initialization method which initializes the word distributions for different topics using labeled samples so that DPLSA is well applicable to cross-project software change message analysis.
read more
About: This article is published in Journal of Systems and Software. The article was published on 01 Mar 2016. The article focuses on the topics: Topic model & Discriminative model.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Neural-machine-translation-based commit message generation: how far are we?
Zhongxin Liu,Xin Xia,Ahmed E. Hassan,David Lo,Zhenchang Xing,Xinyu Wang +5 more
- 03 Sep 2018
TL;DR: A simpler and faster approach is proposed, named NNGen (Nearest Neighbor Generator), to generate concise commit messages using the nearest neighbor algorithm, which is over 2,600 times faster than NMT, and outperforms NMT in terms of BLEU by 21%.
284
Automatic generation of pull request descriptions
Zhongxin Liu,Xin Xia,Christoph Treude,David Lo,Shanping Li +4 more
- 10 Nov 2019
TL;DR: Zhang et al. as mentioned in this paper proposed an approach to automatically generate PR descriptions based on the commit messages and the added source code comments in the PRs using a sequence-to-sequence model.
100
Automating Change-Level Self-Admitted Technical Debt Determination
TL;DR: The experimental results show that the proposed change-level SATD Determination model achieves a promising and better performance than four baselines in terms of AUC and cost-effectiveness and “Diffusion” is the most discriminative dimension among the three dimensions of features for determining TD-introducing changes.
95
What Makes a Good Commit Message?
Ying-Jun Tian,Yuxia Zhang,Klaas-Jan Stol,Lin Jiang,Hui Li +4 more
- 07 Feb 2022
TL;DR: A taxonomy based on recurring patterns in commit messages' expressions is developed, investigating whether “good” commit messages can be automatically identified and whether such automation could prompt developers to write better commit messages.
How we refactor and how we document it? On the use of supervised machine learning algorithms to classify refactoring documentation
Eman Abdullah AlOmar,Anthony Peruma,Mohamed Wiem Mkaouer,Christian D. Newman,Ali Ouni,Marouane Kessentini +5 more
TL;DR: The results of the empirical investigation show that fixing code smells is not the main driver for developers to refactoring their code bases, and this classification challenges the original definition ofRefactoring, being exclusive to improving software design and fixing code smelling.
66
References
Latent dirichlet allocation
TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
•Proceedings Article
Latent Dirichlet Allocation
David M. Blei,Andrew Y. Ng,Michael I. Jordan +2 more
- 03 Jan 2001
TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
WordNet: a lexical database for English
TL;DR: WordNet1 provides a more effective combination of traditional lexicographic information and modern computing, and is an online lexical database designed for use under program control.
16.9K
Robust Face Recognition via Sparse Representation
TL;DR: This work considers the problem of automatically recognizing human faces from frontal views with varying expression and illumination, as well as occlusion and disguise, and proposes a general classification algorithm for (image-based) object recognition based on a sparse representation computed by C1-minimization.
Related Papers (5)
Raymond P.L. Buse,Westley Weimer +1 more
- 20 Sep 2010
Martin Fowler
- 01 Jan 1999
Jaechang Nam,Sunghun Kim +1 more
- 09 Nov 2015