Proceedings Article10.1109/CSMR.2011.5
Process Mining Software Repositories
Wouter Poncin,Alexander Serebrenik,Mark van den Brand +2 more
- 01 Mar 2011
- pp 5-14
TL;DR: This work proposes to apply process mining techniques, originally developed for business process analysis, to address the challenge of successful information extraction by addressing the necessity to simultaneously analyze different repositories and to combine the information obtained.
read more
Abstract: Software developers’ activities are in general recorded in software repositories such as version control systems, bug trackers and mail archives While abundant information is usually present in such repositories, successful information extraction is often challenged by the necessity to simultaneously analyze different repositories and to combine the information obtained We propose to apply process mining techniques, originally developed for business process analysis, to address this challenge However, in order for process mining to become applicable, different software repositories should be combined, and “related” software development events should be matched: eg, mails sent about a file, modifications of the file and bug reports that can be traced back to it The combination and matching of events has been implemented in FRASR (Framework for Analyzing Software Repositories), augmenting the process mining framework ProM FRASR has been successfully applied in a series of case studies addressing such aspects of the development process as roles of different developers and the way bug reports are handled
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
StackOverflow and GitHub: Associations between Software Development and Crowdsourced Knowledge
Bogdan Vasilescu,Vladimir Filkov,Alexander Serebrenik +2 more
- 08 Sep 2013
TL;DR: This paper investigates the interplay between Stack Overflow activities and the development process, reflected by code changes committed to the largest social coding repository, GitHub, and shows that active GitHub committers ask fewer questions and provide more answers than others.
Process mining techniques and applications – A systematic mapping study
Cleiton dos Santos Garcia,Alex Meincheim,Elio Ribeiro Faria Junior,Marcelo Rosano Dallagassa,Denise Maria Vecino Sato,Denise Maria Vecino Sato,Deborah Ribeiro Carvalho,Eduardo Alves Portela Santos,Edson Emílio Scalabrin +8 more
TL;DR: It is possible to observe that the most active research topics are associated with the process discovery algorithms, followed by conformance checking, and architecture and tools improvements, and finally application domains among different business segments are reported on.
323
A survey on educational process mining
TL;DR: This paper introduces EPM and elaborates on some of the potential of this technology in the educational domain and describes some other relevant, related areas such as intentional mining, sequential pattern mining and graph mining.
232
A comparison of identity merge algorithms for software repositories
Mathieu Goeminne,Tom Mens +1 more
TL;DR: This article provides an objective comparison of identity merge algorithms, including some improvements over existing algorithms, and is validated on a selection of large ongoing open source software projects.
87
References
•Book
Introduction to Information Retrieval
Christopher D. Manning,Prabhakar Raghavan,Hinrich Schütze +2 more
- 01 Jan 2008
TL;DR: In this article, the authors present an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections.
Guidelines for conducting and reporting case study research in software engineering
Per Runeson,Martin Höst +1 more
TL;DR: This paper aims at providing an introduction to case study methodology and guidelines for researchers conducting case studies and readers studying reports of such studies, and presents recommended practices and evaluated checklists for researchers and readers of case study research.
The application of Petri-nets to workflow management
TL;DR: This paper introduces workflow management as an application domain for Petri nets, presents state-of-the-art results with respect to the verification of workflows, and highlights some Petri-net-based workflow tools.
The prom framework: a new era in process mining tool support
B. F. van Dongen,A. K. A. de Medeiros,H. M. W. Verbeek,A.J.M.M. Weijters,W.M.P. van der Aalst +4 more
- 20 Jun 2005
TL;DR: The ProM framework is introduced and an overview of the plug-ins that have been developed and is flexible with respect to the input and output format, and is also open enough to allow for the easy reuse of code during the implementation of new process mining ideas.
Fuzzy mining: adaptive process simplification based on multi-perspective metrics
CW Christian Günther,Wil M. P. van der Aalst +1 more
- 24 Sep 2007
TL;DR: A new process mining approach is proposed that is configurable and allows for different faithfully simplifiedviews of a particular process, just like different roadmaps provide suitable abstractions of reality.