Proceedings Article10.1109/ISSRE.2007.31
Using Machine Learning to Support Debugging with Tarantula
Lionel C. Briand,Yvan Labiche,Xuetao Liu +2 more
- 05 Nov 2007
- pp 137-146
114
TL;DR: It is shown that failure conditions as modeled by a C4.5 decision tree accurately predict failures and can therefore be used as well to help debugging.
read more
Abstract: Using a specific machine learning technique, this paper proposes a way to identify suspicious statements during debugging. The technique is based on principles similar to Tarantula but addresses its main flaw: its difficulty to deal with the presence of multiple faults as it assumes that failing test cases execute the same fault(s). The improvement we present in this paper results from the use of C4.5 decision trees to identify various failure conditions based on information regarding the test cases' inputs and outputs. Failing test cases executing under similar conditions are then assumed to fail due to the same fault(s). Statements are then considered suspicious if they are covered by a large proportion of failing test cases that execute under similar conditions. We report on a case study that demonstrates improvement over the original Tarantula technique in terms of statement ranking. Another contribution of this paper is to show that failure conditions as modeled by a C4.5 decision tree accurately predict failures and can therefore be used as well to help debugging.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A Survey on Software Fault Localization
TL;DR: A comprehensive overview of a broad spectrum of fault localization techniques, each of which aims to streamline the fault localization process and make it more effective by attacking the problem in a unique way is provided.
1.1K
DeepFL: integrating multiple fault diagnosis dimensions for deep fault localization
Xia Li,Wei Li,Yuqun Zhang,Lingming Zhang +3 more
- 10 Jul 2019
TL;DR: This work proposes DeepFL, a deep learning approach to automatically learn the most effective existing/latent features for precise fault localization and results show DeepFL can significantly outperform state-of-the-art TraPT/FLUCCS and is also surprisingly effective for cross-project prediction.
287
Latent error prediction and fault localization for microservice applications by learning from system trace logs
Xiang Zhou,Xin Peng,Tao Xie,Jun Sun,Chao Ji,Dewei Liu,Qilin Xiang,Chuan He +7 more
- 12 Aug 2019
TL;DR: The results indicate that MEPFL can achieve high accuracy in intraapplication prediction of latent errors, faulty microservices, and fault types, and outperforms a state-of-the-art approach of failure diagnosis for distributed systems.
251
Fault Localization with Code Coverage Representation Learning
Yi Li,Shaohua Wang,Tien N. Nguyen +2 more
- 22 May 2021
TL;DR: DeepRL4FL as mentioned in this paper uses code coverage representation learning (RL) and data dependencies RL for program statements to locate the buggy code at the statement and method levels by treating FL as an image pattern recognition problem.
CNN-FL: An Effective Approach for Localizing Faults using Convolutional Neural Networks
Zhuo Zhang,Yan Lei,Xiaoguang Mao,Panpan Li +3 more
- 01 Feb 2019
TL;DR: CNN-FL is proposed: an approach for localizing faults based on convolutional neural networks to explore the promising potential of deep learning in fault localization and empirical results show that CNN-FL significantly improves fault localization effectiveness.
123
References
•Book
C4.5: Programs for Machine Learning
J. Ross Quinlan
- 15 Oct 1992
TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
27.2K
•Book
Data Mining: Practical Machine Learning Tools and Techniques
Ian H. Witten,Eibe Frank,Mark Hall +2 more
- 25 Oct 1999
TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
25.4K
Programs for Machine Learning
Steven L. Salzberg,Alberto Segre +1 more
- 01 Jan 1994
TL;DR: In his new book, C4.5: Programs for Machine Learning, Quinlan has put together a definitive, much needed description of his complete system, including the latest developments, which will be a welcome addition to the library of many researchers and students.
Data mining
TL;DR: The graduate certificate’s narrow focus allows you to dig deep into this specific topic, and start applying your knowledge sooner.
6.8K
Prioritizing test cases for regression testing
TL;DR: Test case prioritization techniques schedule test cases for execution in an order that attempts to increase their effectiveness at meeting some performance goal as discussed by the authors, such as rate of fault detection, a measure of how quickly faults are detected within the testing process.
1.3K