Do Automatically Generated Test Cases Make Debugging Easier? An Experimental Assessment of Debugging Effectiveness and Efficiency

doi:10.1145/2768829

Journal Article10.1145/2768829

Do Automatically Generated Test Cases Make Debugging Easier? An Experimental Assessment of Debugging Effectiveness and Efficiency

Mariano Ceccato, +4 more

- 02 Dec 2015

- ACM Transactions on Software Engineering...

- Vol. 25, Iss: 1, pp 5

54

TL;DR: It is shown that automatically generated test cases are as useful for debugging as manual test cases and, for less experienced developers, automatic tests are more useful on average due to their lower static and dynamic complexity.

Abstract: Several techniques and tools have been proposed for the automatic generation of test cases. Usually, these tools are evaluated in terms of fault-revealing or coverage capability, but their impact on the manual debugging activity is not considered. The question is whether automatically generated test cases are equally effective in supporting debugging as manually written tests.We conducted a family of three experiments (five replications) with humans (in total, 55 subjects) to assess whether the features of automatically generated test cases, which make them less readable and understandable (e.g., unclear test scenarios, meaningless identifiers), have an impact on the effectiveness and efficiency of debugging. The first two experiments compare different test case generation tools (Randoop vs. EvoSuite). The third experiment investigates the role of code identifiers in test cases (obfuscated vs. original identifiers), since a major difference between manual and automatically generated test cases is that the latter contain meaningless (obfuscated) identifiers.We show that automatically generated test cases are as useful for debugging as manual test cases. Furthermore, we find that, for less experienced developers, automatic tests are more useful on average due to their lower static and dynamic complexity.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

ACM Transactions on Software Engineering and Methodology : Volume 22, Nomor 4, 2013

Giovanni Denaro, +2 more

- 24 Mar 2014

135

Proceedings Article•10.1145/2931037.2931061

Automatic generation of oracles for exceptional behaviors

Alberto Goffi, +3 more

- 18 Jul 2016

TL;DR: Toradocu is proposed, a technique that automatically creates test oracles for exceptional behaviors from Javadoc comments that uses a combination of natural language processing and run-time instrumentation.

...read moreread less

118

•Proceedings Article•10.1145/2884781.2884847

The impact of test case summaries on bug fixing performance: an empirical investigation

Sebastiano Panichella, +4 more

- 14 May 2016

TL;DR: An approach which automatically generates test case summaries of the portion of code exercised by each individual test, thereby improving understandability, is proposed, which can complement the current techniques around automated unit test generation or search-based techniques designed to generate a possibly minimal set of test cases.

...read moreread less

104

Journal Article•10.1145/3078840

Human Competitiveness of Genetic Programming in Spectrum-Based Fault Localisation: Theoretical and Empirical Analysis

Shin Yoo, +4 more

- 28 Jun 2017

- ACM Transactions on Software Engineering...

TL;DR: This work reports on the application of Genetic Programming to Software Fault Localisation, a problem in the area of Search-Based Software Engineering (SBSE), and proves that no future human investigation could outperform the evolved solutions.

...read moreread less

79

•Proceedings Article•10.1109/ICST.2016.44

Unit Test Generation During Software Development: EvoSuite Plugins for Maven, IntelliJ and Jenkins

Andrea Arcuri, +2 more

- 11 Apr 2016

TL;DR: The resulting architecture of the plugins, and the challenges arising when building such plugins, are discussed, which are targeted for the EvoSuite tool and can be adapted and reused for other test generation tools as well.

...read moreread less

44

...

Expand

References

•Journal Article•10.2307/4615733

A Simple Sequentially Rejective Multiple Test Procedure

Sture Holm

- 01 Jan 1979

- Scandinavian Journal of Statistics

TL;DR: In this paper, a simple and widely accepted multiple test procedure of the sequentially rejective type is presented, i.e. hypotheses are rejected one at a time until no further rejections can be done.

...read moreread less

23.4K

•Journal Article•10.1109/TIT.1967.1053964

Nearest neighbor pattern classification

Thomas M. Cover, +1 more

- 01 Jan 1967

- IEEE Transactions on Information Theory

TL;DR: The nearest neighbor decision rule assigns to an unclassified sample point the classification of the nearest of a set of previously classified points, so it may be said that half the classification information in an infinite sample set is contained in the nearest neighbor.

...read moreread less

15.2K

•Book

Questionnaire Design, Interviewing and Attitude Measurement

Abraham Naftali Oppenheim

- 01 Jul 1992

TL;DR: The second edition of Dr Bram Oppenheim's established work, like the first, is a practical teaching text of survey methods as mentioned in this paper, which includes interviewing (both clip-board and depth interviewing), sampling and research design, data analysis, and a special chapter on pilot work.

...read moreread less

5.2K

Questionnaire Design, Interviewing and Attitude Measurement

Peter M. Chisnall

- 01 Oct 1993

4.1K

Proceedings Article•10.1145/1081706.1081750

CUTE: a concolic unit testing engine for C

Koushik Sen, +2 more

- 01 Sep 2005

TL;DR: In this paper, the authors address the problem of automating unit testing with memory graphs as inputs, and develop a method to represent and track constraints that capture the behavior of a symbolic execution of a unit with memory graph as inputs.

...read moreread less

2K