Multi-objective software effort estimation
Federica Sarro,Alessio Petrozziello,Mark Harman +2 more
- 14 May 2016
- pp 619-630
TL;DR: A bi-objective effort estimation algorithm that combines Confidence Interval Analysis and assessment of Mean Absolute Error is introduced that outperforms the baseline, state-of-the-art and all three alternative formulations, statistically significantly.
read more
Abstract: We introduce a bi-objective effort estimation algorithm that combines Confidence Interval Analysis and assessment of Mean Absolute Error. We evaluate our proposed algorithm on three different alternative formulations, baseline comparators and current state-of-the-art effort estimators applied to five real-world datasets from the PROMISE repository, involving 724 different software projects in total. The results reveal that our algorithm outperforms the baseline, state-of-the-art and all three alternative formulations, statistically significantly (p Â12 ≥ 0.9) over all five datasets. We also provide evidence that our algorithm creates a new state-of-the-art, which lies within currently claimed industrial human-expert-based thresholds, thereby demonstrating that our findings have actionable conclusions for practicing software engineers.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A Deep Learning Model for Estimating Story Points
TL;DR: In this article, the authors propose a prediction model for estimating story points based on a novel combination of two powerful deep learning architectures: long short-term memory and recurrent highway network.
Finding Faster Configurations Using FLASH
TL;DR: Flash as mentioned in this paper is a sequential model-based method that sequentially explores the configuration space by reflecting on the configurations evaluated so far to determine the next best configuration to explore, which reduces the effort required to find the better configuration.
How to Evaluate Solutions in Pareto-based Search-Based Software Engineering? A Critical Review and Methodological Guidance
Miqing Li,Tao Chen,Xin Yao +2 more
TL;DR: This paper surveys 717 papers published between 2009 and 2019 from 36 venues in seven repositories, and selects 95 prominent studies, through which five important but overlooked issues in the area are identified, to codify a methodological guidance for selecting and using evaluation methods in different SBSE scenarios.
Cross-validation based K nearest neighbor imputation for software quality datasets: An empirical study
TL;DR: This work develops a novel incomplete-instance based KNN imputation technique, which utilizes a cross-validation scheme to optimize the parameters for each missing value and observes that the classification accuracy is improved or at least maintained by using the approach for missing data imputation.
83
Investigating the relationship between price, rating, and popularity in the Blackberry World App Store
TL;DR: The findings reveal that there are strong correlations between customer rating and popularity (rank of app downloads), and a mild correlation between app price and the number of features claimed for the app and also found that higher priced features tended to be lower rated by their users.
75
References
•Book
Statistical Power Analysis for the Behavioral Sciences
Jacob Cohen
- 01 Dec 1969
TL;DR: The concepts of power analysis are discussed in this paper, where Chi-square Tests for Goodness of Fit and Contingency Tables, t-Test for Means, and Sign Test are used.
124.4K
A fast and elitist multiobjective genetic algorithm: NSGA-II
TL;DR: This paper suggests a non-dominated sorting-based MOEA, called NSGA-II (Non-dominated Sorting Genetic Algorithm II), which alleviates all of the above three difficulties, and modify the definition of dominance in order to solve constrained multi-objective problems efficiently.
•Book
Classification and regression trees
Leo Breiman
- 01 Jan 1983
TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
22.7K
Classification and regression trees
TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weakness in two examples.
Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach
Eckart Zitzler,Lothar Thiele +1 more
TL;DR: The proof-of-principle results obtained on two artificial problems as well as a larger problem, the synthesis of a digital hardware-software multiprocessor system, suggest that SPEA can be very effective in sampling from along the entire Pareto-optimal front and distributing the generated solutions over the tradeoff surface.
8.6K