Evaluating prediction systems in software project estimation

Shepperd, M; MacDonell, SG

Evaluating prediction systems in software project estimation

Files

(278.37 KB)

Date

2012

Authors

Shepperd, M

MacDonell, SG

Item type

Journal Article

Publisher

Elsevier

Abstract

Context

Software engineering has a problem in that when we empirically evaluate competing prediction systems we obtain conflicting results.

Objective To reduce the inconsistency amongst validation study results and provide a more formal foundation to interpret results with a particular focus on continuous prediction systems.

Method A new framework is proposed for evaluating competing prediction systems based upon (1) an unbiased statistic, Standardised Accuracy, (2) testing the result likelihood relative to the baseline technique of random ‘predictions’, that is guessing, and (3) calculation of effect sizes.

Results Previously published empirical evaluations of prediction systems are re-examined and the original conclusions shown to be unsafe. Additionally, even the strongest results are shown to have no more than a medium effect size relative to random guessing.

Conclusions Biased accuracy statistics such as MMRE are deprecated. By contrast this new empirical validation framework leads to meaningful results. Such steps will assist in performing future meta-analyses and in providing more robust and usable recommendations to practitioners.

Source

Information and Software Technology, vol.54(8), pp.820 - 827

DOI

10.1016/j.infsof.2011.12.008

Publisher's version

http://dx.doi.org/10.1016/j.infsof.2011.12.008

Rights statement

Copyright © 2012 Elsevier Ltd. All rights reserved. This is the author’s version of a work that was accepted for publication in (see Citation). Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. The definitive version was published in (see Citation). The original publication is available at (see Publisher's Version).

Permanent link

https://hdl.handle.net/10292/4423

Collections

SERL - Software Engineering Research Laboratory

Full item page