The Value and Validity of Software Effort Estimation Models Built From a Multiple Organization Data Set

Deng, Kefu

The Value and Validity of Software Effort Estimation Models Built From a Multiple Organization Data Set

aut.embargo	No	en
aut.thirdpc.contains	No
aut.thirdpc.permission	No
aut.thirdpc.removed	No
dc.contributor.advisor	MacDonell, Stephen
dc.contributor.author	Deng, Kefu
dc.date.accessioned	2009-01-29T22:35:48Z
dc.date.available	2009-01-29T22:35:48Z
dc.date.copyright	2008
dc.date.issued	2008
dc.description.abstract	The objective of this research is to empirically assess the value and validity of a multi-organization data set in the building of prediction models for several ‘local’ software organizations; that is, smaller organizations that might have a few project records but that are interested in improving their ability to accurately predict software project effort. Evidence to date in the research literature is mixed, due not to problems with the underlying research ideas but with limitations in the analytical processes employed: • the majority of previous studies have used only a single organization as the ‘local’ sample, introducing the potential for bias • the degree to which the conclusions of these studies might apply more generally is unable to be determined because of a lack of transparency in the data analysis processes used. It is the aim of this research to provide a more robust and visible test of the utility of the largest multi-organization data set currently available – that from the ISBSG – in terms of enabling smaller-scale organizations to build relevant and accurate models for project-level effort prediction. Stepwise regression is employed to enable the construction of ‘local’, ‘global’ and ‘refined global’ models of effort that are then validated against actual project data from eight organizations. The results indicate that local data, that is, data collected for a single organization, is almost always more effective as a basis for the construction of a predictive model than data sourced from a global repository. That said, the accuracy of the models produced from the global data set, while worse than that achieved with local data, may be sufficiently accurate in the absence of reliable local data – an issue that could be investigated in future research. The study concludes with recommendations for both software engineering practice – in setting out a more dynamic scenario for the management of software development – and research – in terms of implications for the collection and analysis of software engineering data.
dc.identifier.uri	https://hdl.handle.net/10292/473
dc.language.iso	en	en_US
dc.publisher	Auckland University of Technology
dc.rights.accessrights	OpenAccess
dc.subject	Organisation data set
dc.subject	Ordinary Least Squares OLS modelling
dc.subject	Project estimation
dc.subject	International Software Benchmarking Standards Group ISBSG
dc.subject	Statistics
dc.subject	Regression
dc.title	The Value and Validity of Software Effort Estimation Models Built From a Multiple Organization Data Set
dc.type	Thesis
thesis.degree.grantor	Auckland University of Technology
thesis.degree.level	Masters Theses
thesis.degree.name	Master of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1

Name:: DengK.pdf
Size:: 1.3 MB
Format:: Adobe Portable Document Format
Description:: Thesis

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 979 B
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Masters Theses