Data accumulation and software effort prediction

aut.researcherMacDonell, Stephen Gerard
dc.contributor.authorMacDonell, SG
dc.contributor.authorShepperd, M
dc.date.accessioned2011-10-31T07:56:46Z
dc.date.available2011-10-31T07:56:46Z
dc.date.copyright2010-09-16
dc.date.issued2010-09-16
dc.description.abstractBACKGROUND: In reality project managers are constrained by the incremental nature of data collection. Specifically, project observations are accumulated one project at a time. Likewise within-project data are accumulated one stage or phase at a time. However, empirical researchers have given limited attention to this perspective. PROBLEM: Consequently, our analyses may be biased. On the one hand, our predictions may be optimistic due to the availability of the entire data set, but on the other hand pessimistic due to the failure to capitalize upon the temporal nature of the data. Our goals are (i) to explore the impact of ignoring time when building cost prediction models and (ii) to show the benefits of re-estimating using completed phase data during a project. METHOD: Using a small industrial data set of sixteen software projects from a single organization we compare predictive models developed using a time-aware approach with a more traditional leave-one-out analysis. We then investigate the impact of using requirements, design and implementation phase data on estimating subsequent phase effort. RESULTS: First, we find that failure to take the temporal nature of data into account leads to unreliable estimates of their predictive efficacy. Second, for this organization, prior-phase effort data could be used to improve the management of subsequent process tasks. CONCLUSION: We should collect time-related data and use it in our analyses. Failure to do so may lead to incorrect conclusions being drawn, and may also inhibit industrial take up of our research work.
dc.identifier.citationPresentation at the 4th International Symposium on Empirical Software Engineering and Measurement, Bolzano-Bozen, Italy and published in Proceeding ESEM '10 Proceedings of the 2010 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement ACM New York, NY, USA
dc.identifier.doi10.1145/1852786.1852828
dc.identifier.isbn978-1-4503-0039-1
dc.identifier.urihttps://hdl.handle.net/10292/2441
dc.publisherIEEE Computer Society Press
dc.relation.urihttp://dx.doi.org/10.1145/1852786.1852828
dc.relation.urihttp://esem2010.case.unibz.it/program.php
dc.rightsCopyright © 2010 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.rights.accessrightsOpenAccess
dc.titleData accumulation and software effort prediction
dc.typeConference Contribution
pubs.organisational-data/AUT
pubs.organisational-data/AUT/Design & Creative Technologies
pubs.organisational-data/AUT/Design & Creative Technologies/School of Computing & Mathematical Science
pubs.organisational-data/AUT/PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers
pubs.organisational-data/AUT/PBRF Researchers/Design & Creative Technologies PBRF Researchers/DCT C & M Computing
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MacDonell and Shepperd (2010) ESEM.pdf
Size:
60.52 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
licence.htm
Size:
29.98 KB
Format:
Unknown data format
Description: