The value and validity of software effort estimation models built from a multiple organization data set : a thesis submitted to Auckland University of Technology in fulfilment of the degree of Master of Philosophy(MPhil), 2008 / Kefu Deng ; supervisor: Stephen MacDonell.

The objective of this research is to empirically assess the value and validity of a multi-organization data set in the building of prediction models for several 'local' software organizations; that is, smaller organizations that might have a few project records but that are interested in i...

Full description

Saved in:
Bibliographic Details
Main Author: Deng, Kefu (Author)
Corporate Author: AUT University. School of Computing and Mathematical Sciences
Format: Ethesis
Language:English
Subjects:
Online Access:Click here to access this resource online
Description
Summary:The objective of this research is to empirically assess the value and validity of a multi-organization data set in the building of prediction models for several 'local' software organizations; that is, smaller organizations that might have a few project records but that are interested in improving their ability to accurately predict software project effort. Evidence to date in the research literature is mixed, due not to problems with the underlying research ideas but with limitations in the analytical processes employed: • the majority of previous studies have used only a single organization as the 'local' sample, introducing the potential for bias • the degree to which the conclusions of these studies might apply more generally is unable to be determined because of a lack of transparency in the data analysis processes used. It is the aim of this research to provide a more robust and visible test of the utility of the largest multi-organization data set currently available - that from the ISBSG - in terms of enabling smaller-scale organizations to build relevant and accurate models for project-level effort prediction. Stepwise regression is employed to enable the construction of 'local', 'global' and 'refined global' models of effort that are then validated against actual project data from eight organizations. The results indicate that local data, that is, data collected for a single organization, is almost always more effective as a basis for the construction of a predictive model than data sourced from a global repository. That said, the accuracy of the models produced from the global data set, while worse than that achieved with local data, may be sufficiently accurate in the absence of reliable local data - an issue that could be investigated in future research. The study concludes with recommendations for both software engineering practice - in setting out a more dynamic scenario for the management of software development - and research - in terms of implications for the collection and analysis of software engineering data.
Author supplied keywords: Organisation data set; Ordinary Least Squares OLS modelling; Project estimation; International Software Benchmarking Standards Group ISBSG; Statistics; Regression.
Physical Description:1 online resource
Also held in print (viii,113 leaves : illustrations ; 30 cm) (T 005.1068 DEN) in off-campus storage, box 109
Bibliography:Includes bibliographical references.
Requests
Request this item Request this AUT item so you can pick it up when you're at the library.
Interlibrary Loan With Interlibrary Loan you can request the item from another library. It's a free service.