Mining software metrics from the jazz repository

Date
2011-09-19
Authors
Connor, AM
Supervisor
Item type
Journal Article
Degree name
Journal Title
Journal ISSN
Volume Title
Publisher
ARPN Journal of Systems and Software
Abstract

This paper describes the extraction of source code metrics from the Jazz repository and the systematic application of data mining techniques to identify the most useful of those metrics for predicting the success or failure of an attempt to construct a working instance of the software product. Results are presented from a study using the J48 classification method used in conjunction with a number of attribute selection strategies applied to a set of source code metrics. These strategies involve the investigation of differing slices of code from the version control system and the cross-dataset classification of the various significant metrics in an attempt to work around the multicollinearity implicit in the available data. The results indicate that only a relatively small number of the available software metrics that have been considered have any significance for predicting the outcome of a build. These significant metrics are outlined and implication of the results discussed, particularly the relative difficulty of being able to predict failed build attempts.

Description
Keywords
Data mining , Jazz , Software metrics , Software repositories
Source
ARPN Journal of Systems & Software, vol.1(5), pp.194 - 204
DOI
Rights statement
ARPN Journal of Systems and Software is partly sponsored by some non-governmental organizations. Being part of open-access initiative, the published research papers are freely available to everyone and we don’t apply any subscription charges for our readers or libraries.