An exploratory study about cross-project defect prediction: impact of using different classification algorithms and a measure of performance in building predictive models.
Ricardo F. P. Satin$^{1}$, Igor Scaliante Wiese$^{1}$, Reginaldo Ré$^{1}$
$^{1}$Universidade Tecnológica Federal do Paraná–Campo Mourão. Paraná Brazil
email: ricardo.francisco@gmail.com, igor@utfpr.edu.br, reginaldo@utfpr.edu.br
Schedule:Tue 20th@10:15, Room: B

Predicting defects in software projects is a complex task, especially in the initial phases of software development because there are a few available data. The use of cross-project defect prediction is indicated in such situation because it enables reuse of data from similar projects. In order to find and group similar projects, this paper proposes the construction of cross-project prediction models using a measure of performance achieved through the application of classification algorithms. To do so, we studied the combined application of different algorithms of classification, of feature selection, and clustering data, applied to 1270 projects aiming to building different cross-project prediction models. In this study we concluded that Naive Bayes algorithm obtained the best performance, with 31.58% of satisfactory predictions in 19 models created with its use. This proposal seems to hold promise, once the local predictions considered satisfactory reached 31.58%, against 26.31% of global predictions.

BibTex

@InProceedings{CLEI-2015:144463,
	author 		= {Ricardo F. P. Satin and Igor Scaliante Wiese and Reginaldo RĂ©},
	title 		= {An exploratory study about cross-project defect prediction: impact of using different classification algorithms and a measure of performance in building predictive models.},
	booktitle 	= {2015 XLI Latin American Computing Conference (CLEI)},
	pages 		= {683--694},
	year 		= {2015},
	editor 		= {Hector Cancela and Alex Cuadros-Vargas and Ernesto Cuadros-Vargas},
	address 	= {Arequipa-Peru},
	month 		= {October},
	organization 	= {CLEI},
	publisher 	= {CLEI},
	url 		= {http://clei.org/clei2015/144463},
	isbn 		= {978-1-4673-9143-6},
	}


Generated by Ernesto Cuadros-Vargas , Sociedad Peruana de Computación-Peru, Universidad Católica San Pablo, Arequipa-Perú