The impact of automated feature selection techniques on the interpretation of defect models

Jiarpakdee, J.; Tantithamthavorn, C.; Treude, C.

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/127248

Full metadata record

DC Field	Value	Language
dc.contributor.author	Jiarpakdee, J.	-
dc.contributor.author	Tantithamthavorn, C.	-
dc.contributor.author	Treude, C.	-
dc.date.issued	2020	-
dc.identifier.citation	Empirical Software Engineering: an international journal, 2020; OnlinePubl.(5):1-49	-
dc.identifier.issn	1382-3256	-
dc.identifier.issn	1573-7616	-
dc.identifier.uri	http://hdl.handle.net/2440/127248	-
dc.description.abstract	The interpretation of defect models heavily relies on software metrics that are used to construct them. Prior work often uses feature selection techniques to remove metrics that are correlated and irrelevant in order to improve model performance. Yet, conclusions that are derived from defect models may be inconsistent if the selected metrics are inconsistent and correlated. In this paper, we systematically investigate 12 automated feature selection techniques with respect to the consistency, correlation, performance, computational cost, and the impact on the interpretation dimensions. Through an empirical investigation of 14 publicly-available defect datasets, we find that (1) 94–100% of the selected metrics are inconsistent among the studied techniques; (2) 37–90% of the selected metrics are inconsistent among training samples; (3) 0–68% of the selected metrics are inconsistent when the feature selection techniques are applied repeatedly; (4) 5–100% of the produced subsets of metrics contain highly correlated metrics; and (5) while the most important metrics are inconsistent among correlation threshold values, such inconsistent most important metrics are highly-correlated with the Spearman correlation of 0.85–1. Since we find that the subsets of metrics produced by the commonly-used feature selection techniques (except for AutoSpearman) are often inconsistent and correlated, these techniques should be avoided when interpreting defect models. In addition to introducing AutoSpearman which mitigates correlated metrics better than commonly-used feature selection techniques, this paper opens up new research avenues in the automated selection of features for defect models to optimise for interpretability as well as performance.	-
dc.description.statementofresponsibility	Jirayus Jiarpakdee, Chakkrit Tantithamthavorn, Christoph Treude	-
dc.language.iso	en	-
dc.publisher	Springer Nature	-
dc.rights	© Springer Science+Business Media, LLC, part of Springer Nature 2020	-
dc.source.uri	http://dx.doi.org/10.1007/s10664-020-09848-1	-
dc.subject	Software analytics; defect prediction; model interpretation; feature selection	-
dc.title	The impact of automated feature selection techniques on the interpretation of defect models	-
dc.type	Journal article	-
dc.identifier.doi	10.1007/s10664-020-09848-1	-
dc.relation.grant	http://purl.org/au-research/grants/arc/DE200100941	-
dc.relation.grant	http://purl.org/au-research/grants/arc/DE180100153	-
pubs.publication-status	Published	-
dc.identifier.orcid	Tantithamthavorn, C. [0000-0002-5516-9984]	-
dc.identifier.orcid	Treude, C. [0000-0002-6919-2149]	-
Appears in Collections:	Aurora harvest 4; Computer Science publications

Files in This Item:

There are no files associated with this item.

Adelaide Research & Scholarship