A comparative study of classification methods for microarray data analysis

Date

2006

Authors

Hu, H.
Li, J.
Plank, A.
Wang, H.
Daggard, G.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Conference paper

Citation

Conferences in Research and Practice in Information Technology Series, 2006, vol.61, pp.33-37

Statement of Responsibility

Conference Name

Australasian Data Mining Conference (29 Nov 2006 : Sydney, New South Wales)

Abstract

In response to the rapid development of DNA Microarray technology, many classification methods have been used for Microarray classification. SVMs, decision trees, Bagging, Boosting and Random Forest are commonly used methods. In this paper, we conduct experimental comparison of LibSVMs, C4.5, BaggingC4.5, AdaBoostingC4.5, and Random Forest on seven Microarray cancer data sets. The experimental results show that all ensemble methods outperform C4.5. The experimental results also show that all five methods benefit from data preprocessing, including gene selection and discretization, in classification accuracy. In addition to comparing the average accuracies of ten-fold cross validation tests on seven data sets, we use two statistical tests to validate findings. We observe that Wilcoxon signed rank test is better than sign test for such purpose.

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

Copyright 2006 Australian Computer Society

License

Grant ID

Published Version

Call number

Persistent link to this record