Recently, demand on the tools to efficiently analyze biological genomic information has been on the rise. In this paper, we attempt to explore the optimal features and classifiers through a comparative study with the most promising feature selection methods and machine learning classifiers. In order to predict the cancer class, the gene information from patient’s marrow expressed by DNA microarray, who has either the acute myeloid leukemia or acute lymphoblastic leukemia. Pearson and Spearman’s correlation, Euclidean distance, cosine coefficient, information gain, mutual information and signal to noise ratio have been used for feature selection. Backpropagation neural network, self-organizing map, structure adaptive self-organizing map, support vector machine, inductive decision tree and k-nearest neighbor have been used for classification. Experimental results indicate that backpropagation neural network with Pearson’s correlation coefficients is the best method, obtaining 97.1% of recognition rate on the test data.
|Title of host publication||Advances in Soft Computing - AFSS 2002 - 2002 AFSS International Conference on Fuzzy Systems, Proceedings|
|Editors||Nikhil R. Pal, Michio Sugeno|
|Number of pages||8|
|Publication status||Published - 2002|
|Event||5th International Conference on Asian Fuzzy Systems Society, AFSS 2002 - Calcutta, India|
Duration: 2002 Feb 3 → 2002 Feb 6
|Name||Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|
|Other||5th International Conference on Asian Fuzzy Systems Society, AFSS 2002|
|Period||02/2/3 → 02/2/6|
Bibliographical notePublisher Copyright:
© Springer-Verlag Berlin Heidelberg 2002.
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Computer Science(all)