Ensemble classifiers based on correlation analysis for DNA microarray classification

Kyung Joong Kim, Sung Bae Cho

Research output: Contribution to journalArticle

46 Citations (Scopus)

Abstract

Since accurate classification of DNA microarray is a very important issue for the treatment of cancer, it is more desirable to make a decision by combining the results of various expert classifiers rather than by depending on the result of only one classifier. In spite of the many advantages of mutually error-correlated ensemble classifiers, they are limited in performance. It is difficult to create an optimal ensemble for DNA analysis that deals with few samples with large features. Usually, different feature sets are provided to learn the components of the ensemble expecting the improvement of classifiers. If the feature sets provide similar information, the combination of the classifiers trained from them cannot improve the performance because they will make the same error and there is no possibility of compensation. In this paper, we adopt correlation analysis of feature selection methods as a guideline of the separation of features to learn the components of ensemble. We propose two different correlation methods for the generation of feature sets to learn ensemble classifiers. Each ensemble classifier combines several other classifiers learned from different features and based on correlation analysis to classify cancer precisely. In this way, it is possible to systematically evaluate the performance of the proposed method with three benchmark datasets. Experimental results show that two ensemble classifiers whose components are learned from different feature sets that are negatively or complementarily correlated with each other produce the best recognition rates on the three benchmark datasets.

Original languageEnglish
Pages (from-to)187-199
Number of pages13
JournalNeurocomputing
Volume70
Issue number1-3
DOIs
Publication statusPublished - 2006 Dec 1

Fingerprint

Microarrays
Oligonucleotide Array Sequence Analysis
Benchmarking
DNA
Classifiers
Neoplasms
Guidelines
Correlation methods
Datasets
Feature extraction

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Cognitive Neuroscience
  • Artificial Intelligence

Cite this

@article{99f85dfc409e4d43a49f816714abd21e,
title = "Ensemble classifiers based on correlation analysis for DNA microarray classification",
abstract = "Since accurate classification of DNA microarray is a very important issue for the treatment of cancer, it is more desirable to make a decision by combining the results of various expert classifiers rather than by depending on the result of only one classifier. In spite of the many advantages of mutually error-correlated ensemble classifiers, they are limited in performance. It is difficult to create an optimal ensemble for DNA analysis that deals with few samples with large features. Usually, different feature sets are provided to learn the components of the ensemble expecting the improvement of classifiers. If the feature sets provide similar information, the combination of the classifiers trained from them cannot improve the performance because they will make the same error and there is no possibility of compensation. In this paper, we adopt correlation analysis of feature selection methods as a guideline of the separation of features to learn the components of ensemble. We propose two different correlation methods for the generation of feature sets to learn ensemble classifiers. Each ensemble classifier combines several other classifiers learned from different features and based on correlation analysis to classify cancer precisely. In this way, it is possible to systematically evaluate the performance of the proposed method with three benchmark datasets. Experimental results show that two ensemble classifiers whose components are learned from different feature sets that are negatively or complementarily correlated with each other produce the best recognition rates on the three benchmark datasets.",
author = "Kim, {Kyung Joong} and Cho, {Sung Bae}",
year = "2006",
month = "12",
day = "1",
doi = "10.1016/j.neucom.2006.03.002",
language = "English",
volume = "70",
pages = "187--199",
journal = "Review of Economic Dynamics",
issn = "1094-2025",
publisher = "Academic Press Inc.",
number = "1-3",

}

Ensemble classifiers based on correlation analysis for DNA microarray classification. / Kim, Kyung Joong; Cho, Sung Bae.

In: Neurocomputing, Vol. 70, No. 1-3, 01.12.2006, p. 187-199.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Ensemble classifiers based on correlation analysis for DNA microarray classification

AU - Kim, Kyung Joong

AU - Cho, Sung Bae

PY - 2006/12/1

Y1 - 2006/12/1

N2 - Since accurate classification of DNA microarray is a very important issue for the treatment of cancer, it is more desirable to make a decision by combining the results of various expert classifiers rather than by depending on the result of only one classifier. In spite of the many advantages of mutually error-correlated ensemble classifiers, they are limited in performance. It is difficult to create an optimal ensemble for DNA analysis that deals with few samples with large features. Usually, different feature sets are provided to learn the components of the ensemble expecting the improvement of classifiers. If the feature sets provide similar information, the combination of the classifiers trained from them cannot improve the performance because they will make the same error and there is no possibility of compensation. In this paper, we adopt correlation analysis of feature selection methods as a guideline of the separation of features to learn the components of ensemble. We propose two different correlation methods for the generation of feature sets to learn ensemble classifiers. Each ensemble classifier combines several other classifiers learned from different features and based on correlation analysis to classify cancer precisely. In this way, it is possible to systematically evaluate the performance of the proposed method with three benchmark datasets. Experimental results show that two ensemble classifiers whose components are learned from different feature sets that are negatively or complementarily correlated with each other produce the best recognition rates on the three benchmark datasets.

AB - Since accurate classification of DNA microarray is a very important issue for the treatment of cancer, it is more desirable to make a decision by combining the results of various expert classifiers rather than by depending on the result of only one classifier. In spite of the many advantages of mutually error-correlated ensemble classifiers, they are limited in performance. It is difficult to create an optimal ensemble for DNA analysis that deals with few samples with large features. Usually, different feature sets are provided to learn the components of the ensemble expecting the improvement of classifiers. If the feature sets provide similar information, the combination of the classifiers trained from them cannot improve the performance because they will make the same error and there is no possibility of compensation. In this paper, we adopt correlation analysis of feature selection methods as a guideline of the separation of features to learn the components of ensemble. We propose two different correlation methods for the generation of feature sets to learn ensemble classifiers. Each ensemble classifier combines several other classifiers learned from different features and based on correlation analysis to classify cancer precisely. In this way, it is possible to systematically evaluate the performance of the proposed method with three benchmark datasets. Experimental results show that two ensemble classifiers whose components are learned from different feature sets that are negatively or complementarily correlated with each other produce the best recognition rates on the three benchmark datasets.

UR - http://www.scopus.com/inward/record.url?scp=33750366183&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33750366183&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2006.03.002

DO - 10.1016/j.neucom.2006.03.002

M3 - Article

AN - SCOPUS:33750366183

VL - 70

SP - 187

EP - 199

JO - Review of Economic Dynamics

JF - Review of Economic Dynamics

SN - 1094-2025

IS - 1-3

ER -