Optimal gene selection for cancer classification with partial correlation and k-nearest neighbor classifier

Si Ho Yoo, Sung Bae Cho

Research output: Contribution to journal › Conference article › peer-review

2 Citations (Scopus)

Abstract

High-density DNA microarrays are widely used in cancer research to monitor thousands of genes at once. Because microarray experiments have small sample sizes and large numbers of genes, selecting significant genes from their expression patterns is an important step in cancer classification. Many gene selection methods have been investigated, but no single method is clearly best. In this paper we propose a new gene selection method, based on partial correlation in regression analysis, that finds informative genes for predicting cancer. The genes selected by this method tend to carry information about the cancer that does not overlap with that of previously selected genes. We measured the sensitivity, specificity, and recognition rate of the selected genes with a k-nearest neighbor classifier on a colon cancer dataset. In most cases, the proposed method produced better results than gene selection methods based on correlation coefficients, reaching an accuracy of 90.3% on the colon cancer dataset.
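The abstract does not spell out the selection procedure, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes a greedy forward selection in which each new gene is the one with the largest absolute partial correlation with the class label, controlling for the genes already chosen, followed by a plain k-nearest neighbor vote. All function names (`residualize`, `select_genes`, `knn_predict`, `sensitivity_specificity`) are hypothetical, and the demo data are synthetic rather than the colon cancer dataset.

```python
import numpy as np


def residualize(v, Z):
    """Residual of v after least-squares regression on the columns of Z plus an intercept."""
    A = np.column_stack([np.ones(len(v)), Z])
    coef, *_ = np.linalg.lstsq(A, v, rcond=None)
    return v - A @ coef


def select_genes(X, y, n_genes):
    """Greedy forward selection by |partial correlation| with the label (assumed procedure)."""
    y = np.asarray(y, dtype=float)
    selected = []
    for _ in range(n_genes):
        Z = X[:, np.array(selected, dtype=int)]  # genes chosen so far
        ry = residualize(y, Z)                   # label with their effect removed
        best, best_score = None, -1.0
        for j in range(X.shape[1]):
            if j in selected:
                continue
            rx = residualize(X[:, j], Z)         # candidate gene with their effect removed
            denom = np.linalg.norm(rx) * np.linalg.norm(ry)
            # Residuals have zero mean, so this ratio is their correlation.
            score = abs(rx @ ry) / denom if denom > 1e-12 else 0.0
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
    return selected


def knn_predict(X_train, y_train, X_test, k=3):
    """Majority vote among the k nearest training samples (Euclidean distance, labels 0/1)."""
    dist = np.linalg.norm(X_test[:, None, :] - X_train[None, :, :], axis=2)
    nearest = np.argsort(dist, axis=1)[:, :k]
    return (y_train[nearest].mean(axis=1) > 0.5).astype(int)


def sensitivity_specificity(y_true, y_pred):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = np.sum((y_true == 1) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    tn = np.sum((y_true == 0) & (y_pred == 0))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Synthetic stand-in data: 60 samples, 50 "genes", alternating class labels.
    rng = np.random.default_rng(0)
    y = np.arange(60) % 2
    X = rng.normal(size=(60, 50))
    X[:, 0] += 2.0 * y                                  # informative gene
    X[:, 1] = X[:, 0] + 0.1 * rng.normal(size=60)       # near-duplicate of gene 0
    X[:, 2] += 1.5 * y                                  # independently informative gene

    genes = select_genes(X[:40], y[:40], n_genes=3)
    pred = knn_predict(X[:40][:, genes], y[:40], X[40:][:, genes], k=3)
    sens, spec = sensitivity_specificity(y[40:], pred)
    print("selected genes:", genes)
    print("sensitivity:", sens, "specificity:", spec, "accuracy:", np.mean(pred == y[40:]))
```

Conditioning on the already selected genes is what distinguishes this from ranking genes by plain correlation: a near-duplicate of a chosen gene has little residual association with the label and is therefore unlikely to be picked again, which matches the abstract's remark about non-overlapping information.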

Original language: English
Pages (from-to): 713-722
Number of pages: 10
Journal: Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
Volume: 3157
Publication status: Published - 2004
Event: 8th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2004: Trends in Artificial Intelligence - Auckland, New Zealand
Duration: 2004 Aug 9 → 2004 Aug 13

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)
