K-TSN(k-Top Scoring N): Microarray data classification based on rank-comparison decision rules

Youngmi Yoon, Sangjay Bien, Sanghyun Park

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Microarrays produce expression measurements for thousands of genes simultaneously, which is useful for the phenotype classification. We performed a direct integration of individual microarrays with same biological objectives by converting an expression value into a rank value within a sample and built a classifier based on rank comparison. Our classifier is an ensemble method, which has k top-scoring decision rules. Each rule contains a number of genes, a relationship between those genes, and a class label. Current classifiers fix the number of genes in each rule as a pair or a triple. In this paper, we generalized the number of genes involved in each rule. Generalizing the number of genes increases the robustness and the reliability of the classifier. Our algorithm saves resources by combining shorter rules to build a longer-rule, shows a rapid convergence toward its high-scoring rule list, and outperforms the current methods in run-time and classification accuracy.

Original languageEnglish
Title of host publicationProceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007
Pages188-192
Number of pages5
DOIs
Publication statusPublished - 2007 Dec 1
EventFrontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007 - Jeju Island, Korea, Republic of
Duration: 2007 Oct 112007 Oct 13

Publication series

NameProceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007

Other

OtherFrontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007
CountryKorea, Republic of
CityJeju Island
Period07/10/1107/10/13

Fingerprint

Microarrays
Genes
Classifiers
Labels

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Information Systems

Cite this

Yoon, Y., Bien, S., & Park, S. (2007). K-TSN(k-Top Scoring N): Microarray data classification based on rank-comparison decision rules. In Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007 (pp. 188-192). [4524102] (Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007). https://doi.org/10.1109/FBIT.2007.19
Yoon, Youngmi ; Bien, Sangjay ; Park, Sanghyun. / K-TSN(k-Top Scoring N) : Microarray data classification based on rank-comparison decision rules. Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007. 2007. pp. 188-192 (Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007).
@inproceedings{df6eb2d9bf13478aa007952eeb1070c4,
title = "K-TSN(k-Top Scoring N): Microarray data classification based on rank-comparison decision rules",
abstract = "Microarrays produce expression measurements for thousands of genes simultaneously, which is useful for the phenotype classification. We performed a direct integration of individual microarrays with same biological objectives by converting an expression value into a rank value within a sample and built a classifier based on rank comparison. Our classifier is an ensemble method, which has k top-scoring decision rules. Each rule contains a number of genes, a relationship between those genes, and a class label. Current classifiers fix the number of genes in each rule as a pair or a triple. In this paper, we generalized the number of genes involved in each rule. Generalizing the number of genes increases the robustness and the reliability of the classifier. Our algorithm saves resources by combining shorter rules to build a longer-rule, shows a rapid convergence toward its high-scoring rule list, and outperforms the current methods in run-time and classification accuracy.",
author = "Youngmi Yoon and Sangjay Bien and Sanghyun Park",
year = "2007",
month = "12",
day = "1",
doi = "10.1109/FBIT.2007.19",
language = "English",
isbn = "0769529992",
series = "Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007",
pages = "188--192",
booktitle = "Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007",

}

Yoon, Y, Bien, S & Park, S 2007, K-TSN(k-Top Scoring N): Microarray data classification based on rank-comparison decision rules. in Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007., 4524102, Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007, pp. 188-192, Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007, Jeju Island, Korea, Republic of, 07/10/11. https://doi.org/10.1109/FBIT.2007.19

K-TSN(k-Top Scoring N) : Microarray data classification based on rank-comparison decision rules. / Yoon, Youngmi; Bien, Sangjay; Park, Sanghyun.

Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007. 2007. p. 188-192 4524102 (Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - K-TSN(k-Top Scoring N)

T2 - Microarray data classification based on rank-comparison decision rules

AU - Yoon, Youngmi

AU - Bien, Sangjay

AU - Park, Sanghyun

PY - 2007/12/1

Y1 - 2007/12/1

N2 - Microarrays produce expression measurements for thousands of genes simultaneously, which is useful for the phenotype classification. We performed a direct integration of individual microarrays with same biological objectives by converting an expression value into a rank value within a sample and built a classifier based on rank comparison. Our classifier is an ensemble method, which has k top-scoring decision rules. Each rule contains a number of genes, a relationship between those genes, and a class label. Current classifiers fix the number of genes in each rule as a pair or a triple. In this paper, we generalized the number of genes involved in each rule. Generalizing the number of genes increases the robustness and the reliability of the classifier. Our algorithm saves resources by combining shorter rules to build a longer-rule, shows a rapid convergence toward its high-scoring rule list, and outperforms the current methods in run-time and classification accuracy.

AB - Microarrays produce expression measurements for thousands of genes simultaneously, which is useful for the phenotype classification. We performed a direct integration of individual microarrays with same biological objectives by converting an expression value into a rank value within a sample and built a classifier based on rank comparison. Our classifier is an ensemble method, which has k top-scoring decision rules. Each rule contains a number of genes, a relationship between those genes, and a class label. Current classifiers fix the number of genes in each rule as a pair or a triple. In this paper, we generalized the number of genes involved in each rule. Generalizing the number of genes increases the robustness and the reliability of the classifier. Our algorithm saves resources by combining shorter rules to build a longer-rule, shows a rapid convergence toward its high-scoring rule list, and outperforms the current methods in run-time and classification accuracy.

UR - http://www.scopus.com/inward/record.url?scp=49349092797&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=49349092797&partnerID=8YFLogxK

U2 - 10.1109/FBIT.2007.19

DO - 10.1109/FBIT.2007.19

M3 - Conference contribution

AN - SCOPUS:49349092797

SN - 0769529992

SN - 9780769529998

T3 - Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007

SP - 188

EP - 192

BT - Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007

ER -

Yoon Y, Bien S, Park S. K-TSN(k-Top Scoring N): Microarray data classification based on rank-comparison decision rules. In Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007. 2007. p. 188-192. 4524102. (Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007). https://doi.org/10.1109/FBIT.2007.19