TY - GEN
T1 - K-TSN(k-Top Scoring N)
T2 - Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007
AU - Yoon, Youngmi
AU - Bien, Sangjay
AU - Park, Sanghyun
PY - 2007
Y1 - 2007
N2 - Microarrays produce expression measurements for thousands of genes simultaneously, which is useful for the phenotype classification. We performed a direct integration of individual microarrays with same biological objectives by converting an expression value into a rank value within a sample and built a classifier based on rank comparison. Our classifier is an ensemble method, which has k top-scoring decision rules. Each rule contains a number of genes, a relationship between those genes, and a class label. Current classifiers fix the number of genes in each rule as a pair or a triple. In this paper, we generalized the number of genes involved in each rule. Generalizing the number of genes increases the robustness and the reliability of the classifier. Our algorithm saves resources by combining shorter rules to build a longer-rule, shows a rapid convergence toward its high-scoring rule list, and outperforms the current methods in run-time and classification accuracy.
AB - Microarrays produce expression measurements for thousands of genes simultaneously, which is useful for the phenotype classification. We performed a direct integration of individual microarrays with same biological objectives by converting an expression value into a rank value within a sample and built a classifier based on rank comparison. Our classifier is an ensemble method, which has k top-scoring decision rules. Each rule contains a number of genes, a relationship between those genes, and a class label. Current classifiers fix the number of genes in each rule as a pair or a triple. In this paper, we generalized the number of genes involved in each rule. Generalizing the number of genes increases the robustness and the reliability of the classifier. Our algorithm saves resources by combining shorter rules to build a longer-rule, shows a rapid convergence toward its high-scoring rule list, and outperforms the current methods in run-time and classification accuracy.
UR - http://www.scopus.com/inward/record.url?scp=49349092797&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=49349092797&partnerID=8YFLogxK
U2 - 10.1109/FBIT.2007.19
DO - 10.1109/FBIT.2007.19
M3 - Conference contribution
AN - SCOPUS:49349092797
SN - 0769529992
SN - 9780769529998
T3 - Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007
SP - 188
EP - 192
BT - Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies, FBIT 2007
Y2 - 11 October 2007 through 13 October 2007
ER -