Estimating redundancy information of selected features in multi-dimensional pattern classification

Chi Sang Jung, Hyunson Seo, Hong-Goo Kang

Research output: Contribution to journal › Article

14 Citations (Scopus)

Abstract

This paper proposes a novel criterion for estimating the redundancy information of selected feature sets in multi-dimensional pattern classification. An appropriate feature selection process typically maximizes the relevancy of features to each class and minimizes the redundancy among the selected features. Unlike relevancy information, which can be measured by mutual information, redundancy information is difficult to estimate because its dynamic range varies with the characteristics of the features and classes. Using a conceptual diagram of the relationship between candidate features, selected features, and class variables, this paper proposes a new criterion that accurately computes the amount of redundancy. Specifically, the redundancy term is estimated by the conditional mutual information between selected and candidate features with respect to each class variable, which avoids the cumbersome normalization process that the conventional algorithm requires. The proposed algorithm is implemented in a speech/music discrimination system to evaluate classification performance. Experimental results obtained by varying the number of selected features show that the proposed method achieves higher classification accuracy than conventional algorithms.
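The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact formula, so the scoring rule used here (relevance I(f; C) minus the mean of I(f; s | C) over already selected features s) is an assumption, as are the restriction to discrete features, the plug-in estimators, and all function and variable names.

```python
import math
from collections import Counter


def mutual_information(x, y):
    """Plug-in estimate of I(X;Y) in bits for two discrete sequences."""
    n = len(x)
    pxy = Counter(zip(x, y))
    px = Counter(x)
    py = Counter(y)
    return sum(
        (c / n) * math.log2(c * n / (px[a] * py[b]))
        for (a, b), c in pxy.items()
    )


def conditional_mutual_information(x, y, z):
    """Plug-in estimate of I(X;Y|Z) as the p(z)-weighted sum of I(X;Y|Z=z)."""
    n = len(z)
    total = 0.0
    for value, count in Counter(z).items():
        idx = [i for i in range(n) if z[i] == value]
        xs = [x[i] for i in idx]
        ys = [y[i] for i in idx]
        total += (count / n) * mutual_information(xs, ys)
    return total


def select_features(features, labels, k):
    """Greedy selection over a dict mapping feature name -> discrete values.

    Assumed score: relevance minus mean class-conditional redundancy.
    """
    selected = []
    candidates = list(features)
    while candidates and len(selected) < k:
        def score(name):
            relevance = mutual_information(features[name], labels)
            if not selected:
                return relevance
            redundancy = sum(
                conditional_mutual_information(features[name], features[s], labels)
                for s in selected
            ) / len(selected)
            return relevance - redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy data (hypothetical): "a_copy" duplicates "a"; "b" is noisy elsewhere.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = {
    "a": [0, 0, 0, 1, 1, 1, 1, 0],
    "a_copy": [0, 0, 0, 1, 1, 1, 1, 0],
    "b": [0, 0, 1, 0, 1, 1, 0, 1],
}
print(select_features(features, labels, 2))
```

In this toy case all three features are equally relevant to the class, so the second pick is decided by the redundancy term alone, and the duplicate "a_copy" is passed over in favor of "b". No normalization step appears in the score, which is the property the abstract highlights; whether the paper combines the terms in exactly this way cannot be confirmed from the abstract.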

Original language: English
Pages (from-to): 590-596
Number of pages: 7
Journal: Pattern Recognition Letters
Volume: 32
Issue number: 4
DOIs: 10.1016/j.patrec.2010.11.023
Publication status: Published - 2011 Mar 1

Fingerprint

Pattern recognition
Redundancy
Feature extraction

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Cite this

@article{736830a712c4477e835917df787715ee,
title = "Estimating redundancy information of selected features in multi-dimensional pattern classification",
author = "Jung, {Chi Sang} and Hyunson Seo and Hong-Goo Kang",
year = "2011",
month = "3",
day = "1",
doi = "10.1016/j.patrec.2010.11.023",
language = "English",
volume = "32",
pages = "590--596",
journal = "Pattern Recognition Letters",
issn = "0167-8655",
publisher = "Elsevier",
number = "4",

}

Estimating redundancy information of selected features in multi-dimensional pattern classification. / Jung, Chi Sang; Seo, Hyunson; Kang, Hong-Goo.

In: Pattern Recognition Letters, Vol. 32, No. 4, 01.03.2011, p. 590-596.

TY - JOUR

T1 - Estimating redundancy information of selected features in multi-dimensional pattern classification

AU - Jung, Chi Sang

AU - Seo, Hyunson

AU - Kang, Hong-Goo

PY - 2011/3/1

Y1 - 2011/3/1

UR - http://www.scopus.com/inward/record.url?scp=78650359969&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78650359969&partnerID=8YFLogxK

U2 - 10.1016/j.patrec.2010.11.023

DO - 10.1016/j.patrec.2010.11.023

M3 - Article

VL - 32

SP - 590

EP - 596

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

SN - 0167-8655

IS - 4

ER -