TY - JOUR
T1 - Data mining approach to policy analysis in a health insurance domain
AU - Chae, Young Moon
AU - Ho, Seung Hee
AU - Cho, Kyoung Won
AU - Lee, Dong Ha
AU - Ji, Sun Ha
PY - 2001/7
Y1 - 2001/7
N2 - This study examined the characteristics of the knowledge discovery and data mining algorithms to demonstrate how they can be used to predict health outcomes and provide policy information for hypertension management using the Korea Medical Insurance Corporation database. Specifically, this study validated the predictive power of data mining algorithms by comparing the performance of logistic regression and two decision tree algorithms, CHIAD (Chi-squared Automatic Interaction Detection) and C5.0 (a variant of C4.5) using the test set of 4588 beneficiaries and the training set of 13,689 beneficiaries. Contrary to the previous study, the CHIAD algorithm performed better than the logistic regression in predicting hypertension, and C5.0 had the lowest predictive power. In addition, the CHIAD algorithm and the association rule also provided the segment-specific information for the risk factors and target group that may be used in a policy analysis for hypertension management.
AB - This study examined the characteristics of the knowledge discovery and data mining algorithms to demonstrate how they can be used to predict health outcomes and provide policy information for hypertension management using the Korea Medical Insurance Corporation database. Specifically, this study validated the predictive power of data mining algorithms by comparing the performance of logistic regression and two decision tree algorithms, CHIAD (Chi-squared Automatic Interaction Detection) and C5.0 (a variant of C4.5) using the test set of 4588 beneficiaries and the training set of 13,689 beneficiaries. Contrary to the previous study, the CHIAD algorithm performed better than the logistic regression in predicting hypertension, and C5.0 had the lowest predictive power. In addition, the CHIAD algorithm and the association rule also provided the segment-specific information for the risk factors and target group that may be used in a policy analysis for hypertension management.
UR - http://www.scopus.com/inward/record.url?scp=0034894781&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0034894781&partnerID=8YFLogxK
U2 - 10.1016/S1386-5056(01)00154-X
DO - 10.1016/S1386-5056(01)00154-X
M3 - Article
C2 - 11470613
AN - SCOPUS:0034894781
SN - 1386-5056
VL - 62
SP - 103
EP - 111
JO - International Journal of Medical Informatics
JF - International Journal of Medical Informatics
IS - 2-3
ER -