An extension of regression trees to generate better predictive models

Hyunjoong Kim, Frank M. Guess, Timothy M. Young

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

For situations where the data are drawn from reasonably homogeneous populations, traditional methods such as multiple regression typically yield insightful analyses. For situations where the data are drawn from more heterogeneous populations, decision tree approaches, such as Classification and Regression Trees (CART) and Generalized, Unbiased, Interaction, Detection, and Estimation (GUIDE), are more likely to recognize idiosyncratic subpopulations and interactions automatically. In contrast to CART, however, GUIDE yields models with better predictive performance for each subpopulation. This article extends the idea of GUIDE to handle analysis of covariance-type problems. This article compares GUIDE modeling to various decision tree methods and to multiple regression. The article identifies and discusses the relative advantages and disadvantages of multiple regression, CART, and GUIDE. GUIDE produces quality or reliability models that exhibit greater predictive accuracy than multiple regression or CART for complex, highly diverse populations. Also, GUIDE is readily applicable to many other areas, such as repairability and maintainability settings involving both qualitative and quantitative variables. A small case study of an engineered wood product, medium-density fiberboard, is presented to illustrate the application of GUIDE. Accepted in 2005 for a special issue on Reliability co-edited by Hoang Pham, Rutgers University; Dong Ho Park, Hallym University, Korea; and Richard Cassady, University of Arkansas.

Original languageEnglish
Pages (from-to)43-54
Number of pages12
JournalIIE Transactions (Institute of Industrial Engineers)
Volume44
Issue number1
DOIs
Publication statusPublished - 2012 Jan 1

Fingerprint

Decision trees
Wood products
Maintainability

All Science Journal Classification (ASJC) codes

  • Industrial and Manufacturing Engineering

Cite this

@article{10499616c5424ba1986b1a1b26eb2be0,
title = "An extension of regression trees to generate better predictive models",
abstract = "For situations where the data are drawn from reasonably homogeneous populations, traditional methods such as multiple regression typically yield insightful analyses. For situations where the data are drawn from more heterogeneous populations, decision tree approaches, such as Classification and Regression Trees (CART) and Generalized, Unbiased, Interaction, Detection, and Estimation (GUIDE), are more likely to recognize idiosyncratic subpopulations and interactions automatically. In contrast to CART, however, GUIDE yields models with better predictive performance for each subpopulation. This article extends the idea of GUIDE to handle analysis of covariance-type problems. This article compares GUIDE modeling to various decision tree methods and to multiple regression. The article identifies and discusses the relative advantages and disadvantages of multiple regression, CART, and GUIDE. GUIDE produces quality or reliability models that exhibit greater predictive accuracy than multiple regression or CART for complex, highly diverse populations. Also, GUIDE is readily applicable to many other areas, such as repairability and maintainability settings involving both qualitative and quantitative variables. A small case study of an engineered wood product, medium-density fiberboard, is presented to illustrate the application of GUIDE. Accepted in 2005 for a special issue on Reliability co-edited by Hoang Pham, Rutgers University; Dong Ho Park, Hallym University, Korea; and Richard Cassady, University of Arkansas.",
author = "Hyunjoong Kim and Guess, {Frank M.} and Young, {Timothy M.}",
year = "2012",
month = "1",
day = "1",
doi = "10.1080/0740817X.2011.590441",
language = "English",
volume = "44",
pages = "43--54",
journal = "IISE Transactions",
issn = "2472-5854",
publisher = "Taylor and Francis Ltd.",
number = "1",

}

An extension of regression trees to generate better predictive models. / Kim, Hyunjoong; Guess, Frank M.; Young, Timothy M.

In: IIE Transactions (Institute of Industrial Engineers), Vol. 44, No. 1, 01.01.2012, p. 43-54.

Research output: Contribution to journalArticle

TY - JOUR

T1 - An extension of regression trees to generate better predictive models

AU - Kim, Hyunjoong

AU - Guess, Frank M.

AU - Young, Timothy M.

PY - 2012/1/1

Y1 - 2012/1/1

N2 - For situations where the data are drawn from reasonably homogeneous populations, traditional methods such as multiple regression typically yield insightful analyses. For situations where the data are drawn from more heterogeneous populations, decision tree approaches, such as Classification and Regression Trees (CART) and Generalized, Unbiased, Interaction, Detection, and Estimation (GUIDE), are more likely to recognize idiosyncratic subpopulations and interactions automatically. In contrast to CART, however, GUIDE yields models with better predictive performance for each subpopulation. This article extends the idea of GUIDE to handle analysis of covariance-type problems. This article compares GUIDE modeling to various decision tree methods and to multiple regression. The article identifies and discusses the relative advantages and disadvantages of multiple regression, CART, and GUIDE. GUIDE produces quality or reliability models that exhibit greater predictive accuracy than multiple regression or CART for complex, highly diverse populations. Also, GUIDE is readily applicable to many other areas, such as repairability and maintainability settings involving both qualitative and quantitative variables. A small case study of an engineered wood product, medium-density fiberboard, is presented to illustrate the application of GUIDE. Accepted in 2005 for a special issue on Reliability co-edited by Hoang Pham, Rutgers University; Dong Ho Park, Hallym University, Korea; and Richard Cassady, University of Arkansas.

AB - For situations where the data are drawn from reasonably homogeneous populations, traditional methods such as multiple regression typically yield insightful analyses. For situations where the data are drawn from more heterogeneous populations, decision tree approaches, such as Classification and Regression Trees (CART) and Generalized, Unbiased, Interaction, Detection, and Estimation (GUIDE), are more likely to recognize idiosyncratic subpopulations and interactions automatically. In contrast to CART, however, GUIDE yields models with better predictive performance for each subpopulation. This article extends the idea of GUIDE to handle analysis of covariance-type problems. This article compares GUIDE modeling to various decision tree methods and to multiple regression. The article identifies and discusses the relative advantages and disadvantages of multiple regression, CART, and GUIDE. GUIDE produces quality or reliability models that exhibit greater predictive accuracy than multiple regression or CART for complex, highly diverse populations. Also, GUIDE is readily applicable to many other areas, such as repairability and maintainability settings involving both qualitative and quantitative variables. A small case study of an engineered wood product, medium-density fiberboard, is presented to illustrate the application of GUIDE. Accepted in 2005 for a special issue on Reliability co-edited by Hoang Pham, Rutgers University; Dong Ho Park, Hallym University, Korea; and Richard Cassady, University of Arkansas.

UR - http://www.scopus.com/inward/record.url?scp=84863016468&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84863016468&partnerID=8YFLogxK

U2 - 10.1080/0740817X.2011.590441

DO - 10.1080/0740817X.2011.590441

M3 - Article

VL - 44

SP - 43

EP - 54

JO - IISE Transactions

JF - IISE Transactions

SN - 2472-5854

IS - 1

ER -