Bayesian model averaging approach in health effects studies

Sensitivity analyses using PM10 and cardiopulmonary hospital admissions in Allegheny County, Pennsylvania and simulated data

Ya Hsiu Chuang, Sati Mazumdar, Taeyoung Park, Gong Tang, Vincent C. Arena, Mark J. Nicolich

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Generalized Additive Models (GAMs) with natural cubic splines (NS) as smoothing functions have become a standard analytical tool in time series studies of health effects of air pollution. However, standard model selection procedures ignore the model uncertainty that may lead to biased estimates, in particular those of the lagged effects. We addressed this issue by Bayesian model averaging (BMA) approach which accounts for model uncertainty by combining information from all possible models where GAMs and NS were used. Firstly, we conducted a sensitivity analysis with simulation studies for Bayesian model averaging with different calibrated hyperparameters contained in the posterior model probabilities. Our results indicated the importance of selecting the optimum degree of lagging for variables, based not only on maximizing the likelihood, but also by considering the possible effects of concurvity, consistency of degree of lagging, and biological plausibility. This was illustrated by analyses of the Allegheny County Air Pollution Study (ACAPS) where the quantity of interest was the relative risk of cardiopulmonary hospital admissions for a 20 μg/m3 increase in PM10 values for the current day. Results showed that the posterior means of the relative risk and 95% posterior probability intervals were close to each other under different choices of the prior distributions. Simulation results were consistent with these findings. It was also found that using lag variables in the model when there is only same day effect, may underestimate the relative risk attributed to the same day effect.

Original languageEnglish
Pages (from-to)161-167
Number of pages7
JournalAtmospheric Pollution Research
Volume1
Issue number3
DOIs
Publication statusPublished - 2010 Jan 1

Fingerprint

Health
Air pollution
Splines
atmospheric pollution
health
effect
hospital
smoothing
Sensitivity analysis
simulation
sensitivity analysis
Time series
time series

All Science Journal Classification (ASJC) codes

  • Waste Management and Disposal
  • Pollution
  • Atmospheric Science

Cite this

@article{9a86a9789e2c4710b1a957e5fe275836,
title = "Bayesian model averaging approach in health effects studies: Sensitivity analyses using PM10 and cardiopulmonary hospital admissions in Allegheny County, Pennsylvania and simulated data",
abstract = "Generalized Additive Models (GAMs) with natural cubic splines (NS) as smoothing functions have become a standard analytical tool in time series studies of health effects of air pollution. However, standard model selection procedures ignore the model uncertainty that may lead to biased estimates, in particular those of the lagged effects. We addressed this issue by Bayesian model averaging (BMA) approach which accounts for model uncertainty by combining information from all possible models where GAMs and NS were used. Firstly, we conducted a sensitivity analysis with simulation studies for Bayesian model averaging with different calibrated hyperparameters contained in the posterior model probabilities. Our results indicated the importance of selecting the optimum degree of lagging for variables, based not only on maximizing the likelihood, but also by considering the possible effects of concurvity, consistency of degree of lagging, and biological plausibility. This was illustrated by analyses of the Allegheny County Air Pollution Study (ACAPS) where the quantity of interest was the relative risk of cardiopulmonary hospital admissions for a 20 μg/m3 increase in PM10 values for the current day. Results showed that the posterior means of the relative risk and 95{\%} posterior probability intervals were close to each other under different choices of the prior distributions. Simulation results were consistent with these findings. It was also found that using lag variables in the model when there is only same day effect, may underestimate the relative risk attributed to the same day effect.",
author = "Chuang, {Ya Hsiu} and Sati Mazumdar and Taeyoung Park and Gong Tang and Arena, {Vincent C.} and Nicolich, {Mark J.}",
year = "2010",
month = "1",
day = "1",
doi = "10.5094/APR.2010.021",
language = "English",
volume = "1",
pages = "161--167",
journal = "Atmospheric Pollution Research",
issn = "1309-1042",
publisher = "Turkish National Committee for Air Pollution Research (TUNCAP)",
number = "3",

}

Bayesian model averaging approach in health effects studies : Sensitivity analyses using PM10 and cardiopulmonary hospital admissions in Allegheny County, Pennsylvania and simulated data. / Chuang, Ya Hsiu; Mazumdar, Sati; Park, Taeyoung; Tang, Gong; Arena, Vincent C.; Nicolich, Mark J.

In: Atmospheric Pollution Research, Vol. 1, No. 3, 01.01.2010, p. 161-167.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Bayesian model averaging approach in health effects studies

T2 - Sensitivity analyses using PM10 and cardiopulmonary hospital admissions in Allegheny County, Pennsylvania and simulated data

AU - Chuang, Ya Hsiu

AU - Mazumdar, Sati

AU - Park, Taeyoung

AU - Tang, Gong

AU - Arena, Vincent C.

AU - Nicolich, Mark J.

PY - 2010/1/1

Y1 - 2010/1/1

N2 - Generalized Additive Models (GAMs) with natural cubic splines (NS) as smoothing functions have become a standard analytical tool in time series studies of health effects of air pollution. However, standard model selection procedures ignore the model uncertainty that may lead to biased estimates, in particular those of the lagged effects. We addressed this issue by Bayesian model averaging (BMA) approach which accounts for model uncertainty by combining information from all possible models where GAMs and NS were used. Firstly, we conducted a sensitivity analysis with simulation studies for Bayesian model averaging with different calibrated hyperparameters contained in the posterior model probabilities. Our results indicated the importance of selecting the optimum degree of lagging for variables, based not only on maximizing the likelihood, but also by considering the possible effects of concurvity, consistency of degree of lagging, and biological plausibility. This was illustrated by analyses of the Allegheny County Air Pollution Study (ACAPS) where the quantity of interest was the relative risk of cardiopulmonary hospital admissions for a 20 μg/m3 increase in PM10 values for the current day. Results showed that the posterior means of the relative risk and 95% posterior probability intervals were close to each other under different choices of the prior distributions. Simulation results were consistent with these findings. It was also found that using lag variables in the model when there is only same day effect, may underestimate the relative risk attributed to the same day effect.

AB - Generalized Additive Models (GAMs) with natural cubic splines (NS) as smoothing functions have become a standard analytical tool in time series studies of health effects of air pollution. However, standard model selection procedures ignore the model uncertainty that may lead to biased estimates, in particular those of the lagged effects. We addressed this issue by Bayesian model averaging (BMA) approach which accounts for model uncertainty by combining information from all possible models where GAMs and NS were used. Firstly, we conducted a sensitivity analysis with simulation studies for Bayesian model averaging with different calibrated hyperparameters contained in the posterior model probabilities. Our results indicated the importance of selecting the optimum degree of lagging for variables, based not only on maximizing the likelihood, but also by considering the possible effects of concurvity, consistency of degree of lagging, and biological plausibility. This was illustrated by analyses of the Allegheny County Air Pollution Study (ACAPS) where the quantity of interest was the relative risk of cardiopulmonary hospital admissions for a 20 μg/m3 increase in PM10 values for the current day. Results showed that the posterior means of the relative risk and 95% posterior probability intervals were close to each other under different choices of the prior distributions. Simulation results were consistent with these findings. It was also found that using lag variables in the model when there is only same day effect, may underestimate the relative risk attributed to the same day effect.

UR - http://www.scopus.com/inward/record.url?scp=84882747243&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84882747243&partnerID=8YFLogxK

U2 - 10.5094/APR.2010.021

DO - 10.5094/APR.2010.021

M3 - Article

VL - 1

SP - 161

EP - 167

JO - Atmospheric Pollution Research

JF - Atmospheric Pollution Research

SN - 1309-1042

IS - 3

ER -