Fast accelerated failure time modeling for case-cohort data

Sy Han Chiou, Sangwook Kang, Jun Yan

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

Semiparametric accelerated failure time (AFT) models directly relate the expected failure times to covariates and are a useful alternative to models that work on the hazard function or the survival function. For case-cohort data, much less development has been done with AFT models. In addition to the missing covariates outside of the sub-cohort in controls, challenges from AFT model inferences with full cohort are retained. The regression parameter estimator is hard to compute because the most widely used rank-based estimating equations are not smooth. Further, its variance depends on the unspecified error distribution, and most methods rely on computationally intensive bootstrap to estimate it. We propose fast rank-based inference procedures for AFT models, applying recent methodological advances to the context of case-cohort data. Parameters are estimated with an induced smoothing approach that smooths the estimating functions and facilitates the numerical solution. Variance estimators are obtained through efficient resampling methods for nonsmooth estimating functions that avoids full blown bootstrap. Simulation studies suggest that the recommended procedure provides fast and valid inferences among several competing procedures. Application to a tumor study demonstrates the utility of the proposed method in routine data analysis.

Original languageEnglish
Pages (from-to)559-568
Number of pages10
JournalStatistics and Computing
Volume24
Issue number4
DOIs
Publication statusPublished - 2014 Jul 1

Fingerprint

Accelerated Failure Time Model
Failure Time
Estimating Function
Modeling
Bootstrap
Missing Covariates
Resampling Methods
Nonsmooth Function
Hazard Function
Variance Estimator
Survival Function
Estimating Equation
Semiparametric Model
Covariates
Smoothing
Tumor
Data analysis
Regression
Numerical Solution
Simulation Study

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Statistics and Probability
  • Statistics, Probability and Uncertainty
  • Computational Theory and Mathematics

Cite this

Chiou, Sy Han ; Kang, Sangwook ; Yan, Jun. / Fast accelerated failure time modeling for case-cohort data. In: Statistics and Computing. 2014 ; Vol. 24, No. 4. pp. 559-568.
@article{0570617520ea46ecaa15f6809951f7c8,
title = "Fast accelerated failure time modeling for case-cohort data",
abstract = "Semiparametric accelerated failure time (AFT) models directly relate the expected failure times to covariates and are a useful alternative to models that work on the hazard function or the survival function. For case-cohort data, much less development has been done with AFT models. In addition to the missing covariates outside of the sub-cohort in controls, challenges from AFT model inferences with full cohort are retained. The regression parameter estimator is hard to compute because the most widely used rank-based estimating equations are not smooth. Further, its variance depends on the unspecified error distribution, and most methods rely on computationally intensive bootstrap to estimate it. We propose fast rank-based inference procedures for AFT models, applying recent methodological advances to the context of case-cohort data. Parameters are estimated with an induced smoothing approach that smooths the estimating functions and facilitates the numerical solution. Variance estimators are obtained through efficient resampling methods for nonsmooth estimating functions that avoids full blown bootstrap. Simulation studies suggest that the recommended procedure provides fast and valid inferences among several competing procedures. Application to a tumor study demonstrates the utility of the proposed method in routine data analysis.",
author = "Chiou, {Sy Han} and Sangwook Kang and Jun Yan",
year = "2014",
month = "7",
day = "1",
doi = "10.1007/s11222-013-9388-2",
language = "English",
volume = "24",
pages = "559--568",
journal = "Statistics and Computing",
issn = "0960-3174",
publisher = "Springer Netherlands",
number = "4",

}

Fast accelerated failure time modeling for case-cohort data. / Chiou, Sy Han; Kang, Sangwook; Yan, Jun.

In: Statistics and Computing, Vol. 24, No. 4, 01.07.2014, p. 559-568.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Fast accelerated failure time modeling for case-cohort data

AU - Chiou, Sy Han

AU - Kang, Sangwook

AU - Yan, Jun

PY - 2014/7/1

Y1 - 2014/7/1

N2 - Semiparametric accelerated failure time (AFT) models directly relate the expected failure times to covariates and are a useful alternative to models that work on the hazard function or the survival function. For case-cohort data, much less development has been done with AFT models. In addition to the missing covariates outside of the sub-cohort in controls, challenges from AFT model inferences with full cohort are retained. The regression parameter estimator is hard to compute because the most widely used rank-based estimating equations are not smooth. Further, its variance depends on the unspecified error distribution, and most methods rely on computationally intensive bootstrap to estimate it. We propose fast rank-based inference procedures for AFT models, applying recent methodological advances to the context of case-cohort data. Parameters are estimated with an induced smoothing approach that smooths the estimating functions and facilitates the numerical solution. Variance estimators are obtained through efficient resampling methods for nonsmooth estimating functions that avoids full blown bootstrap. Simulation studies suggest that the recommended procedure provides fast and valid inferences among several competing procedures. Application to a tumor study demonstrates the utility of the proposed method in routine data analysis.

AB - Semiparametric accelerated failure time (AFT) models directly relate the expected failure times to covariates and are a useful alternative to models that work on the hazard function or the survival function. For case-cohort data, much less development has been done with AFT models. In addition to the missing covariates outside of the sub-cohort in controls, challenges from AFT model inferences with full cohort are retained. The regression parameter estimator is hard to compute because the most widely used rank-based estimating equations are not smooth. Further, its variance depends on the unspecified error distribution, and most methods rely on computationally intensive bootstrap to estimate it. We propose fast rank-based inference procedures for AFT models, applying recent methodological advances to the context of case-cohort data. Parameters are estimated with an induced smoothing approach that smooths the estimating functions and facilitates the numerical solution. Variance estimators are obtained through efficient resampling methods for nonsmooth estimating functions that avoids full blown bootstrap. Simulation studies suggest that the recommended procedure provides fast and valid inferences among several competing procedures. Application to a tumor study demonstrates the utility of the proposed method in routine data analysis.

UR - http://www.scopus.com/inward/record.url?scp=84910653306&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84910653306&partnerID=8YFLogxK

U2 - 10.1007/s11222-013-9388-2

DO - 10.1007/s11222-013-9388-2

M3 - Article

VL - 24

SP - 559

EP - 568

JO - Statistics and Computing

JF - Statistics and Computing

SN - 0960-3174

IS - 4

ER -