Journal of Clinical Epidemiology
Volume 63, Issue 10 , Pages 1145-1155, October 2010

Logistic regression had superior performance compared with regression trees for predicting in-hospital mortality in patients hospitalized with heart failure

  • Peter C. Austin

      Affiliations

    • Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada
    • Department of Health Management, Policy and Evaluation, University of Toronto, Toronto, Ontario, Canada
    • Corresponding Author InformationCorresponding author. Institute for Clinical Evaluative Sciences, G1 06, 2075 Bayview Avenue, Toronto, Ontario M4N 3M5, Canada. Tel.: +416-480-6131; fax: +416-480-6048.
  • ,
  • Jack V. Tu

      Affiliations

    • Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada
    • Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
    • Department of Health Management, Policy and Evaluation, University of Toronto, Toronto, Ontario, Canada
    • Schulich Heart Centre, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
    • Department of Medicine, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
  • ,
  • Douglas S. Lee

      Affiliations

    • Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada
    • Department of Medicine, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
    • Division of Cardiology, University Health Network, Toronto, Ontario, Canada

Accepted 22 December 2009. published online 22 March 2010.

Abstract 

Objective

To compare the predictive accuracy of regression trees with that of logistic regression models for predicting in-hospital mortality in patients hospitalized with heart failure.

Study Design and Setting

Models were developed in 8,236 patients hospitalized with heart failure between April 1999 and March 2001. Models included the Enhanced Feedback for Effective Cardiac Treatment and Acute Decompensated Heart Failure National Registry (ADHERE) regression models and tree. Predictive accuracy was assessed using 7,608 patients hospitalized between April 2004 and March 2005.

Results

The area under the receiver operating characteristic curve for five different logistic regression models ranged from 0.747 to 0.775, whereas the corresponding values for three different regression trees ranged from 0.620 to 0.651. For the regression trees grown in 1,000 random samples drawn from the derivation sample, the number of terminal nodes ranged from 1 to 6, whereas the number of variables used in specific trees ranged from 0 to 5. Three different variables (blood urea nitrogen, dementia, and systolic blood pressure) were used for defining the first binary split when growing regression trees.

Conclusion

Logistic regression predicted in-hospital mortality in patients hospitalized with heart failure more accurately than did the regression trees. Regression trees grown in random samples from the same data set can differ substantially from one another.

Keywords: Logistic regression, Regression trees, Classification trees, Predictive model, Validation, Recursive partitioning, Congestive heart failure

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0895-4356(09)00389-8

doi:10.1016/j.jclinepi.2009.12.004

Journal of Clinical Epidemiology
Volume 63, Issue 10 , Pages 1145-1155, October 2010