Journal of Clinical Epidemiology
Volume 57, Issue 11 , Pages 1138-1146 , November 2004

Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality

  • Peter C. Austin

      Affiliations

    • Institute for Clinical Evaluative Sciences, G1 06, 2075 Bayview Avenue, Toronto, Ontario M4N 3M5, Canada
    • Department of Public Health Sciences, University of Toronto, McMurrich Bldg, 4th Floor, 12 Queen's Park Crescent West, Toronto, Ontario, M5S 1A8 Canada
    • Department of Health Policy, Management and Evaluation, University of Toronto, Toronto, McMurrich Bldg, 2nd Floor, 12 Queen's Park Crescent West, Ontario, M5S 1A8 Canada
    • Corresponding Author InformationCorresponding author. Tel.: 416-480-6131; fax: 416-480-6048.
  • ,
  • Jack V. Tu

      Affiliations

    • Institute for Clinical Evaluative Sciences, G1 06, 2075 Bayview Avenue, Toronto, Ontario M4N 3M5, Canada
    • Department of Public Health Sciences, University of Toronto, McMurrich Bldg, 4th Floor, 12 Queen's Park Crescent West, Toronto, Ontario, M5S 1A8 Canada
    • Department of Health Policy, Management and Evaluation, University of Toronto, Toronto, McMurrich Bldg, 2nd Floor, 12 Queen's Park Crescent West, Ontario, M5S 1A8 Canada
    • Clinical Epidemiology and Health Care Research Program, Sunnybrook & Women's College Health Science Centre, 2075 Bayview Ave, Toronto, Ontario, M4N 3M5 Canada
    • Division of General Internal Medicine, Sunnybrook & Women's College Health Sciences Centre, 2075 Bayview Ave, Toronto, Ontario, M4N 3M5 Canada

,Accepted 14 April 2004.

References 

  1. Krumholz HM, Chen J, Wang Y, Radford MJ, Chen YT, Marciniak TA. Comparing AMI mortality among hospitals in patients 65 years and older: evaluating methods of risk adjustment. Circulation. 1999;99:2986–2992
  2. Murtaugh PA. Methods of variable selection in regression modeling. Commun Stat Simulation Computation. 1998;27:711–734
  3. Wears RL, Lewis RJ. Statistical models and Occam's razor. Acad Emerg Med. 1999;6:93–94
  4. The Multicenter Postinfarction Research Group . Risk stratification and survival after myocardial infarction. N Engl J Med. 1983;309:331–336
  5. Suarez C, Herrera M, Vera A, Torrado E, Ferriz J, Arboleda JA. Prediction on admission of in-hospital mortality in patients older than 70 years with acute myocardial infarction. Chest. 1995;108:83–88
  6. Henning R, Wedel H, Wilhelmsen L. Mortality risk estimation by multivariate statistical analyses. Acta Med Scand. 1975;586(Suppl):14–31
  7. Dubois C, Pierard LA, Albert A, Smeets J, Demoulin J, Boland J, et al. Short-term risk stratification at admission based on simple clinical data in acute myocardial infarction. Am J Cardiol. 1988;61:216–219
  8. Miller AJ. Selection of subsets of regression variables. J R Stat Soc [Ser A]. 1984;147:389–425
  9. Miller A. Subset selection in regression. 2nd edition. Boca Raton (FL): Chapman & Hall/CRC; 2002;
  10. Hocking RR. The analysis and selection of variables in linear regression. Biometrics. 1976;32:1–49
  11. Flack VF, Chang PC. Frequency of selecting noise variables in subset regression analysis: a simulation study. Am Stat. 1987;14:84–86
  12. Copas JB, Long T. Estimating the residual variance in orthogonal regression with variable selection. Statistician. 1991;40:51–59
  13. Harrell FE. Regression modelling strategies: with applications to linear models, logistic regression, and survival analyses. New York: Springer-Verlag; 2001;
  14. Derkson S, Keselman HJ. Backward, forward and stepwise automated subset selection algorithms: frequency of obtaining authentic and noise variables. Br J Math Stat Psychol. 1992;45:265–282
  15. Tu JV, Naylor CD, Austin P. Temporal changes in the outcomes of acute myocardial infarction in Ontario, 1992–1996. Can Med Assoc J. 1999;161:1257–1261
  16. Tu JV, Austin P, Naylor CD, Iron K, Zhang H. Acute myocardial infarction outcomes in Ontario. In:  Naylor CD,  Slaughter PM editor. Cardiovascular health services in Ontario: an ICES atlas. Toronto, Canada: Institute for Clinical Evaluative Sciences; 1999;p. 83–110
  17. Hosmer DW, Lemeshow S. Applied logistic regression. New York: John Wiley & Sons; 1989;
  18. Efron B, Tibshirani R. An introduction to the bootstrap. London: Chapman & Hall; 1993;
  19. Henderson HV, Velleman RF. Building multiple regression models interactively. Biometrics. 1981;37:391–411
  20. Rothman KJ, Greenland S. Modern epidemiology. 2nd edition. Philadelphia: Lippincott Williams & Wilkins; 1998;
  21. Picard RR, Berk KN. Data splitting. Am Stat. 1990;4:140–147
  22. Austin PC, Tu JV. Bootstrap methods for developing predictive models. Am Stat. 2004;58:131–137
  23. Altman DG, Andersen PK. Bootstrap investigation of the stability of a Cox regression model. Stat Med. 1989;8:771–783

PII: S0895-4356(04)00111-8

doi: 10.1016/j.jclinepi.2004.04.003

Journal of Clinical Epidemiology
Volume 57, Issue 11 , Pages 1138-1146 , November 2004