Journal of Clinical Epidemiology
Volume 57, Issue 12 , Pages 1262-1270, December 2004

Penalized maximum likelihood estimation to directly adjust diagnostic and prognostic prediction models for overoptimism: a clinical example

  • K.G.M. Moons

      Affiliations

    • Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, P.O. Box 80035, 3508 TA Utrecht, The Netherlands
    • Corresponding Author InformationCorresponding author. Tel.: +31 30 2509368; fax: +31 30 2505480.
  • ,
  • A. Rogier T. Donders

      Affiliations

    • Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, P.O. Box 80035, 3508 TA Utrecht, The Netherlands
    • Center for Biostatistics, Utrecht University, Utrecht, The Netherlands
  • ,
  • E.W. Steyerberg

      Affiliations

    • Department of Public Health, Erasmus Medical Center Rotterdam, Rotterdam, The Netherlands
  • ,
  • F.E. Harrell

      Affiliations

    • Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA

Accepted 20 January 2004.

Abstract 

Background and objective

There is growing interest in developing prediction models. The accuracy of such models when applied in new patient samples is commonly lower than estimated from the development sample. This may be because of differences between the samples and/or because the developed model was overfitted (too optimistic). Various methods, including bootstrapping techniques exist for afterwards shrinking the regression coefficients and the model's discrimination and calibration for overoptimism. Penalized maximum likelihood estimation (PMLE) is a more rigorous method because adjustment for overfitting is directly built into the model development, instead of relying on shrinkage afterwards. PMLE has been described mainly in the statistical literature and is rarely applied to empirical data. Using empirical data, we illustrate the use of PMLE to develop a prediction model.

Methods

The accuracy of the final PMLE model will be contrasted with the final models derived by ordinary stepwise logistic regression without and with shrinkage afterwards. The potential advantages and disadvantages of PMLE over the other two strategies are discussed.

Results

PMLE leads to smaller prediction errors, provides for model reduction to a user-defined degree, and may differently shrink each predictor for overoptimism without sacrificing much discriminative accuracy of the model.

Conclusion

PMLE is an easily applicable and promising method to directly adjust clinical prediction models for overoptimism.

Keywords: Prediction research, Overoptimism, Overfitting, Penalization, Bootstrapping, Shrinkage

To access this article, please choose from the options below

Login to an existing account or Register a new account.

  • Purchase this article for 31.50 USD (You must login/register to purchase this article)

    Online access for 24 hours. The PDF version can be downloaded as your permanent record.

  • Subscribe to this title

    Get unlimited online access to this article and all other articles in this title 24/7 for one year.

  • Claim access now

    For current subscribers with Society Membership or Account Number.

  • Visit SciVerse ScienceDirect to see if you have access via your institution.
 

PII: S0895-4356(04)00171-4

doi:10.1016/j.jclinepi.2004.01.020

Journal of Clinical Epidemiology
Volume 57, Issue 12 , Pages 1262-1270, December 2004