Advertisement
Journal Home
Search for

Volume 56, Issue 1, Pages 28-37 (January 2003)


View previous. 4 of 14 View next.

Developing a prognostic model in the presence of missing data: an ovarian cancer case study

Taane G. ClarkCorresponding Author Informationemail address, Douglas G. Altman

Received 8 October 2001; received in revised form 15 February 2002; accepted 19 July 2002.

Abstract 

When developing prognostic models in medicine, covariate data are often missing and the standard response is to exclude those individuals whose data are incomplete from the analyses. This practice leads to a reduction in the statistical power, and may lead to biased results. We wished to develop a prognostic model for overall survival from 1,189 primary cases (842 deaths) of epithelial ovarian cancer. A complete case analysis restricted the sample size to 518 (380 deaths). After applying a multiple imputation (MI) framework we included three real values for each one imputed, and constructed a model composed of more statistically significant prognostic factors and with increased predictive ability. Missing values can be imputed in cases where the reason for the data being missing is known, particularly where it can be explained by available data. This will increase the power of an analysis and may produce models that are more statistically reliable and applicable within clinical practice.

Centre for Statistics in Medicine, Institute of Health Sciences, University of Oxford, Old Road, Oxford OX3 7LF, United Kingdom

Corresponding Author InformationCorresponding author. Tel: +44 (0)1865 226874; fax: +44 (0) 1865 226962.

PII: S0895-4356(02)00539-5


View previous. 4 of 14 View next.

Advertisement