This paper is only available as a PDF. To read, Please Download here.
Abstract
The analytical effect of the number of events per variable (EPV) in a proportional
hazards regression analysis was evaluated using Monte Carlo simulation techniques
for data from a randomized trial containing 673 patients and 252 deaths, in which
seven predictor variables had an original significance level of p < 0.10. The 252 deaths and 7 variables correspond to 36 events per variable analyzed
in the full data set.
Five hundred simulated analyses were conducted for these seven variables at EPVs of
2, 5, 10, 15, 20, and 25. For each simulation, a random exponential survival time
was generated for each of the 673 patients, and the simulated results were compared
with their original counterparts. As EPV decreased, the regression coefficients became
more biased relative to the true value; the 90% confidence limits about the simulated
values did not have a coverage of 90% for the original value; large sample properties
did not hold for variance estimates from the proportional hazards model, and the Z statistics used to test the significance of the regression coefficients lost validity
under the null hypothesis.
Although a single boundary level for avoiding problems is not easy to choose, the
value of EPV = 10 seems most prudent. Below this value for EPV, the results of proportional
hazards regression analyses should be interpreted with caution because the statistical
model may not be valid.
To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Journal of Clinical EpidemiologyAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
References
- The risk of determining risk with multivariable models.Ann Intern Med. 1993; 118: 201-210
- Regression models for prognostic prediction: Advantages, problems, and suggested solutions.Cancer Treat Rep. 1985; 69: 1071-1077
- EDF statistics for goodness of fit and some comparisons.J Am Stat Assoc. 1974; 69: 730-737
- A large sample study of Cox's regression model.Ann Stat. 1981; 9: 93-108
Article info
Publication history
Received in revised form:
February 8,
1995
Identification
Copyright
© 1995 Published by Elsevier Inc.
User license
Creative Commons Attribution – NonCommercial – NoDerivs (CC BY-NC-ND 4.0) | How you can reuse
Elsevier's open access license policy

Creative Commons Attribution – NonCommercial – NoDerivs (CC BY-NC-ND 4.0)
Permitted
For non-commercial purposes:
- Read, print & download
- Redistribute or republish the final article
- Text & data mine
- Translate the article (private use only, not for distribution)
- Reuse portions or extracts from the article in other works
Not Permitted
- Sell or re-use for commercial purposes
- Distribute translations or adaptations of the article
Elsevier's open access license policy