The Mehta et al.[
[1]
] study conducted an empirical evaluation of different methods of weighting the 17 chronic conditions in the original 1987 Charlson [[2]
] comorbidity index. In summarizing the original study, the authors missed that the validation used 10-year mortality rates in the breast cancer population rather than one-year mortality rates in hospitalized patients. This is a critical difference, since any estimate of predictive power in one population will deteriorate in a subsequent population, particularly if it is overfit. We have always believed that the remarkable robustness of the Charlson comorbidity index (CCI) was that it was developed and validated in very different populations over different follow up intervals.Mehta et al. did all the calculations in a single population. Claims from one year were used to calculate the comorbidity score, and claims for the second year were used to assess mortality. They did not use either split sample techniques or validation samples so it is a given that their estimate of fit overestimate the explained variance for their derived “new” indices. Also of note, the Mehta et al. population is over 75 years of age, where the drivers of mortality may differ from a general population.
A primary motivation for the Mehta et al. study was that in the calculation of the CCI score an additive approach is being applied instead of a multiplicative one, they infer that incorrect mathematics is being used. If one casts the construction of the risk measure in terms of utility theory from discrete choice economics [
[3]
], it turns out that the sum of HRs is the mathematically correct method for assessing an individual's specific unobserved comorbidity. Specifically, let β = (β1, …, βM) be a vector of length M which represents the components of comorbidity associated with each of M diseases. Suppose βj, the comorbidity (ie regression coefficient) contribution associated with disease j has an additional unobservable individual specific random component ej. If these random components are independent and identically distributed across j and follow the type-I extreme value distribution, these random error account for individual specific unmeasured contributions to morbidity. Then it can be shown that the expected value of the Max {β1+e1, β2+e2, …, βM+eM} = log{exp(β1) + exp(β2) + …+ exp(βM)}. The exp(βj) are exactly the HRs if the βj's are regression coefficients. Since the log is monotone, it is equivalent to use the sum of the HRs (= CCI) the risk measure. The expected value of Max {β1+e1, β2+e2, …, βM+eM} can thought of as an expected worse case comorbidity value for an individual (as measured by regression coefficients) that accounts for specific unobserved heterogeneity. The use of the type-I extreme value distribution is standard in discrete choice economics. Consequently, there is a well established theoretical framework that suggests that the mathematics that underlies the CCI is well-grounded.Mehta et al. contrasts the uses of beta coefficients (original, rounded, multiplied by 10, increased by 1 for each 0.3 increased in β (Schneewiess) or divided by the smallest regression coefficient). They also contrast he use of hazard/odds ratio, using two different weighting system. The Schneewiess weighting system is based on a one-unit weight increase for a 35% increase in the HR, so it can also be considered a HR scoring system.
In Mehta et al.'s Table 4 the AIC and Adjusted R2 are remarkably similar across all of the measure within the CPRD population, both likely overestimate the predictive validity in a different population due to overfitting. The existing CCS measures listed in Table 4 also have strikingly similar performance. It is hard to conclude from Table 4 that the use of the regression coefficient based scoring system performs meaningfully better than a risk ratio based scoring system in predicting mortality. The simplicity of calculating the Charlson comorbidity index and its interpretability has likely propelled its widespread use.
References
- Regression Coefficient Based Scoring System Should be Used to Assign Weights to the Risk Index.J Clin Epidemiol. 2016; 72
- A new method of classifying prognostic comorbidity in longitudinal studies: development and validation.J Chronic Dis. 1987; 40: 373-383
- Discrete choice models with multiple unobserved choice characteristics.Int Econ Rev. 2007; 48: 1159-1192
Article info
Identification
Copyright
© 2016 Elsevier Inc. All rights reserved.
ScienceDirect
Access this article on ScienceDirectLinked Article
- Regression coefficient–based scoring system should be used to assign weights to the risk indexJournal of Clinical EpidemiologyVol. 79
- PreviewSome previously developed risk scores contained a mathematical error in their construction: risk ratios were added to derive weights to construct a summary risk score. This study demonstrates the mathematical error and derived different versions of the Charlson comorbidity score (CCS) using regression coefficient–based and risk ratio–based scoring systems to further demonstrate the effects of incorrect weighting on performance in predicting mortality.
- Full-Text
- Preview
- Editorial comment: Ratios should be multiplied, not addedJournal of Clinical EpidemiologyVol. 79
- PreviewIt is a good thing that widely used instruments, including those whose validations have been published in peer reviewed journals, are regularly evaluated, as this ensures further development and improvement. At the same time, this can be challenging both for developers and users if an instrument as designed has been working already for a long time.
- Full-Text
- Preview