Advertisement
Research Article| Volume 56, ISSUE 6, P559-564, June 2003

Application of negative binomial modeling for discrete outcomes

A case study in aging research
  • Amy L Byers
    Correspondence
    Corresponding author. Tel.: 203-764-9800; fax: 203-764-9831
    Affiliations
    Program on Aging, Department of Epidemiology and Public Health, Yale University School of Medicine, 1 Church Street 7th Floor, New Haven, CT 06510, USA
    Search for articles by this author
  • Heather Allore
    Affiliations
    Department of Internal Medicine, Yale University School of Medicine, New Haven, CT 06510, USA
    Search for articles by this author
  • Thomas M Gill
    Affiliations
    Department of Internal Medicine, Yale University School of Medicine, New Haven, CT 06510, USA
    Search for articles by this author
  • Peter N Peduzzi
    Affiliations
    Program on Aging, Department of Epidemiology and Public Health, Yale University School of Medicine, 1 Church Street 7th Floor, New Haven, CT 06510, USA

    Department of Internal Medicine, Yale University School of Medicine, New Haven, CT 06510, USA

    Cooperative Studies Program Coordinating Center, VA Connecticut Healthcare System, West Haven, CT 06516, USA
    Search for articles by this author

      Abstract

      We present a case study using the negative binomial regression model for discrete outcome data arising from a clinical trial designed to evaluate the effectiveness of a prehabilitation program in preventing functional decline among physically frail, community-living older persons. The primary outcome was a measure of disability at 7 months that had a range from 0 to 16 with a mean of 2.8 (variance of 16.4) and a median of 1. The data were right skewed with clumping at zero (i.e., 40% of subjects had no disability at 7 months). Because the variance was nearly 6 times greater than the mean, the negative binomial model provided an improved fit to the data and accounted better for overdispersion than the Poisson regression model, which assumes that the mean and variance are the same. Although correcting the variance and corresponding test statistics for overdispersion is a standard procedure in the Poisson model, the estimates of the regression parameters are inefficient because they have more sampling variability than is necessary. The negative binomial model provides an alternative approach for the analysis of discrete data where overdispersion is a problem, provided that the model is correctly specified and adequately fits the data.

      Keywords

      To read this article in full you will need to make a payment

      Purchase one-time access:

      Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online access
      One-time access price info
      • For academic or personal research use, select 'Academic and Personal'
      • For corporate R&D use, select 'Corporate R&D Professionals'

      Subscribe:

      Subscribe to Journal of Clinical Epidemiology
      Already a print subscriber? Claim online access
      Already an online subscriber? Sign in
      Institutional Access: Sign in to ScienceDirect

      References

        • Gill T.M
        • Desai M.M
        • Gahbauer E
        • et al.
        Restricted activity among community-living older persons: incidence, precipitants, and health care utilization.
        Ann Intern Med. 2001; 135: 313-321
        • Katz S
        • Ford A.B
        • Moskowitz R.W
        • et al.
        The Index of ADL: a standardized measure of biological and psychosocial function.
        JAMA. 1963; 185: 914-919
        • Gill T.M
        • Baker D.I
        • Gottschalk M
        • et al.
        A program to prevent functional decline in physically frail elderly persons who live at home.
        N Engl J Med. 2002; 347: 1068-1074
        • Gill T.M
        • Robinson J.T
        • Tinetti M.E
        Difficulty and dependence: two components of the disability continuum among community-living older persons.
        Ann Intern Med. 1998; 128: 96-101
        • Agresti A
        An introduction to categorical data analysis.
        John Wiley & Sons, New York1996
      1. McCullagh P Nelder J.A Generalized linear models. 2nd edition. Chapman and Hall, London1989
        • Berry D
        Logarithmic transformations in ANOVA.
        Biometrics. 1987; 43: 439-456
        • Chang B.H
        • Pocock S
        Analyzing data with clumping at zero: an example demonstration.
        Biometrics. 2000; 53: 1036-1043
        • Ridout M
        • Hinde J
        • Demetrio C.G
        A score test for testing a zero-inflated Poisson regression model against zero-inflated negative binomial alternatives.
        Biometrics. 2001; 57: 219-223
        • Allison P
        Logistic regression using the SAS system: theory and application.
        SAS Institute, Cary (NC)1999
        • Landefeld C.S
        • Palmer R.M
        • Kresevic D.M
        • et al.
        A randomized trial of care in a hospital medical unit especially designed to improve the functional outcomes of acutely ill older patients.
        N Engl J Med. 1995; 332: 1338-1344
        • Agresti A
        Categorical data analysis.
        John Wiley & Sons, New York1990
        • Welsh A.H
        • Cunningham R.B
        • Chambers R.L
        Methodology for estimating the abundance of rare animals: seabirds nesting on North East Herald Cay.
        Biometrics. 2000; 56: 22-30
        • Abdel-Aty M.A
        • Radwan A.E
        Modeling traffic accident occurrence and involvement.
        Accid Anal Prev. 2000; 32: 633-642
        • Schellhorn M
        • Stuck A.E
        • Minder C.E
        • et al.
        Health services utilization of elderly Swiss: evidence from panel data.
        Health Econ. 2000; 9: 533-545
        • Sormani M.P
        • Bruzzi P
        • Miller D.H
        • et al.
        Modelling MRI enhancing lesion counts in multiple sclerosis using a negative binomial model: implications for clinical trials.
        J Neurol Sci. 1999; 163: 74-80
        • Finch S.J
        • Chen J.B
        • Chen C.H
        • et al.
        Process control procedures to augment quality control of leukocyte-reduced red cell blood products.
        Stat Med. 1999; 18: 1279-1289
        • Gill T.M
        • McGloin J.M
        • Gahbauer E.A
        • et al.
        Two recruitment strategies for a clinical trial of physically frail community-living older persons.
        J Am Geriatr Soc. 2001; 49: 1039-1045
        • Gill T.M
        • Baker D.I
        • Gottschalk M
        • et al.
        A prehabilitation program for physically frail community-living older persons.
        Arch Phys Med Rehabil. 2003; 84: 394-404
      2. BestFit User's Guide. Palisade, Newfield (NY)1995
      3. Law A Kelton W.D Simulation modeling and analysis. 2nd edition. McGraw-Hill, New York1991
      4. SAS Software System. Version 8.1. SAS Institute, Cary (NC)2000
        • Ramsey F.L
        • Schafer D.W
        The statistical sleuth: a course in methods of data analysis.
        Duxbury Press, New York1997
        • Zeger S.C
        • Liang K.Y
        Longitudinal data analysis for discrete and continuous outcomes.
        Biometrics. 1986; 42: 121-130
        • Zeger S.C
        • Liang K.Y
        • Albert P.S
        Models for longitudinal data: a generalized estimating equation approach.
        Biometrics. 1988; 44: 1049-1060
        • Liang K.Y
        • Zeger S
        Regression analysis for correlated data.
        Annu Rev Public Health. 1993; 14: 43-68