Journal of Clinical Epidemiology
Volume 62, Issue 12 , Pages 1242-1247 , December 2009

The Bayesian interpretation of a P-value depends only weakly on statistical power in realistic situations

  • Richard Hooper

      Affiliations

    • Corresponding Author InformationCorresponding author. Respiratory Epidemiology & Public Health Group, Imperial College London, Emmanuel Kaye Building, Manresa Road, London SW3 6LR, UK. Tel.: +44-20-7352-8121 ext. 3502; fax: +44-20-7351-8322.

,Accepted 3 February 2009.

References 

  1. Wacholder S, Chanock S, Garcia-Closas M, El Ghormli L, Rothman N. Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J Natl Cancer Inst. 2004;96:434–442
  2. Wellcome Trust Case Control Consortium . Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447:661–678
  3. Sterne JAC, Davey Smith G. Sifting the evidence—what's wrong with significance tests?. BMJ. 2001;322:226–231
  4. Peto R, Pike MC, Armitage P, Breslow NE, Cox DR, Howard SV, et al. Design and analysis of randomized clinical trials requiring prolonged observation of each patient. I. Introduction and design. Br J Cancer. 1976;34:585–612
  5. Browner WS, Newman TB. Are all significant P-values created equal—the analogy between diagnostic-tests and clinical research. JAMA. 1987;257:2459–2463
  6. Diamond GA, Forrester JS. Clinical trials and statistical verdicts: probable grounds for appeal. Ann Intern Med. 1983;98:385–394
  7. Kirkwood BR, Sterne JAC. Essential medical statistics. Oxford, UK: Blackwell; 2003;
  8. Royall RM. The effect of sample size on the meaning of significance tests. Am Stat. 1986;40:313–315
  9. Senn SJ. Statistical issues in drug development. Chichester, UK: Wiley; 1997;
  10. Lindley DV. A statistical paradox. Biometrika. 1957;44:187–192
  11. Bartlett MS. A comment on D.V. Lindley's statistical paradox. Biometrika. 1957;44:533–534
  12. Wacholder S, Chanock S, Garcia-Closas M, Katki HA, El Ghormli L, Rothman N. Re: Assessing the probability that a positive report is false: an approach for molecular epidemiology studies. J Nat Cancer Inst. 2004;96:1722
  13. Thomas DC, Clayton DG. Betting odds and genetic associations. J Natl Cancer Inst. 2004;96:421–423
  14. Spiegelhalter DJ, Abrams KR, Myles JP. Bayesian approaches to clinical trials and health-care evaluation. Chichester, UK: Wiley; 2004;
  15. Goodman SN, Berlin JA. The use of predicted confidence-intervals when planning experiments and the misuse of power when interpreting results. Ann Intern Med. 1994;121:200–206
  16. Hoenig JM, Heisey DM. The abuse of power: the pervasive fallacy of power calculations for data analysis. Am Stat. 2001;55:19–24
  17. Neyman J, Pearson E. On the problem of the most efficient tests of statistical hypotheses. Philos Trans R Soc A. 1933;231:289–337
  18. Hacking I. Logic of statistical inference. Cambridge, UK: Cambridge University Press; 1965;
  19. Gardner MJ, Altman DG. Confidence intervals rather than P-values—estimation rather than hypothesis testing. BMJ. 1986;292:746–750
  20. Wilks SS. The large-sample distribution of the likelihood ratio for testing composite hypotheses. Ann Math Stat. 1938;9:60–62
  21. Hughes MD. Reporting Bayesian analyses of clinical-trials. Stat Med. 1993;12:1651–1663
  22. Wakefield J. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am J Hum Gen. 2007;81:208–227
  23. Wears RL, Lewis RJ. Statistical models and Occam's razor. Acad Emerg Med. 1999;6:93–94
  24. Jeffreys H. Theory of probability. Oxford, UK: Oxford University Press; 1961;
  25. Brophy JM, Joseph L. Placing trials in context using Bayesian analysis: GUSTO revisited by Reverend Bayes. JAMA. 1995;273:871–875
  26. Moher D, Dulberg CS, Wells GA. Statistical power, sample-size, and their reporting in randomized controlled trials. JAMA. 1994;272:122–124
  27. Altman DG. Statistics and ethics in medical research: III How large a sample?. BMJ. 1980;281:1336–1338
  28. Sellke T, Bayarri MJ, Berger JO. Calibration of P-values for testing precise null hypotheses. Am Stat. 2001;55:62–71
  29. Ioannidis JPA. Why most published research findings are false. PLoS Med. 2005;2:696–701

PII: S0895-4356(09)00059-6

doi: 10.1016/j.jclinepi.2009.02.004

Journal of Clinical Epidemiology
Volume 62, Issue 12 , Pages 1242-1247 , December 2009