Journal of Clinical Epidemiology
Volume 52, Issue 3 , Pages 229-235 , March 1999

Increasing Physicians’ Awareness of the Impact of Statistics on Research Outcomes: Comparative Power of the t-test and Wilcoxon Rank-Sum Test in Small Samples Applied Research

  • Patrick D Bridge

      Affiliations

    • Department of Family Medicine, Wayne State University School of Medicine, Detroit, MI USA
    • Corresponding Author InformationAddress for correspondence: Patrick D. Bridge, University Health Center, 4201 St. Antoine, Room 4J, Wayne State University, Detroit, MI 48201
  • ,
  • Shlomo S Sawilowsky

      Affiliations

    • Department of Theoretical and Behavioral Foundations, College of Education, Wayne State University, Detroit, MI USA

,Accepted 6 November 1998.

References 

  1. Altman DG, Goodman SN. Transfer of technology from statistical journals to the biomedical literature. JAMA. 1994;272(2):129–132
  2. Altman DG. Statistics in medical journals. Stat Med. 1982;1:59–71
  3. Altman DG. The scandal of poor medical research. BMJ. 1994;308:283–284
  4. Altman DG. Statistics in medical journals (Developments in the 1980’s). Stat Med. 1991;10:1897–1913
  5. Avram MJ, Shanks CA. Statistical methods in anesthesia articles (An evaluation of two American journals during two six-month periods). Anesth Analg. 1985;64:607–611
  6. Glantz SA. Biostatistics (How to detect, correct and prevent errors in the medical literature). Circulation. 1980;61(1):1–7
  7. Lent V, Langenbach A. A retrospective quality analysis of 102 randomized trials in four leading urological journals form 1984–1989. Urol Res. 1996;24(2):119–122
  8. Mosteller JP, Gilbert JP. Reporting standards and research strategies for controlled trials. Control Clin Trials. 1980;1:37–58
  9. Mulrow CD. The medical review article (State of the science). Ann Intern Med. 1987;106:485–488
  10. Pocock PJ, Huges MD, Lee RJ. Statistical problems in the reporting of clinical trials. A survey of three medical journals. N Engl J Med. 1987;317(7):426–487
  11. Silagy CA, Jewell D, Mant D. An analysis of randomized controlled trials published in the US family medicine literature, 1987–1991. J Fam Pract. 1994;39(3):236–242
  12. Williams HC, Seed P. Inadequate size of “negative” clinical trials in dermatology. Br J Dermatol. 1993;128:317–326
  13. Emerson JD, Colditz GA. Use of statistical analysis in the New England Journal of Medicine. N Engl J Med. 1983;309(12):709–713
  14. Fromm BS, Snyder VL. Research design and statistical procedures used in the Journal of Family Practice. J Fam Pract. 1986;23(6):564–566
  15. Longnecker DE. Support versus illumination (Trends in medical statistics). J Anesthesiol. 1982;57(2):73–74
  16. Sawilowsky SS. Nonparametric tests of interaction in experimental design. Rev Educ Res. 1990;60(1):91–126
  17. Micceri T. The unicorn, the normal curve, and other improbable creatures. Psychol Bull. 1989;105(1):156–166
  18. Harwell MR. A general approach to hypothesis testing for nonparametric tests. J Exp Educ. 1990;58(2):143–156
  19. Hunter MA, May RB. Some myths concerning parametric and nonparmetric tests. Can Psychol. 1993;34(4):384–389
  20. Pearson K. Contributions to the mathematical theory of evolution. II. Skew variation in homogeneous material. Philo Transac R Soc 1895; Series A: 343–414.
  21. Geary RC. Testing for normality. Biometrika. 1947;34:209–242
  22. Nunnally J. Psychometric Theory. 2. New York: McGraw-Hill; 1978;
  23. Pearson ES, Please NW. Relation between the shape of population distribution and the robustness of four simple test statistics. Biometrika. 1975;62(2):223–241
  24. Tan WY. Sampling distributions and robustness of t, f, and variance-ratio in two samples and ANOVA models with respect to departures from normality. Commun Stat. 1982;A11:2485–2511
  25. Dobi RA, Wilson MJ. A comparison of t-test, F-test, and coherence methods of detecting steady-state auditory-evoked potentials, distortion-product otoacoustic emission, or other sinusoids. J Acoust Soc Am. 1996;100(4/1):2236–2246
  26. Bohning D, Hempfling A, Schelp FP, Schlattmenn P. The area between curves (ABC) measure in nutritional anthropometry. Stat Med. 1992;11(10):1289–1304
  27. Gilbert GH, Longmate J, Branch LG. Factors influencing the effectiveness of mailed health surveys. Public Health Rep. 1992;107(5):576–584
  28. Elashoff JD, Cantor RM, Shain S. Power and validity of methods to identify variability genes. Genet Epidemiol. 1991;8(6):381–388
  29. Lucke JF. Students t test and the glasgow coma scale. Ann Emerg Med. 1996;28(4):408–413
  30. Maxfield M, Schweitzer J, Gouvier WD. Measures of central tendency, variability, and relative standing in nonnormal distributions (Alternatives to the mean and standard score). Arch Phys Med Rehabil. 1988;69(6):406–409
  31. Zellner D, Frankewitsch T, Simon S, Keller F. Statistical analysis of hetrogenious pharmacokinetic data from the literature. Eur J Clin Chem Clin Biochem. 1996;34(7):585–589
  32. Niklasson A, Ericson A, Fryer JG. An update of the Swedish reference standards for weight, length and head circumference at birth for given gestational age (1977–1981). Acta Paediatrica Scandinavica. 1991;80(8-9):756–762
  33. Valenta HL, Fischer SK. A monte carlo simulation of range for an invasive impedance respiration monitor. Biomed Sci Instrum. 1990;26:181–184
  34. Colliver JA, Manchikanti L, Markwell SJ. Evaluation and comparison of the distributions of gastric pH and hydrogen ion concentration. Anesthesiol. 1987;67(3):391–394
  35. Sullivan LM, D’Agostino RB. Robustness of the t test applied to data distorted from normality by floor effects. J Dent Res. 1992;71(12):1938–1943
  36. Blair RC. A reaction to “consequences of failure to meet assumptions underlying the fixed effects analysis of variance and covariance.”. Rev Educ Res. 1981;51(4):499–507
  37. Scheffe H. The Analysis of Variance. New York: Wiley; 1958;
  38. Sawilowsky SS, Blair RC. A more realistic look at the robustness and type II error properties of the t test to departures from population normality. Psychol Bull. 1992;111(2):352–360
  39. Mansfield E. Basic Statistics with Applications. New York: W. W. Norton; 1986;
  40. Sawilowsky SS. Comments on using alternatives to normal theory statistics in social and behavioural science. Can Psycho. 1993;34(4):432–439
  41. Friedman JA, Chalmers TC, Smith H, Kuebler RR. The importance of beta, the type II error and sample size in the design and interpretation of the randomized control trial (Survey of 71 “negative” trials). N Engl J Med. 1978;299:690–694
  42. Mengel MB, Davis AB. The statistical power of family practice research. Fam Pract Res J. 1993;13(2):105–111
  43. Edlund MJ, Overall JE, Rhoades HM. Beta, or type II error in psychiatric controlled clinical trials. J Psychiat Res. 1985;19(4):563–567
  44. Kraemer HC, Thiemann S. How Many Subjects? (Statistical Power Analysis in Research). Newbury Park, CA: Sage Publications; 1987;
  45. Cohen J. Statistical Power Analysis for the Behavioral Sciences. 2. Hillsdale, NJ: Erlbaum; 1988;
  46. Cohen JA. Power primer. Psychol Bull. 1992;112(1):155–159
  47. Dixon WJ. Power under normality of several nonparametric tests. Ann Mathemat Stat. 1954;25:610–614
  48. Chernoff H, Savage IR. Asymptotic normality and efficiency of certain nonparametric test statistics. Ann Mathemat Stat. 1958;29:972–999
  49. Neave HR, Granger CWJ. A Monte Carlo study comparing various two-sample tests for differences in mean. Technometrics. 1968;10:509–522
  50. Lehmann EL. Nonparametrics. San Francisco, CA: Holden-Day; 1975;
  51. Randles RH, Wolfe DA. Introduction to the Theory of Nonparametric Tests. New York: John Wiley; 1979;
  52. Blair RC, Higgins JJ. A comparison of the power of the Wilcoxon’s rank-sum statistic to that of Student’s t statistic under various non-normal distributions. J Educ Stat. 1980;5(4):309–335
  53. Blair RC, Higgins JJ. A note on the asymptotic relative efficiency of the Wilcoxon rank-sum test relative to the independent means t test under mixtures of two normal distributions. Br J Math Stat Psychol. 1981;31:124–128
  54. Zimmerman DW, Zumbo BD. Effects of outliers on the relative power of parametric and nonparametric statistical tests. Percep Motor Skills. 1990;71:339–349
  55. Conover WJ, Iman RL. Rank transformations as a bridge between parametric and nonparametric statistics. Am Stat. 1981;35:124–129
  56. International Mathematical and Statistical Libraries. IMSL Library Reference Manual. 10th ed. Houston, TX: 1987.
  57. Hill JC, Schoener EP. The age-dependent decline of ADHD. ADHD Report. 1998;6(1):4–5
  58. Barkley RA. Age-dependent decline in ADHD (A final rejoinder). ADHD Report. 1998;6(1):7–8
  59. Sawilowsky S, Musial JL. Modeling ADHD as exponential decay. ADHD Report. 1998;6(1):10–11

PII: S0895-4356(98)00168-1

Journal of Clinical Epidemiology
Volume 52, Issue 3 , Pages 229-235 , March 1999