User testing and stakeholder feedback contributed to the development of understandable and useful Summary of Findings tables for Cochrane reviews
Article Outline
- Abstract
- 1. Background
- 2. Methods
- 3. Results
- 4. Discussion
- 5. Conclusion, guidelines, and further research
- Acknowledgments
- References
- Copyright
Abstract
Objective
To develop a Summary of Findings (SoF) table for use in Cochrane reviews that is understandable and useful for health professionals, acceptable to Cochrane Collaboration stakeholders, and feasible to implement.
Study Design and Setting
We gathered stakeholder feedback on the format and content of an SoF table from an advisory group of more than 50 participants and their constituencies through e-mail consultations. We conducted user tests using a think-aloud protocol method, collecting feedback from 21 health professionals and researchers in Norway and the UK. We analyzed the feedback, defined problem areas, and generated new solutions in brainstorming workshops.
Results
Stakeholders were concerned about precision in the data representation and about production feasibility. User testing revealed unexpected comprehension problems, mainly confusion about what the different numbers referred to (class reference). Resolving the tension between achieving table precision and table simplicity became the main focus of the working group.
Conclusion
User testing led to a table more useful and understandable for clinical audiences. We arrived at an SoF table that was acceptable to the stakeholders and in principle feasible to implement technically. Some challenges remain, including presenting continuous outcomes and technical/editorial implementation.
Keywords: Knowledge translation, Health numeracy, Risk communication, Systematic reviews, Information design, Usability, User experience, Fuzzy traces theory
Everything should be made as simple as possible but not simpler. (Albert Einstein)
Simplicity is highly overrated. (Donald Normann)
Key finding:
What this adds to what is known?
What are the implications, what should change now?
1. Background
Limited time is a frequently cited barrier to clinicians' use of evidence in practice [1], [2], [3], [4], [5], [6]. Systematic reviews help to address this problem by summarizing evidence [7] but are still too time consuming to be practical for busy professionals. Further summarization of systematic reviews could help make evidence more useful and easy to find for decision makers with limited time [8], [9].
This is the first of two articles on the development and evaluation of summaries of Cochrane reviews for clinicians and other typical users of The Cochrane Library or general medical journals [10]. The challenges and solutions we discuss here are also likely to be relevant for other systematic reviews and health technology assessments.
Summarized evidence for clinicians exists in many different formats, for instance as structured abstracts, synopses published in secondary journals, and online services. Haynes' 5-S pyramid describes a typology of increasingly condensed and clinically useful formats: from studies (and their abstracts) to syntheses (systematic reviews), synopses (eg, ACP Journal Club), summaries (eg, Clinical Evidence), and systems (eg, EPJ reminders) [11], [12], [13], [14]. The PRISMA statement [15] provides a consensus-based checklist for producing abstracts for systematic reviews, and Hartley [16] reviews how these abstracts might be made clearer for a wide target group. However, little research has been published describing how clinicians experience summaries of systematic reviews. Numerical presentations of risk can be difficult, even for highly educated populations [17]. On the other hand, risk communication studies have shown that text-based descriptions of the effect of an intervention tend to be interpreted inconsistently by different people [18], [19], [20] and that numbers may be preferred by people making important health care decisions [21].
Earlier work on creating summaries of Cochrane reviews has also illustrated that to summarize already synthesized evidence is challenging and can lead to misrepresentation of the original data [22]. When attempting to summarize evidence for consumers with back pain, researchers encountered several critical issues:
The GRADE system offers possible solutions to some of these challenges. GRADE is a structured, transparent system that allows authors to evaluate and report the quality of evidence [23], [24]. An output of GRADE is a “Summary of Findings” (SoF) table, where authors are encouraged to focus on the most important outcomes, including those outcomes with no data or statistically nonsignificant data and adverse effects. Authors' judgments about the quality of evidence are presented together with the results for each outcome. The GRADE SoF table offers a useful starting point for summary authors by bringing the most important information to the foreground, regardless of the results or lack of them, and explicitly highlighting the quality of the evidence for each outcome.
Since 2004, open discussions have taken place in the Cochrane Collaboration about including SoF tables in Cochrane reviews [25], and extensive input has been gathered from stakeholders on the content and formatting of such tables. However, a number of issues continued to remain unresolved. A working group was, therefore, established to continue developing an SoF table designed for inclusion in Cochrane reviews and to evaluate this table.
The SoF table should summarize the key results of the review by presenting what is known and not known about the benefits and harms of an intervention, as well as how sure we can be of the evidence. It should be understandable and useful for a clinical audience, without oversimplifying or incorrectly presenting the data. We also needed to ensure that the content and data presentation was acceptable to Cochrane stakeholders and that the formatting was feasible to produce within the technical constraints of the system for publishing Cochrane reviews. In this article, we present and discuss the development process that led to our final decisions regarding table content, format, and data representation. In a second article [10], we present the effect of including a table in a Cochrane review on user satisfaction, understanding, and time spent finding key results.
2. Methods
To develop a table that works for different types of data, we searched for a Cochrane review that included dichotomous and continuous outcomes and outcomes with no data. The Cochrane review on the effect of compression stockings for preventing deep vein thrombosis in airline passengers [26] had all these types of results. It also covered a topic that was of potential interest to many people, making it easy to use in an evaluation process involving participants with different backgrounds. Using GRADE, we generated an SoF table for this review (Fig. 1).
We used cycles of multiple methods to develop the table:
We also carried out two randomized controlled trials (RCTs) between development cycles to measure user satisfaction, correct understanding, and time spent to find main messages in the review, the results of which are reported in another article [10]. We fed all stakeholder and user feedback into the brainstorming workshops. For an overview of the entire process, see Fig. 2.
2.1. Brainstorming workshops
We began the project with a brainstorming workshop where a working group of four people met to generate a range of ideas to address the issues uncovered by the other methods. We applied principles from our professional perspectives including information design, journalism, and clinical epidemiology. Workshops were repeated after each round of advisory group feedback and user testing.
2.2. Advisory group feedback
The advisory group provided feedback on the table from a stakeholder perspective. This group consisted of more than 50 people with a range of roles in the Cochrane Collaboration, including statisticians and other methodologists, review authors, editors, consumer representatives, publishers, and members of the Steering Group. We consulted them by e-mail at three different phases of the development, encouraging them to collect feedback from their constituencies when reporting back to us. We then analyzed their feedback, looking for issues with a high level of agreement or disagreement, issues we had not previously considered, or issues of critical importance such as incorrect presentation of data or formatting that was not technically feasible.
2.3. User testing
User testing provided feedback from a user perspective. Participants from Norway and the UK with a variety of healthcare-related professional backgrounds took part in these tests.
2.3.1. ParticipantsFor the first set of user tests, we recruited participants attending a Norwegian workshop for newcomers to evidence-based practice. Workshop leaders asked for volunteers who could describe the basic principles of a systematic review and who had visited the Cochrane Library at least once, to minimize confounding because of unfamiliarity with Cochrane or systematic reviews. Participants' backgrounds were primarily clinical, and English was not their first language. For the second set of tests, we recruited participants through the Centre for Evidence-Based Medicine in Oxford, UK. Potential participants were identified by the centre, who contacted them by telephone or e-mail. Although we used the same inclusion criteria as above, this group was on the whole more familiar with Cochrane reviews. Although it included several clinicians, many had a more research-related background than the Norwegian participants. English was the first language of all members of this group.
2.3.2. Think-aloud protocolThe user tests were performed individually and took 1
hour. With the participant's written permission, we audio recorded each test, and an observer took notes. Using a semistructured interview guide, we explored immediate first impressions of the table as a whole and then detailed descriptions of each table element. The interview guide was designed to explore six of the seven different facets of “user experience” as described in a model by Morville [27]: usability (defined here as “correct understanding and ease of use”), credibility, usefulness, desirability, findability, and value (see Fig. 3). The seventh facet of this model—accessibility—was not addressed as we were still testing on paper and could not explore issues relevant to online accessibility. Follow-up questions covered overall impressions and suggestions for improvement.
One designer and one researcher reviewed all the notes and transcriptions together, looking for barriers and facilitators to the six facets referred to above and tracing findings back to the elements or characteristics of the tables that appeared to cause problems. Findings were rated in three categories according to the severity of the problem: high (critical errors such as incorrect interpretation or high degree of uncertainty or dissatisfaction), medium (much frustration or unnecessarily slow use), and low (minor or cosmetic problems). We also registered things users explicitly liked and suggestions for improvement.
These findings were discussed in the brainstorming workshops, particularly those of high severity. For some issues, specific input was sought from the advisory group.
3. Results
3.1. Brainstorming workshop results
In the workshops, we initially focused much of our efforts on improving legibility and comprehension through changes in visual and verbal elements. For instance, to highlight key information while taking into account the technical constraints of the publishing system, we made the following changes:
We made continual efforts to find terms and phrases that correctly described the data but that could be understood by nonstatisticians. We initiated an explanation sheet for descriptions of terms used in the table (Table 1).
Table 1. Explanations for Cochrane Summary of Findings tables
As we collected input from the advisory group and the user tests, the main focus in the brainstorming workshops became more apparent: to address the tension between achieving precision and simplicity. Tables that included enough information to meet the precision goals of the advisory group tended to be too complicated for participants to understand or want to read. There was, therefore, a continuous reevaluation about what information was most critical to include, and much effort was spent trying to find solutions that accommodated both perspectives.
3.2. Advisory group feedback results
We received 58 responses from 52 individuals or groups. Comments fell mostly into two categories: precision of the data representation and feasibility of publishing the tables within the current Cochrane system.
In general, the advisory group was concerned with presenting information in a form that they thought users would understand. However, there was some resistance to taking this too far:
Feedback related to precision of data representation included comments about:
hours in duration.”
Examples of feedback regarding production and publishing within the Cochrane system:
3.3. User testing results
Twenty-one people from Norway and UK took part in the user tests. During the first set of tests, we found several problems that we ranked as high severity. After modifying the table several times, we tested a new version. No findings in the high severity category were observed in this second set of user tests. The findings that led to most changes in the table were concentrated in two of the seven facets of the user experience model: usability and usefulness.
3.3.1. Usability (correct understanding and ease of use)A major finding, particularly in the first set of user tests, was that participants misunderstood or were uncertain about a range of elements:
For instance, 5 of 13 participants dramatically misunderstood “9 fewer per 1000” in the column for “Absolute difference,” stating that it meant “9” or “9 or fewer.” This mistake was made by some even when they correctly read the effect statement out loud. Two participants understood the statement correctly but were unsure if their interpretation was right. Three of 13 participants mentioned specifically that they used “Favours stockings” to confirm that they had understood the numbers correctly.
Continuous outcomes caused confusion, usually because participants could not identify what the numbers related to: “5 to 9 what? People?” Explanations, placed in the Comments column, were often overlooked. Other numbers also caused confusion: 4 of 13 test persons in the first set of user tests said that either the number of studies “(9)” was a reference to a footnote or they did not know what it meant.
Participants also exhibited unfamiliarity with language and concepts used in the table. Sixteen of 21 participants did not understand the headings “Illustrative comparative risk,” “Assumed risk,” and “Corresponding risk,” and 12 of 21 did not understand what was meant by “no data available” or empty cells. Abbreviations such as “RR” (relative risk) and “CI” (confidence interval) also caused confusion regarding both what the abbreviation stood for and the concept it referred to.
Participants did not have critical problems related to understanding the GRADE ratings, despite most not having prior knowledge of GRADE.
3.3.2. UsefulnessParticipants offered suggestions for changes that would make the tables more useful in a clinical setting. These included
Eighteen of 21 test persons indicated that their perception of the credibility of the table was directly related to the GRADE ratings. “I would say that if the quality of evidence (referring to the GRADE score) was high, then I would believe in it more.”
3.3.4. FindabilityMost participants indicated that an SoF table should be near the front of the review, near the abstract. User preference regarding placement was measured explicitly in our randomized trial of the table [10].
3.3.5. Desirability and valueFourteen of 21 participants said that the table would be a valuable addition to Cochrane reviews. One person did not like tables in general. One participant explained that she did not like it but anticipated that she would feel differently over time after becoming more familiar with the format. User satisfaction was also measured in our randomized trial [10].
3.3.6. First impressions vs. exposure over timeAlthough 11 of 21 participants felt that the table contained large amounts of information, this was not necessarily negative. Some said that they expected a learning curve for this kind of information and were confident that they would find these tables easier to read upon repeated exposure.
3.4. Resulting SoF table
Our work resulted in many iterations of the SoF table. Figure 4 shows the last version.
4. Discussion
Through feedback from the advisory group and our efforts in the brainstorming workshops, we arrived at a table that was acceptable to the stakeholders and in principle feasible to implement technically. User testing helped us to improve the table for a clinical audience. There are remaining challenges, including presenting continuous outcomes and implementing the table in the Cochrane publishing system.
Before the start of our project, the GRADE Working Group had made several choices regarding the formatting of the table guided by what was known about how people understand risk information. One key choice was that data should be represented numerically, partly because this would provide a supplement to the already text-based abstract and plain language summary, but also because a numerical presentation of results would be a more precise starting point for other summaries based on the review.
The manner in which numerical results were presented was also guided by research evidence indicating that
4.1. Trouble understanding the class references
Although numbers may be more precise than qualitative presentations, they still have problems. We initially thought that the focus of our project was to arrive at a table that users were satisfied with. However, achieving user satisfaction does not guarantee that information is being understood correctly. During the first set of user tests, we became aware that correct comprehension was a much larger issue than we had anticipated. Much of the difficulty that we observed was related to confusion about what numbers referred to (“class reference”). Problems correctly identifying reference class have been uncovered in past work [30], [34].
4.1.1. Trouble with absolute effectInstead of making the table easier to read by reducing computational tasks, the statement “9 fewer per 1000” caused uncertainty and errors. This is possibly because of the subtle change of class reference between the control group risk column and absolute effect column: “X number of people per 1000” and “X fewer number of people per 1000.” In a recent review of formats for conveying health risks, Lipkus [18] recommends consistency in use of numerical formats. When we reformatted the way magnitude of effect was represented in this column—eliminating the absolute difference format (x fewer per 1000) and changing it to absolute risk (x per 1000)—users no longer made these errors.
4.1.2. Continuous outcomes—continuous challengeMany test participants also struggled to interpret continuous outcomes. This problem also seemed to be related to inconsistent class references: dichotomous results and continuous results appeared in the same columns, but the numbers for these two outcome types referred to different classes of phenomena. “1 per 1000” refers to numbers of people, whereas “mean 6 to 9” refer to a range on a scale. We experimented presenting continuous outcomes using both sentences and numbers so that the scale references became more apparent but are uncertain how effective this format is as it was not tested explicitly.
In addition, the column heading “Corresponding Risk With Stockings” is technically wrong for these outcomes. This kind of discrepancy could be dealt with if the text in column headings were less precise, for instance only “Without Stockings” and “With Stockings,” leaving the more accurate descriptions of the column content to a footnote. This issue and the issue of how to present continuous outcomes need further work.
4.1.3. Trouble identifying other numbers' class referencesReaders' uncertainty about the class reference also cropped up in other places. Throughout the table, different numbers refer to different classes of things. Figure 5 (an early version of the table) illustrates this more clearly. Here “30/1000” in the DVT row refers to people, “(1 to 8)” refers to per 1000 people, “(8)” refers to studies, whereas “6 to 9” in the oedema row refers to range on a scale. Although the row and column headings explain what these different numbers mean, this was not enough for many participants. When the formatting is similar but means two different things, such as “6 to 9” meaning range on a continuous outcome scale and “(1 to 8)” meaning confidence interval, readers at any level may be challenged.
4.1.4. Trade-offs between class cues and clutterDifficulties associated with class reference have been pointed out in earlier studies: combining information from different classes, leaving class open to interpretation [30], and overlapping or nested classes [35]. The confusion we observed appeared to be because of difficulty identifying different classes. Text labels in direct proximity to the numbers (eg, “Mean oedema range: 6 to 8” or “9 studies”) help clarify the class reference. The trade-off is to balance this information without creating an overly cluttered table that may both demotivate readers and interfere with their task of quickly taking in key information.
4.2. Precision or simplicity—verbatim or gist?
The tug-of-war between precision and simplicity, reflected in the feedback from the advisory group and the test participants, was our main challenge when designing the table. A good example of this conflict was the differing feedback we received concerning the phrase “Favours stockings.” This phrase was inserted directly underneath the numbers expressing magnitude of effect for one outcome in an early version of the table. User test participants said that this phrase was helpful, explaining that this kind of cue helped them feel more confident in their understanding of the figures. The advisory group recommended taking these phrases out, because they were seen as misleading and oversimplifying. However, participants' favorable reactions to “Favours stockings” may tell us something about how numbers are actually used in decision making.
There is some evidence suggesting that people may not calculate with precise numbers (such as “10 per 1000” or “1 per 1000”) in real-world problem situations involving decision making or interpretation but prefer to rely on gists—semantic representations of the information [36], [37]—such as “Favours stockings.” Fuzzy traces theory can explain this preference, claiming that people display a dual processing of information along a verbatim–gist continuum. Readers register both the verbatim (the precise information) and the gist (the qualitative interpretation of what is being communicated) but have a gist preference [38]. Extracting the correct gist can prevent basic comprehension errors [35]. Some work also indicates that gist preference may increase with higher levels of expertise [39], [40], [41].
“Favours stockings” is a phrase that helps readers quickly form a correct gist of what the numbers mean and saves time. This gist may be sufficient to answer a decision maker's initial questions about a particular outcome—“Did the intervention have an effect? (yes/no)” and “Is this effect desirable in this situation? (yes/no)”—before actually paying attention to the exact amount of the effect. Such levels of precision may not be necessary until both these initial questions are answered affirmatively, and the process of balancing actual amounts of benefits, harms, costs, and uncertainty can begin.
Perhaps equally important, the phrase “Favours stockings” is less prone to being dramatically misunderstood (ie, is not easily confused with “does not favour stockings”), whereas small easy-to-make mistakes in processing the precise data could produce major errors. For instance, assuming that the intervention group results were in the first of the two effect columns or misunderstanding the framing of a continuous outcome scale (is high on this scale good or bad?), would provide a totally incorrect gist of the data. Preferences for text cues such as “Favours stockings” might reflect an appropriate safeguarding behavior for those who feel that they may be at risk of making mistakes when faced with a complex table of numbers.
4.2.1. If gists cannot be represented explicitly, make them easy to extractAs the advisory group pointed out, although potentially helpful, the phrase “Favours stocking” may lead to overinterpretation when the effect difference is actually very small or the confidence interval is wide. The word “favours” also implies an imbedded value judgment about the desirability of the outcome that should not be made by a systematic review author [42]. Despite user preference, these cues were, therefore, eliminated. An alternative to providing cues may be to ensure that data are presented in ways that enables readers to easily extract the correct gist out of the verbatim information. For instance, the information can be visually layered through use of color or varying type size/weight so that key messages pop out more clearly [43]. Numbers can be aligned to create more visual order, aiding comparison and gist extraction. Neglecting to do so may scramble the information and render it less usable /useful as well more error prone [34].
4.2.2. Technical barriers to enabling gist extractionThe table was designed to fit within the constraints of the Cochrane publishing system, although actual implementation of several features of the table have proven to be difficult, in both HTML and PDF versions. These include the features that help readers quickly focus attention on the main messages and aid gist extraction (shading of cells, variation of font type/size/weight). We are currently working to resolve these issues.
4.3. Evidence into practice—making information useful for clinical contexts
Part of the challenge of bringing research into practice is making the information useful for a clinical context. Through user testing, we collected feedback on specific elements that would render the SoF table more useful in a clinical context, including specifying the criteria for high- and low-risk populations and describing the intervention in more detail. Glasziou [44] has pointed out that detailed description of the intervention is critical for the clinical reader but is often lacking in both systematic reviews and articles reporting on clinical trials.
4.4. Limitations
The strengths of this study include the use of multiple methods and involvement of a range of stakeholders with complementary perspectives. However, the study has some limitations:
5. Conclusion, guidelines, and further research
Aspects of the SoF table design (including details about numerical representation and text and visual formatting) have a strong influence on users' perceptions, especially regarding their understanding of the data. General guidelines for these kinds of tables are the following:
To make tables more useful for clinicians, include
The table met with broad approval by the advisory group and by the health professionals in the user testing. The Cochrane Collaboration now recommends including SoF tables in Cochrane reviews, placed after the abstract [45]. Formatting will be somewhat limited because of technical issues in the publishing system. Results from two RCTs measuring the table's effect on user satisfaction, understanding, and time spent finding results in a systematic review are reported in a separate article [10].
Further work in progress includes how to update existing reviews with SoF tables, how to implement them in the production of new reviews, how to present continuous outcomes, and how to produce tables targeted at consumers and at policy makers. The SoF format was developed using only one example (compression stockings). Although this summary was complicated and most summaries will be simpler, other reviews may present additional challenges, such as summarizing several comparisons and presenting results for outcomes when a meta-analysis was not possible.
Future research should include comparisons of this summary table with other summary formats currently in use.
The proposed format is being used by other organizations publishing summaries of findings. Software is available to generate SoF tables using this format [46].
Acknowledgments
Thanks to Arild Bjørndal for his help with the manuscript.
References
- . How do primary care physicians seek answers to clinical questions? A literature review. J Med Libr Assoc. 2006;94:55–60
- . Variation in the use of online clinical evidence: a qualitative analysis. Int J Med Inform [research]. 2003;69:1–16
- . What drives change? Barriers to and incentives for achieving evidence-based practice. Med J Aust. 2004;180(6 Suppl):57–60
- . Why do residents fail to answer their clinical questions? A qualitative study of barriers to practicing evidence-based medicine. Acad Med. [research]. 2005;80(2):176–182
- . Incentives and barriers that influence clinical computerization in Hong Kong: a population-based physician survey. J Am Med Inform Assoc. [research]. 2003;10:201–212
- Obstacles to answering doctors' questions about patient care with evidence: qualitative study. BMJ. [research (qualitative study)]. 2002;324(7339):710
- . How to read a paper: papers that summarise other papers (systematic reviews and meta-analyses). BMJ. 1997;315(7109):672–675
- . Using evidence-based medicine to help physicians keep up-to-date. Serials: J Serials Community. [research—survey/poll]. 1996;9(2):178–181
- . The paths from research to improved health outcomes. Evid Based Nurs. 2005;8(2):36–38
- . Summary-of-findings tables in Cochrane reviews improved understanding and rapid retrieval of key information. J Clin Epidemiol. 2010;63:618–624
- . Of studies, syntheses, synopses, summaries, and systems: the "5S" evolution of information services for evidence-based healthcare decisions. Evidence Based Nurs. [EBN notebook]. 2007;10:6–7
- . More informative abstracts revisited. Ann Intern Med. 1990;113(1):69–76
- ACP Journal Club. Clinical synopses. Quality-assessed, clinically rated original studies and reviews from over 130 clinical journals. Available at: http://www.acpjc.org. Accessed Dec 30, 2008.
- Clinical Evidence. BMJ Publishing Group Ltd; Available at: http://clinicalevidence.bmj.com/ceweb/index.jsp. Accessed Dec 30, 2008.
- . Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Med.[peer-reviewed]. 2009;6(7):e1000097
- . Clarifying the abstracts of systematic literature reviews. Bull Med Libr Assoc. [literature review]. 2000;88(4):332–337
- . General performance on a numeracy scale among highly educated samples. Med Decis Making. 2001;21(1):37–44
- . Numeric, verbal, and visual formats of conveying health risks: suggested best practices and future recommendations. Med Decis Making. 2007;27:696–713
- . How age, outcome severity, and scale influence general medicine clinic patients' interpretations of verbal probability terms. J Gen Intern Med. 1994;9(5):268–271
- . Patients' interpretations of probability terms. J Gen Intern Med. 1991;6(3):237–240
- . The effect of numerical statements of risk on trust and comfort with hypothetical physician risk communication. Med Decis Making. 2004;24(3):265–271
- . Summaries of findings, descriptions of interventions, and information about adverse effects would make reviews more informative. J Clin Epidemiol. 2006;59:770–778
- . Grading quality of evidence and strength of recommendations. BMJ. 2004;328(7454):1490
- GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ. 2008;336:924–926
- . Summaries of findings in Cochrane reviews. Cochrane Collaboration Methods Group Newsl. 2004;8–9Sect
- . Compression stockings for preventing deep vein thrombosis in airline passengers. Cochrane Database Syst Rev. 2006;(2):CD004002
- Morville P. User Experience Design [website]: Sematic Studios LLC;2004 [updated June 21, 2004–April 09, 2008]; [honeycomb model]. Available at: http://www.semanticstudios.com/publications/semantics/000029.php.
- . Patient comprehension of information for shared treatment decision making: state of the art and future directions. Patient Educ Couns. 2003;50(3):285–290
- . A systematic review on communicating with patients about evidence. J Eval Clin Pract. 2006;12(1):13–23
- . Simple tools for understanding risks: from innumeracy to insight. BMJ. [education and debate]. 2003;327(7417):741
- . Strategies to help patients understand risks. BMJ. 2003;327(7417):745–748
- Symbols were superior to numbers for presenting strength of recommendations to health care consumers: a randomized trial. J Clin Epidemiol. 2007;60(12):1298–1305
- . Letters, numbers, symbols and words: how to communicate grades of evidence and recommendations. CMAJ. 2003;169(7):677–680
- . A theory of medical decision making and health: fuzzy trace theory. Med Decis Making. 2008;28(6):850–865
- . How people make decisions that involve risk. A dual-processes approach. Curr Dir Psychol Sci. 2004;13(2):60–66
- . Numeracy, ratio bias, and denominator neglect in judgments of risk and probability. Learn Indiv Diff. [theory]. 2007;18(1):89–107
- . A web exercise in evidence-based medicine using cognitive theory. J Gen Intern Med. 2001;16(2):94–99
- . Fuzzy-trace theory: dual processes in memory, reasoning, and cognitive neuroscience. Adv Child Dev Behav. [theory]. 2001;28:41–100
- . The development of judgment and decision making during childhood and adolescence. Curr Dir Psychol Sci. 2002;11(4):145–149
- . Fuzzy-trace theory and framing effects in children's risky decision making. Psychol Sci. 1994;5(5):275–279
- . Physician decision making and cardiac risk: effects of knowledge, risk perception, risk tolerance, and fuzzy processing. J Exp Psychol Appl. 2006;12(3):179–195
- In: Higgins JPT, Green S editor. Cochrane Handbook for Systematic Reviews of Interventions Version 5.0.2 [updated September 2009]. The Cochrane Collaboration; 2009;Available from www.cochrane-handbook.org
- . Envisioning information. Cheshire, CT: Graphics Press; 1990;
- . What is missing from descriptions of treatment in trials and reviews?. BMJ. 2008;336(7659):1472–1474
- . Chapter 11: presenting results and ‘summary of findings’ tables. In: Higgens JP, Green S editor. Cochrane handbook for systematic reviews of interventions. Chichester, England: Wiley-Blackwell; 2008;
- GRADEpro—Information Management System (IMS). [07 January 2009]; GRADEpro (GRADEprofiler) is the software used to create Summary of Findings (SoF) tables in Cochrane systematic reviews. Software download and technical information, support, feedback and other resources. Available at: http://www.cc-ims.net/revman/gradepro/gradepro. Accessed Jan 07, 2009
PII: S0895-4356(10)00024-7
doi:10.1016/j.jclinepi.2009.12.013
© 2010 Elsevier Inc. All rights reserved.







