Introduction

In BMC Medicine this week, Lai and colleagues examined the ability of 95 hospital clinicians, allied health professionals, and laboratory technicians, and of 35 medical students, to accurately generate conclusions from four systematic review (SR) abstracts [1]. SRs are syntheses of relevant research consisting of a clearly formulated question and explicit methods to identify, select, and critically appraise studies, and to extract and analyze data (The Cochrane Handbook: http://www.cochrane.org/training/cochrane-handbook). A meta-analysis is a statistical technique to quantitatively integrate the results of included studies and is not always conducted in a SR. Lai et al. found that although medical students were better able to decipher the correct conclusion than hospital staff, only 30.1% of participants correctly identified both the direction of effect and the strength of evidence.

A similar study examined the level of agreement between SR results and reviewers' conclusion statements [2]. Two reviewers independently used a categorization guide to classify SR results and conclusions from a sample of 296 SRs indexed in MEDLINE in November 2004. Conflicts were resolved by discussion or the involvement of a third reviewer. Only moderate agreement between SR results and conclusions was observed (kappa = 0.55; 95% confidence interval: 0.47, 0.64). The results of these two studies suggest that more guidance might be required for interpreting SR results. In this commentary, we highlight important methodological issues related to the conduct and reporting of SRs and also present our own guidance on interpreting SRs.
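
For readers less familiar with the kappa statistic, it measures agreement beyond chance and is calculated as kappa = (p_o − p_e)/(1 − p_e), where p_o is the observed proportion of agreement and p_e is the proportion of agreement expected by chance. With purely hypothetical values of p_o = 0.70 and p_e = 0.33, chosen only for illustration and not taken from the study, kappa = (0.70 − 0.33)/(1 − 0.33) ≈ 0.55, corresponding to the moderate agreement reported above.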

Discussion

Methodological issues related to the conduct and reporting of systematic reviews

Publication bias occurs when "investigators, reviewers, and editors submit or accept manuscripts for publication based on the direction or strength of the study findings" [3]. The impact of publication bias has been widely examined for clinical trials [4-8], for which it has been suggested that studies with statistically significant positive results and large effect sizes can exaggerate a treatment's effectiveness by 20% [9]. These results highlight the importance of including unpublished studies in SRs. However, unpublished studies are often difficult to locate, especially when funded by private industry [10, 11]. Clinical trial registries were developed to surmount issues related to publication bias of clinical research, yet challenges to their use persist [12, 13].

In the study by Lai and colleagues, positive (or favorable) results were more influential in changing participants' prior beliefs about the interventions presented in the SRs than negative results [1]. Previous studies have examined the relationship between SR results and the publication of SRs. In a cross-sectional study of 296 SRs indexed in MEDLINE, 36.5% of the overall sample had favorable results [2]. This increased to 57.7% for Cochrane and 64.3% for non-Cochrane reviews with a meta-analysis of the primary outcome. In an international survey of 348 SR authors, 1,405 published (median: 2.0, range: 1 to 150) and 199 unpublished (median: 2.0, range: 1 to 33) SRs were reported [14]. Participants reported that 13 of the 19 most recent unpublished SRs for which a meta-analysis was conducted had favorable results for their primary outcome. In another study including 93 published Cochrane reviews, the median time to publication was 1.63 years (range 0.15 to 7.31 years); the direction of results (positive or negative) was not associated with time to publication [15].

The PRISMA Statement calls for an international registry for SR protocols [16], which is currently under development [17]. An international registry may decrease the number of unpublished SRs and will hopefully decrease redundancy, increase transparency, and increase collaboration within the SR community.

Guidance on interpreting SR results

In the study by Lai and colleagues, the medical students received a structured and clinically integrated evidence-based medicine course, while the hospital practitioners received an introductory course on evidence-based medicine [1]. The medical students were better able to correctly match the SR abstract with the respective conclusion statement, suggesting that different forms of SR training may have differing impacts. Their results imply that systematic reviewers and end users of SRs may benefit from education on conducting SRs, including how to interpret SR results. In addition, enhancing the format of SRs to make them more user-friendly may improve the interpretation of SR results [18-20]. Examples of such initiatives include Clinical Evidence (http://clinicalevidence.bmj.com) and the Program in Policy Decision-Making (http://www.researchtopolicy.ca/).

Some systematic reviewers include end users of the review (for example, patients, policy makers, health care professionals) in the SR process [21] or circulate a draft of their discussion to their target audience. These efforts increase the applicability and relevance of SR results and promote adequate interpretation of the results from the different stakeholders' perspectives. Peer review feedback on the interpretation of SR results can also be sought by presenting the SR at a conference. Other approaches include using guidance from the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Statement and the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach, and using a categorization guide to interpret meta-analyses.

The PRISMA Statement provides reporting guidance for SR authors [16] and suggests that systematic reviewers should summarize their main SR findings in a balanced manner, including the strength of evidence for each of the main outcomes. The results should be put into context by considering not only their statistical significance, but also the clinical, political, and resource implications of relevance to patients, healthcare providers, and policy makers. Limitations of the included studies should be discussed by focusing on the risk of bias (or methodological quality) results. Limitations in the SR process itself should be noted, which can be assessed using tools for appraising SR quality (for example, Assessment of Multiple Systematic Reviews; AMSTAR) [22-24].

GRADE considers four factors in grading the strength of recommendations: quality of evidence, benefit versus harm, values and preferences, and resources [25, 26]. The quality of evidence is based on study limitations, inconsistency of results, imprecision, reporting bias, and indirectness of evidence. GRADE was originally designed for assessing clinical practice guidelines, yet it has gained popularity within the SR community and is endorsed by The Cochrane Collaboration (The Cochrane Handbook). Limitations of GRADE include that it requires training, provides limited guidance for examining non-intervention or non-diagnostic studies, and requires 'scientific value judgments' to be made about a body of evidence, which is often difficult for non-experimental research (for example, observational studies, qualitative studies) [27, 28].
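
To make the core GRADE logic concrete, the sketch below is our own simplification in Python, not an official GRADE tool; it omits GRADE's provisions for two-level downgrades and for upgrading observational evidence. It shows how a body of evidence starts at 'high' quality for randomized trials, or 'low' for observational studies, and is downgraded one level for each serious concern:

    GRADE_LEVELS = ["very low", "low", "moderate", "high"]

    def grade_quality(randomized, serious_concerns):
        """Simplified sketch of a GRADE quality-of-evidence rating.
        serious_concerns is a set drawn from: {"study limitations",
        "inconsistency", "imprecision", "reporting bias", "indirectness"}."""
        level = 3 if randomized else 1    # RCTs start "high"; observational starts "low"
        level -= len(serious_concerns)    # downgrade one level per serious concern
        return GRADE_LEVELS[max(level, 0)]

    # For example, randomized evidence with serious imprecision and
    # inconsistency would be rated "low":
    print(grade_quality(True, {"imprecision", "inconsistency"}))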

If a meta-analysis was conducted, SR authors may also find it useful to use a categorization guide to interpret the results [2, 9]. Using the example of a SR examining a particular intervention versus a comparator, favorable results (that is, a statistically significant positive effect in favor of the intervention with an associated P-value ≤0.05, or a non-statistically significant trend towards a positive result; see Text Box) would be classified as a positive finding and the authors would recommend the intervention. If the intervention also produced a statistically significant increase in adverse events, the authors may recommend the intervention at the discretion of the patient; they may not recommend the intervention if there is a statistically significant increase in serious adverse events. Unfavorable results (that is, a statistically significant negative effect in favor of the comparator with an associated P-value ≤0.05, or a non-statistically significant trend towards a negative result) would be classified as a negative finding; hence, the authors would advise against, or not recommend, the use of the intervention. A neutral result (that is, an effect size between 0.95 and 1.05 with a confidence interval (CI) that crosses 1 for dichotomous outcomes, or a CI that crosses 0 for continuous outcomes) would be classified as a neutral finding, and the authors would report no evidence supporting or refuting the intervention's effectiveness. Results are indeterminate when the SR has more than one primary outcome with conflicting results, the meta-analysis is based on few studies or patients, or the SR results are likely affected by bias; in these circumstances, the authors may report insufficient evidence or that more research is required.
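
To make this categorization concrete, the following minimal sketch in Python (our own illustration with hypothetical function and parameter names, not the published guide itself) implements the logic above for a dichotomous outcome reported as a ratio measure, assuming values below 1 favor the intervention:

    def categorize_result(effect, ci_lower, ci_upper,
                          conflicting_primary_outcomes=False,
                          few_studies_or_patients=False,
                          high_risk_of_bias=False):
        """Simplified sketch of the categorization guide described above,
        for a ratio measure (e.g. a pooled odds ratio) where values
        below 1 favor the intervention."""
        # Indeterminate results take precedence: insufficient evidence.
        if conflicting_primary_outcomes or few_studies_or_patients or high_risk_of_bias:
            return "indeterminate"
        crosses_null = ci_lower <= 1.0 <= ci_upper
        # Neutral: effect size between 0.95 and 1.05 with a CI crossing 1.
        if 0.95 <= effect <= 1.05 and crosses_null:
            return "neutral"
        if effect < 1.0:  # point estimate favors the intervention
            return "favorable" if not crosses_null else "favorable trend (non-significant)"
        return "unfavorable" if not crosses_null else "unfavorable trend (non-significant)"

For example, categorize_result(0.75, 0.62, 0.91) would return "favorable", while the same point estimate with a CI of 0.55 to 1.02 would return the non-significant trend category.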

Conclusions

An international registry of SR protocols may decrease the number of unpublished SRs and will hopefully reduce redundancy and increase transparency and collaboration within the SR community. The interpretation of SR results may be improved by educating systematic reviewers and end users of SRs, enhancing the format of SRs to make them more user-friendly, and including end users throughout the review process. Other approaches include using the PRISMA Statement, GRADE, and a categorization guide for meta-analysis results. Such efforts will increase the applicability and relevance of SR results and may help to ensure their adequate interpretation.

Text box: Meta-analysis

The outcome used in a meta-analysis is determined by the data obtained from the included studies. Binary (or event) data can be meta-analyzed using odds ratios, relative risks, and risk differences. Continuous data can be meta-analyzed using the mean difference or the standardized mean difference. Other types of outcomes that can be meta-analyzed include hazard ratios (which take time to event into consideration) and correlation coefficients. The two main types of models used for meta-analysis are the fixed-effect model and the random-effects model. The fixed-effect model does not take between-study variability into account, while the random-effects model does [29]. Meta-regression is a statistical technique that can be used to examine how variables of interest are related to the meta-analysis results [30, 31]. Bayesian approaches to meta-analysis have also been used [32, 33]. Further information on meta-analysis can be found in The Cochrane Handbook.
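
The sketch below, a minimal Python illustration under standard assumptions rather than production meta-analysis code, shows how both models pool study estimates by inverse-variance weighting; the random-effects variant uses the DerSimonian-Laird estimate of the between-study variance (tau^2):

    import math

    def inverse_variance_meta(effects, variances, random_effects=False):
        """Minimal sketch: pool study effect estimates (e.g. log odds
        ratios) by inverse-variance weighting; the random-effects model
        uses the DerSimonian-Laird estimate of between-study variance."""
        weights = [1.0 / v for v in variances]
        pooled = sum(w * e for w, e in zip(weights, effects)) / sum(weights)
        if random_effects:
            # Cochran's Q heterogeneity statistic and the DerSimonian-Laird tau^2
            q = sum(w * (e - pooled) ** 2 for w, e in zip(weights, effects))
            c = sum(weights) - sum(w * w for w in weights) / sum(weights)
            tau2 = max(0.0, (q - (len(effects) - 1)) / c)
            # Random-effects weights add tau^2 to each study's variance
            weights = [1.0 / (v + tau2) for v in variances]
            pooled = sum(w * e for w, e in zip(weights, effects)) / sum(weights)
        se = math.sqrt(1.0 / sum(weights))
        return pooled, (pooled - 1.96 * se, pooled + 1.96 * se)  # estimate, 95% CI

    # Three hypothetical studies (log odds ratios and their variances):
    print(inverse_variance_meta([-0.3, -0.1, -0.5], [0.04, 0.02, 0.09],
                                random_effects=True))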