Background

Systematic reviews and meta-analyses of randomized controlled trials (RCTs) are fundamental tools for generating reliable summaries of health care information directed to clinicians, decision makers, and patients alike [1]. By design, RCTs generally provide better quality evidence for health care decisions about interventions than observational studies [2, 3]. As such, they should be reported according to the highest possible pre-defined standards, as should their systematic reviews and meta-analyses. Indeed, systematic reviews of RCTs provide information on the clinical benefits and harms of interventions, inform the development of clinical recommendations, and help to identify future research needs. Moreover, because of their methodological rigor, they are the reference standard for synthesizing evidence about health care interventions [4]. Clinicians read them to keep up to date with their field [5, 6]. As with any other research, the value of a systematic review depends on what was done, what was found, and the clarity of its reporting. As with other publications, poor reporting of systematic reviews can limit readers’ ability to assess the strengths and weaknesses of these reviews.

Given an overwhelming day-to-day workload, the availability of large volumes of scientific publications, and limited access to many full-text articles, healthcare professionals often rely on the information contained in abstracts to guide or support their decisions [7, 8]. Among queries to PubMed, most readers look only at titles, and only half of searches result in any click on content [9]. To make matters worse, the average number of titles clicked to obtain the abstract or full text, even across several consecutive searches, is less than five; among those clicks, abstracts are accessed about 2.5 times more often than full texts [9].

Abstracts can be useful for screening by study type, facilitating quick assessment of validity, enabling efficient perusal of electronic search results, clarifying which patients and settings the results apply to, providing readers and peer reviewers with explicit summaries of results, facilitating the pre-publication peer review process, and increasing the precision of computerized searches [10–13]. Consequently, journal and conference abstracts must contain information that is sufficiently clear and accurate to permit adequate comprehension and interpretation of the findings of systematic reviews and meta-analyses [14].

After observing that the quality of abstracts of systematic reviews was still poor [15], the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) for Abstracts Group developed an extension to the PRISMA Statement to provide guidance on how to write and present abstracts of systematic reviews and meta-analyses [14]. These guidelines were published on 9 April 2013. To the best of our knowledge, no published study has yet scrutinized the impact of the release of PRISMA for Abstracts (PRISMA-A) by comparing the quality of abstracts of papers published before (2012) and after (2014 and 2015) this release. Consequently, in an effort to promote high-quality reporting of abstracts of systematic reviews and meta-analyses of RCTs, we first compared the quality of abstract reporting in high-impact biomedical journals before and after publication of the PRISMA-A guidelines. Our secondary objectives were to compare the reporting quality of each PRISMA-A item before and after the publication of PRISMA-A and to investigate factors associated with better adherence to the PRISMA-A guidelines.

Methods

Design

We identified abstracts of systematic reviews published in 2012, 2014, and 2015 in high-impact general medicine journals and assessed their reporting against the PRISMA-A recommendations [14]. We compared reporting from 2012 (before PRISMA-A) with reporting from 2014 and 2015 (after PRISMA-A). The present review was not registered. The PRISMA guidelines served as the template for reporting the present review [16]; the PRISMA checklist can be found in Additional file 1.

Data sources

Based on the impact factors published in 2015 by Thomson Reuters [17], the top ten high-impact general medical journals were selected for this study, namely: New England Journal of Medicine (NEJM), The Lancet, Journal of the American Medical Association (JAMA), Annals of Internal Medicine, British Medical Journal (BMJ), Archives of Internal Medicine, PLOS Medicine, JAMA Internal Medicine, BMC Medicine, and Mayo Clinic Proceedings. On the PRISMA website, NEJM, Annals of Internal Medicine, Archives of Internal Medicine, and Mayo Clinic Proceedings are not cited as endorsers of the PRISMA guidelines, in contrast to the other journals, which are PRISMA endorsers [18]. Journals were not excluded on the basis of their lack of endorsement of the PRISMA statement. We conducted a PubMed search of all systematic reviews and meta-analyses published in 2012, 2014, and 2015. Our search strategy combined “meta-analysis” as publication type with the journal names, and limits were set for the specific time periods of interest (2012/01/01 to 2012/12/31, 2014/01/01 to 2014/12/31, and 2015/01/01 to 2015/12/31). The search strategy is presented in Additional file 2.
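As an illustration, such a search could also be reproduced programmatically. The following is a minimal sketch using Biopython’s Entrez interface; the journal list is truncated, the e-mail address is hypothetical, and the authoritative search strategy remains the one given in Additional file 2.

```python
# Minimal sketch of the PubMed search via Biopython's Entrez interface.
# Journal list truncated and e-mail hypothetical; the exact strategy
# used in this review is given in Additional file 2.
from Bio import Entrez

Entrez.email = "reviewer@example.org"  # NCBI requires a contact address

journals = ['"The New England journal of medicine"[ta]',
            '"Lancet (London, England)"[ta]']  # ... remaining journals
periods = [("2012/01/01", "2012/12/31"),
           ("2014/01/01", "2014/12/31"),
           ("2015/01/01", "2015/12/31")]

for start, end in periods:
    query = (f'meta-analysis[pt] AND ({" OR ".join(journals)}) '
             f'AND ("{start}"[dp] : "{end}"[dp])')
    handle = Entrez.esearch(db="pubmed", term=query, retmax=500)
    record = Entrez.read(handle)
    handle.close()
    print(start[:4], record["Count"], "records retrieved")
```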

Study selection and data extraction

Two reviewers (JJRB and LNU) independently selected abstracts of systematic reviews of RCTs, including meta-analyses. The same two reviewers then independently extracted data in line with the guidance accompanying the PRISMA-A guidelines [14], assigning a “yes” or “no” answer to each of the 12 items recommended by the PRISMA-A guidelines according to whether the authors had reported it. Agreement was measured using the kappa statistic [19]. Disagreements were resolved by consensus after discussion between the authors.
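For illustration, inter-rater agreement on the item-level judgements could be quantified as follows; this is a sketch using scikit-learn’s implementation of Cohen’s kappa with hypothetical data, not the authors’ actual procedure.

```python
# Illustrative computation of Cohen's kappa on the reviewers' "yes"/"no"
# item judgements (1 = reported, 0 = not reported); data are hypothetical.
from sklearn.metrics import cohen_kappa_score

reviewer_1 = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1, 0, 1]
reviewer_2 = [1, 1, 0, 1, 1, 1, 1, 0, 1, 1, 0, 0]

kappa = cohen_kappa_score(reviewer_1, reviewer_2)
print(f"kappa = {kappa:.2f}")  # values near 0.8 indicate substantial agreement
```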

In addition to the PRISMA-A items, we collected the journal name, number of authors, abstract format (Introduction, Methods, Results, and Discussion [IMRAD] versus the 8-headings format [objective, design, setting, participants, intervention, main outcome measures, results, and conclusions]), PRISMA endorser journal (yes or no), and actual observed abstract word count (<300 versus ≥300).

Outcomes measured

The primary outcome was the number of items reported among the 12 recommended items of the PRISMA-A guidelines; the secondary outcome was the proportion of abstracts reporting each of these 12 items.
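A minimal sketch of how these outcomes could be derived from the extracted data, assuming one row per abstract and one binary column per PRISMA-A item (all column and file names are hypothetical):

```python
# Deriving the primary outcome (per-abstract count of the 12 PRISMA-A items
# reported) and the secondary outcome (per-item reporting proportions).
import pandas as pd

ITEMS = ["title", "objectives", "eligibility_criteria", "information_sources",
         "risk_of_bias", "included_studies", "synthesis_of_results",
         "description_of_effect", "strengths_limitations", "interpretation",
         "funding", "registration"]

df = pd.read_csv("abstracts.csv")       # hypothetical extraction sheet (0/1 items)
df["n_items"] = df[ITEMS].sum(axis=1)   # primary outcome: 0 to 12
item_proportions = df[ITEMS].mean()     # secondary outcome: share reporting each item
```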

Statistical analysis

Data were entered and analyzed using the Statistical Package for the Social Sciences (SPSS) for Windows, version 21.0 (IBM Corp., Armonk, NY, USA). Categorical variables were expressed as numbers (N) with percentages (%), and continuous variables as means with standard deviations (SD).

We expressed the number of items reported for each year as mean (SD) and estimated the unadjusted and adjusted differences using the independent two-sample Student t test and generalized estimating equations (GEE), respectively [20]. Mean differences and adjusted means were reported with their 95 % confidence intervals (95 % CI) and p values. For continuous outcomes (e.g., the number of items reported), we assumed a normal distribution. We compared compliance with the 12 PRISMA-A items for 2012 versus 2014 and 2015 using the chi-squared test or Fisher’s exact test where necessary. Unadjusted analysis was followed by an adjusted analysis using GEE. For binary outcomes, we assumed a binomial distribution and an unstructured correlation matrix. We reported measures of association as odds ratios (OR) for univariable analyses and adjusted odds ratios (aOR) for multivariable analyses, together with their 95 % confidence intervals (CI) and p values. Factors associated with overall better reporting were investigated using GEE, assuming a Poisson distribution for the number of items reported. The magnitude of association between better reporting and the investigated factors was measured with adjusted incidence rate ratios (aIRR) alongside their 95 % CI. Additionally, we conducted sensitivity analyses to determine factors associated with better reporting quality of the “Methods” and “Results” sections of abstracts for 2014 and 2015.
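By way of illustration, the unadjusted comparisons and the adjusted binomial GEE could be run as follows in Python (scipy/statsmodels). This is a sketch, not the SPSS code actually used; all variable names are assumptions, and an exchangeable working correlation is used for simplicity where the authors specified an unstructured matrix.

```python
# Sketch of the unadjusted tests and the adjusted binomial GEE; variable
# names mirror the text but are hypothetical, and the analysis was in
# fact performed in SPSS 21.0.
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf
from scipy import stats

df = pd.read_csv("abstracts.csv")  # hypothetical extraction sheet

# Unadjusted comparison of the number of items reported, 2012 versus 2015.
t, p = stats.ttest_ind(df.loc[df.year == 2012, "n_items"],
                       df.loc[df.year == 2015, "n_items"])

# Unadjusted comparison of one binary item, e.g. trial registration.
chi2, p_chi2, dof, _ = stats.chi2_contingency(
    pd.crosstab(df["year"], df["registration"]))

# Adjusted analysis: binomial GEE with journal as the grouping factor.
# Exchangeable working correlation used here for simplicity; the authors
# specified an unstructured correlation matrix.
gee = smf.gee("registration ~ C(year) + words_ge300 + prisma_endorser"
              " + format_8headings + group_authorship + authors_gt6",
              groups="journal", data=df,
              family=sm.families.Binomial(),
              cov_struct=sm.cov_struct.Exchangeable())
print(gee.fit().summary())  # exponentiated coefficients give the ORs/aORs
```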

For GEE, adjustments were made for actual observed abstract word count (<300 versus ≥300), PRISMA endorser journal (yes versus no), abstract format (IMRAD versus 8-headings), publication on behalf of a group (yes versus no), and number of authors (≤6 versus >6), with journal as a grouping factor to adjust for potential clustering or similarity among articles published in the same journal. Evidence against the null hypothesis was considered for a two-tailed p value of <0.05. The 300-word cut-off was chosen because, as shown for other abstract reporting guidelines such as the Consolidated Standards of Reporting Trials (CONSORT) extension for abstracts, a word count of around 300 is sufficient to cover all items [20]. We adjusted for abstract format because previous studies have reported a relationship between abstract format and reporting quality [21, 22]. We also adjusted for number of authors and publication on behalf of a group because an association between higher quality of work and a greater number of collaborators has been reported [23, 24], although other studies did not reach the same finding [25, 26].
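Under the same assumptions as the sketch above, the Poisson GEE for overall reporting quality could look like the following; exponentiating the fitted coefficients yields the adjusted incidence rate ratios.

```python
# Sketch of the Poisson GEE for the number of items reported; exponentiated
# coefficients give adjusted incidence rate ratios (aIRR). All names are
# hypothetical, as above.
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

df = pd.read_csv("abstracts.csv")  # hypothetical extraction sheet

poisson_gee = smf.gee("n_items ~ words_ge300 + prisma_endorser"
                      " + format_8headings + group_authorship + authors_gt6",
                      groups="journal", data=df,
                      family=sm.families.Poisson(),
                      cov_struct=sm.cov_struct.Exchangeable())
fit = poisson_gee.fit()

airr = np.exp(fit.params)          # adjusted incidence rate ratios
ci = np.exp(fit.conf_int())        # 95 % confidence intervals, IRR scale
print(pd.concat([airr.rename("aIRR"), ci], axis=1))
```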

Results

General characteristics of abstracts selected

Our search yielded 505 articles, of which 183 were published in 2012, 192 in 2014, and 130 in 2015. Three hundred and one abstracts did not meet our inclusion criteria and were excluded, mostly because the meta-analysis did not include RCTs (n = 280). In total, we included 204 abstracts: 84 from 2012, 59 from 2014, and 61 from 2015, as shown in the flow diagram (Fig. 1). Agreement between reviewers on all PRISMA-A items was high (kappa = 0.79, p < 0.001). Additional file 3 lists all abstracts included in this review.

Fig. 1 Flow chart of studies considered for inclusion

Regarding the journals in which the abstracts were published, 67 (32.8 %) appeared in the BMJ (Table 1). The 8-headings format was used in 132 (64.7 %) abstracts. Overall, 163 abstracts (79.9 %) had an actual observed word count ≥300, 19 (9.3 %) reported publication on behalf of a group in the authorship, and 150 (73.5 %) were from journals that endorsed PRISMA. Six of the 10 journals were PRISMA endorsers. The mean number of authors per article was 8.5 (SD 5.4) (Table 1).

Table 1 Distribution of abstracts by year and characteristics

Comparison of quality of abstract reporting between 2014 and 2012 and between 2015 and 2012

The mean number of items reported in 2014 (mean = 6.8; SD = 1.6) did not statistically differ from that reported in 2012 (mean = 6.7; SD = 1.6): mean difference (MD) 0.1; 95 % CI −0.4; 0.7; p = 0.694. The difference remained statistically non-significant after adjusting for covariates, including actual observed abstract word count, PRISMA endorsement, abstract format, publication on behalf of a group, and number of authors (MD −0.1; 95 % CI −0.6; 0.4; p = 1.0). The mean number of items reported in 2015 (mean = 7.5; SD = 1.6) differed significantly from that reported in 2012: MD 0.8; 95 % CI 0.2; 1.3; p = 0.007. This difference remained statistically significant after adjustment for covariates (MD 0.9; 95 % CI 0.4; 1.3; p = 0.002) (Table 2). An exploratory comparison of the overall quality of reporting between abstracts published in 2015 and those published in 2014 showed a statistically significant difference (MD 0.63; 95 % CI 0.05; 1.21; p = 0.034).

Table 2 Comparison of mean of PRISMA for abstracts items reported in abstracts of meta-analyses of RCT

Seven items were reported in more than 50 % of the abstracts: title, objectives, eligibility criteria, included studies, synthesis of results, description of effect, and interpretation. Four items were reported in less than 50 % of the abstracts: information sources, strengths and limitations of evidence, funding, and registration. Reporting of risk of bias varied across years, ranging from 49.2 % to 67.8 % (Table 3).

Table 3 Reporting quality of items of the PRISMA for abstracts of meta-analyses of RCT

In univariable analysis, there was a statistically significant improvement in abstract reporting in 2014 compared to 2012 only for “risk of bias” (crude OR [cOR] = 2.0; 95 % CI 1.003–4.0). After adjustment, “risk of bias” was no longer statistically significant, whereas “strengths and limitations of evidence” (adjusted OR [aOR] = 0.23; 95 % CI 0.07–0.71) and “funding” (aOR = 0.25; 95 % CI 0.07–0.81) were statistically associated with lower reporting quality in 2014 compared to 2012 (Table 4).

Table 4 Comparison of reporting quality of items of the PRISMA for abstracts of meta-analyses of RCT

In univariable analysis, there was a statistically significant improvement in abstract reporting in 2015 compared to 2012 for “synthesis of results” (cOR not estimable, p < 0.001), “interpretation” (cOR not estimable, p < 0.001), and “registration” (cOR = 12.2; 95 % CI 2.7–56.1). The quality of abstracts decreased from 2012 to 2015 for “strengths and limitations of evidence” (cOR = 0.38; 95 % CI 0.18–0.82). After adjustment, “synthesis of results” (aOR not estimable, p < 0.001), “description of effect” (aOR = 2.7; 95 % CI 1.2–6.1), “interpretation” (aOR not estimable, p < 0.001), and “registration” (aOR = 10.8; 95 % CI 2.3–49.6) were statistically associated with better reporting quality in 2015 compared to 2012. “Strengths and limitations of evidence” was statistically associated with lower reporting quality in 2015 compared to 2012 (Table 4).

In 2014, the factors statistically associated with overall better reporting were structuring abstracts in the 8-headings format rather than the IMRAD format (aIRR 1.26; 95 % CI 1.02–1.56) and an abstract word count <300 (aIRR 1.20; 95 % CI 1.09–1.35). The sole factor statistically associated with better reporting of the “Methods” section was an abstract word count <300 (aIRR 1.43; 95 % CI 1.16–1.72). None of the investigated factors was statistically associated with better reporting quality of the “Results” section (Table 5).

Table 5 Factors associated with a better reporting of items of PRISMA for abstracts published in 2014

In 2015, the sole factor statistically associated with overall better reporting was an abstract word count <300 (aIRR 1.20; 95 % CI 1.09–1.35). Factors statistically associated with better reporting of the “Methods” section were an abstract word count <300 (aIRR 1.43; 95 % CI 1.16–1.72), structuring abstracts in the 8-headings format rather than the IMRAD format (aIRR 1.33; 95 % CI 1.08–1.65), and not publishing on behalf of a group (aIRR 1.37; 95 % CI 1.04–1.82). No factor was statistically associated with better reporting of the “Results” section (Table 6).

Table 6 Factors associated with a better reporting of items of PRISMA for abstracts published in 2015

Discussion

This study aimed to assess, according to the PRISMA-A checklist, differences in the quality of reporting of abstracts of systematic reviews and meta-analyses of RCTs published in top-tier general medicine journals before (2012) and after (2014 and 2015) publication of the PRISMA extension for abstracts [14]. Our findings demonstrate that the overall reporting quality of abstracts did not improve in 2014 compared to the pre-PRISMA-A era, although there was a small improvement in 2015. None of the 12 items improved in 2014; in fact, two items regressed. In 2015, four items improved and one regressed. The factors associated with better reporting were presentation of abstracts in the 8-headings format and an abstract word count below 300.

These results are consistent with previous studies that have reported inconsistent and suboptimal reporting quality of abstracts across journals and fields of medicine over time [21, 27–35]. As demonstrated by Hopewell and colleagues, there are serious deficiencies in the reporting of abstracts of systematic reviews that make it difficult for readers to reliably assess study findings [36], even though it has been suggested that endorsement of the PRISMA guidelines increases adherence to its recommendations [37].

Reporting of abstracts in the 8-headings format was associated with better overall adherence to the PRISMA-A guidelines in 2014 and with better reporting of the “Methods” section in 2015. Indeed, the use of structured abstracts is warranted to improve the quality of their reporting [21, 22]. Perhaps surprisingly, an abstract word count <300 was associated with better overall reporting, as well as with better reporting of the “Methods” section, in both 2014 and 2015. This indicates that a word count of 300 is not too short to provide useful, complete, and comprehensive information. In addition, it has been demonstrated for the CONSORT guidelines that checklist items can easily be incorporated within a word limit of 250 to 300 words [20]. We found that endorsement of PRISMA was not statistically associated with an improvement in the quality of abstracts. Editors, reviewers, and authors should therefore ensure that not only the guidelines for the reporting of full texts are respected, but also those for abstracts, as Hopewell and colleagues have identified substantial discrepancies in reporting quality between abstracts and full texts [36].

Our study has some limitations. First, it was based on a limited number of published studies; nonetheless, it included all abstracts of meta-analyses of RCTs from high-impact general medical journals found in PubMed. Second, our sample may not be representative of all medical journals because we selected only general medicine journals with high impact factors. Further studies are needed to investigate all types of systematic reviews and to screen randomly selected abstracts, not only those from top-tier journals. However, as our findings show that the reporting quality of abstracts of systematic reviews, including meta-analyses, is suboptimal even in these top journals, we can extrapolate that reporting quality may be lower in other journals. Hence, this call for improvement in reporting standards could perhaps be generalized.

Conclusions

The reporting quality of abstracts of systematic reviews, including meta-analyses of RCTs, in leading general medicine journals did not improve in 2014 after the publication of the PRISMA-A guidelines and improved only slightly in 2015. There is still room for improvement to meet the standards of the PRISMA-A guidelines. Better structuring of abstracts and stricter adherence to PRISMA-A by authors, reviewers, and journal editors are highly warranted.