Introduction

Modelling studies are widely used to help inform decisions about health care and policy, and their use is increasing [1, 2]. A model is “an analytical methodology that accounts for events over time and across populations, that is based on data drawn from primary or secondary sources…” and, in the context of health-care evaluation, “…whose purpose is to estimate the effects of an intervention on valued health consequences and costs” [3]. A model’s value lies not only in its results but also in its ability to reveal the connections between its data and assumptions and its outputs [3]. Yet, as Garrison points out, a model does not have to be mathematically sophisticated to be hard to follow [4]. For these reasons, a model should not be a “black box” for the end user but should be as transparent as possible [3].

To address the problem of poorly reported research, multiple reporting guidelines have been developed and validated for use with a range of study designs. Reporting guidelines are evidence-based tools, developed through expert consensus, that specify minimum criteria for authors to report their research so that readers can critically appraise and interpret study findings [5, 6]. The EQUATOR Network, an international initiative that aims to improve the reliability of medical research by promoting transparent and accurate reporting of research studies, indexes more than 100 reporting guidelines on its Web site (http://www.equator-network.org).

The growth in the number and range of reporting guidelines has prompted guidance on how to develop one using a well-structured development process [6]. This study addresses the needs assessment step of that process: determining whether there is a need for a reporting guideline for population modelling studies. More specifically, the objectives of our study were: to locate and assess any existing reporting guidelines for population modelling studies; to identify key quality criteria for the reporting of population modelling studies; and to determine if and how these criteria are being reported in the literature.

Methods

We began this process with a search of the MEDLINE electronic database (1950 to February 2011) via Ovid. Our electronic search strategy (see Appendix), developed in consultation with a library scientist, was pragmatically designed to avoid being overwhelmed with irrelevant records. We hand-searched the reference lists of all papers meeting our eligibility criteria and used the related-articles feature in PubMed. In addition, we reviewed relevant textbooks and Web sites. One reviewer screened the titles and abstracts of all unique citations to identify papers that met our inclusion criteria: English-language papers that provided explicit guidance on the reporting of population modelling studies or that provided evidence on the quality of reporting of population modelling studies in the health science literature. The full text of each record that passed title and abstract screening was retrieved and reviewed by the research team, which established its inclusion or exclusion status.

For records that provided explicit guidance on the reporting of population modelling studies, the list of criteria identified was analysed using a thematic approach and the data were summarised as frequencies (i.e., the number of source lists in which each criterion appeared). For papers that presented evidence on the quality of reporting of population modelling studies, we identified the aspects of reporting that were assessed and summarised the results descriptively.
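To illustrate the frequency summary, the short Python sketch below tallies how many source lists mention each reporting criterion. The document and criterion names are hypothetical placeholders, not our actual data.

    from collections import Counter

    # Each source document contributes the set of reporting criteria it
    # mentions; the names below are hypothetical placeholders.
    criteria_by_source = {
        "guideline_A": {"time horizon", "model type", "structural assumptions"},
        "guideline_B": {"time horizon", "data identification"},
        "guideline_C": {"time horizon", "model type"},
    }

    # Count the number of source lists in which each criterion appears.
    frequency = Counter(
        criterion
        for criteria in criteria_by_source.values()
        for criterion in criteria
    )

    for criterion, n_lists in frequency.most_common():
        print(f"{criterion}: {n_lists} of {len(criteria_by_source)} lists")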

Results and discussion

We identified 806 unique records through our search strategy; 30 full-text articles were reviewed to determine eligibility (Figure 1).

Figure 1

Flow diagram of records – guidelines for reporting modelling studies and evidence on the quality of reporting of modelling studies.

Existence of guidelines for modelling studies

There were no guidelines that specifically addressed the reporting of population modelling studies. However, there were a number of reporting guidelines for economic evaluation studies: one related to modelling [7], and another included a section focused on the generalisability of modelling studies [8]. Additionally, we identified one paper that provided reporting guidance for a specific aspect of simulation modelling methodology, calibration [9].

Numerous guidelines have been published defining good practice for the conduct of economic evaluations in general and model-based evaluations in particular. We identified two papers that provided guidance for assessing the quality of decision-analytic modelling studies [3, 10] and one paper that provided guidance for assessing validation of population-based disease simulation models [2].

Identification of key reporting items

Amongst the relevant records analysed, we identified 69 quality criteria with distinct reporting characteristics (Table 1).

Table 1 Checklist items for reporting modelling studies

We identified 22 items relating to the structure of the model and broadly classified them into 10 domains: 1) statement of decision problem/objective; 2) statement of scope/perspective; 3) rationale for structure; 4) structural assumptions; 5) strategies/comparators; 6) model type; 7) time horizon; 8) disease states/pathways; 9) cycle length; and 10) parsimony.

We identified 28 items related to data issues and broadly classified them into 11 domains: 1) data identification; 2) data modelling; 3) baseline data; 4) treatment effects; 5) risk factors; 6) data incorporation; 7) assessment of uncertainty; and four specific types of uncertainty: 8) methodological; 9) structural; 10) heterogeneity; and 11) parameter.

We identified 14 items related to consistency (internal and external) and validity (output plausibility and predictive validity). The final five items fell under computer implementation, transparency, or funding.
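For readers who wish to work with the checklist programmatically, a partial, hypothetical Python encoding of these groupings is sketched below; the domain and item names follow the classification above, but the structure itself is illustrative and is not part of Table 1.

    # Partial, hypothetical encoding of the reporting checklist; item names
    # follow the domain groupings described above.
    checklist = {
        "model structure": [
            "statement of decision problem/objective",
            "statement of scope/perspective",
            "time horizon",
        ],
        "data issues": [
            "data identification",
            "assessment of uncertainty",
        ],
        "consistency and validity": [
            "internal consistency",
            "predictive validity",
        ],
    }

    total_items = sum(len(items) for items in checklist.values())
    print(f"{total_items} items across {len(checklist)} domains")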

The items are not mutually exclusive, and there is overlap among them once implicit as well as explicit considerations are taken into account. Even so, the records differed in their comprehensiveness and in the areas of model quality they considered. No item was identified by all of the resources: one item appeared in five lists, four items appeared in four lists, 17 items appeared in three lists, and the remainder appeared in only one or two lists (Table 1).

Quality of reporting

We identified two papers that addressed reporting practices of modelling studies, the first of which was a systematic review of coronary heart disease policy models [11].

The authors evaluated 75 papers on the basis of whether a sensitivity analysis was carried out, whether validity was checked, whether data quality was reported, whether illustrative examples were provided, whether the model was potentially available to the reader (transparency), and whether potential limitations were specified or discussed. This evaluation was based on whether the authors reported each specific item in the articles.

Relatively few papers included in the review reported on quality issues: sensitivity analysis and assessment of validity were reported for very few models, 33% provided illustrative examples, working versions of the model were available for 10%, and 19% reported on the limitations of their methodology.

The second paper examining the reporting practices of modelling studies looked more specifically at the reporting of calibration methods in 154 cancer simulation models [9]. Data elements abstracted included whether model validation was mentioned (52%) and whether a description of the calibration protocol was provided (66%). The authors further characterised calibration protocols by five components. A description of the data used as calibration targets was reported by 95% of the studies, and goodness-of-fit metrics were reported in 54% of the studies. However, the search algorithm used to select alternative parameter values, the criteria for identifying parameter sets that provide an acceptable model fit, and the stopping criteria were not well reported (quantitative values were not provided).
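To make the five protocol components concrete, the sketch below shows a deliberately simple random-search calibration of a hypothetical one-parameter model. The targets, model, and thresholds are invented for illustration and do not come from the reviewed studies.

    import random

    target_incidence = [0.010, 0.015, 0.022]      # 1) calibration target data

    def model(rate):
        # Toy one-parameter "model" producing predicted incidence.
        return [rate * k for k in (1.0, 1.5, 2.2)]

    def goodness_of_fit(predicted, observed):     # 2) goodness-of-fit metric
        return sum((p - o) ** 2 for p, o in zip(predicted, observed))

    random.seed(1)
    accepted = []
    max_iterations, tolerance = 10_000, 1e-6
    for _ in range(max_iterations):               # 5) stopping criterion
        rate = random.uniform(0.0, 0.1)           # 3) search algorithm (random search)
        fit = goodness_of_fit(model(rate), target_incidence)
        if fit < tolerance:                       # 4) acceptance criterion
            accepted.append((rate, fit))

    print(f"{len(accepted)} acceptable parameter set(s) found")

A reporting guideline would ask authors to state each of these five choices explicitly, since results can be sensitive to all of them.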

Few studies were identified that addressed the quality of reporting of population modelling studies. Overall, with the exception of describing the data used for calibration, there is little consistency in the reporting of the items identified as key quality criteria.

Conclusions

Population modelling studies can fill an important role for policy makers. Their ability to synthesize data from multiple sources and estimate the effects of interventions can be invaluable, especially in areas where primary data collection may be infeasible. However, in order for modelling to gain strength as a tool for health policy, it is critical that key model factors are made transparent so that users of models have a clear understanding of the model and its limitations.

While numerous guidelines exist for developing and evaluating health technology assessment and economic evaluation models, which by extension can be applicable to population modelling studies, they vary in comprehensiveness, and the methods they cover are inconsistently reported. There is evidence to suggest that key items are under-reported.

In other areas where reporting guidelines have been developed, there has been a favourable impact on the transparency and accuracy of reporting [12–15]. Population modelling studies may be another area that would benefit from the development of a reporting guideline. Moher and colleagues have outlined the importance of a structured approach to the development of reporting guidelines [6]. This paper provides results from the initial steps in that structured approach. Future work should focus on identifying key information related to potential sources of bias in population modelling studies and on identifying a multidisciplinary expert panel to steer the guideline development process.

Appendix

Search strategy

  1. "Reproducibility of Results"

  2. Quality control/

  3. ((valid$ or reliab$ or quality or accura$) adj2 (result$ or report$ or data)).tw.

  4. (good adj1 practice$).tw.

  5. Guidelines as Topic/

  6. (guideline$ or checklist$).tw.

  7. or/1-6

  8. (model$ adj3 (stud$ or method$ or process$ or simulation)).tw.

  9. (modelling or modeling).tw.

  10. 8 or 9

  11. Research design/

  12. Decision Support Techniques/

  13. published literature.tw.

  14. Research/

  15. or/11-14

  16. 7 and 10 and 15
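
For readers who want to run a comparable search programmatically, the sketch below uses Biopython's Entrez interface against PubMed. This is a rough approximation under stated assumptions: Ovid subject headings are mapped to [mh] tags, truncation ($) to PubMed's wildcard (*), text-word (.tw.) terms are left untagged so PubMed's automatic term mapping applies, and the adj proximity operators, which have no direct equivalent in this translation, are loosened to AND; the email address is a placeholder.

    from Bio import Entrez

    # NCBI asks for a contact address; this one is a placeholder.
    Entrez.email = "your.name@example.org"

    # Loose PubMed translation of the Ovid strategy above (lines 1-7).
    quality_block = ('"Reproducibility of Results"[mh] OR "Quality Control"[mh] '
                     'OR ((valid* OR reliab* OR quality OR accura*) AND '
                     '(result* OR report* OR data)) OR guideline* OR checklist*')
    # Lines 8-10: modelling terms, with adjacency loosened to AND.
    model_block = ('(model* AND (stud* OR method* OR process* OR simulation)) '
                   'OR modelling OR modeling')
    # Lines 11-15: study-design terms.
    design_block = ('"Research Design"[mh] OR "Decision Support Techniques"[mh] '
                    'OR "published literature" OR "Research"[mh]')
    # Line 16: combine the three blocks.
    query = f"({quality_block}) AND ({model_block}) AND ({design_block})"

    handle = Entrez.esearch(db="pubmed", term=query, retmax=100)
    record = Entrez.read(handle)
    handle.close()
    print(f'{record["Count"]} records; first IDs: {record["IdList"][:5]}')

Because of these approximations and ongoing database growth, such a rerun would not reproduce the 806 records retrieved in the original Ovid search.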