Background

Adolescent idiopathic scoliosis (AIS) is considered by some authors to be a condition fulfilling the criterion of being an “important healthcare problem meeting the requirements for a screening program” [1, 2]: the prevalence is estimated to be at 2–3 % of adolescents aged 10–16 years, the condition, depending on the severity of the deformity and with individual variations, affecting both physical and psychological functioning and with pain and decreased quality of life in the long term [1]. Other authors, however, classify the condition as not meeting this criterion [3, 4].

The mandate for school-based screening programs for scoliosis is highly controversial [58] and is an issue that is strongly debated [914]. National institutions and professional organizations in different countries and/or states currently either opt for mandatory screening [15], recommendation [2, 1620] or simply discourage screening [1, 3, 6, 2123]. Even very recent recommendations are contradictory [2, 21, 23, 24]. A summary and chronology of the recommendations made by different organizations in different countries since 1979 can be found in Table 1.

Table 1 Chronology and methods of development of the recommendations addressing school screening for scoliosis

Unexpectedly, the ‘update’ document, based on evidence published prior to 2003 [30], has remained the background for the 2012 US Preventive Services Task Force (USPSTF) recommendations [25], despite both qualitative [16, 35] and then quantitative [32] systematic reviews favoring screening being published in 2009 and 2010. Conversely, the 2011 review prepared for the United Kingdom National Screening Committee (UK NCS) as an update of the USPSTF 2004 review [30] provides conclusions that discourage screening [1].

Consequently, the recommendations, including those that are evidence-based, remain controversial, with different reviews reaching different conclusions. Hence an analysis of the content and quality of these reviews is urgently needed to help health care professionals as well as service users and commissioners make appropriate policies and evidence-based decisions [36, 37].

Our objective was to critically evaluate the methodological quality of current published systematically developed reviews (including those from systematically developed recommendations), that address school screening for scoliosis.

Methods

We followed the PRISMA statement for systematic reviews [38] for conducting and reporting this study.

Criteria for inclusion in the review

Types of studies

Eligible studies considered included systematic reviews of any types of primary studies. Papers were considered as systematically developed reviews if they reported on methods to identify and select papers, and which critically appraised relevant evidence [39]. These minimal criteria also applied for reviews of evidence, prepared for, or reported in, systematically developed clinical practice guidelines and recommendations [40]. Exclusion criteria were narrative reviews, expert opinions, letters to the editor and editorials.

The eligibility criteria defined for population, exposure and outcomes are presented in Box 1.

Box 1 Criteria for inclusion in the review regarding population, exposure/issue and outcomes (PEO)

 

Inclusion criteria

Exclusion criteria

Population

School children, both girls and boys, with no geographical or other (e.g. societal, racial, cultural) restrictions, in age period associated with the risk of development of AIS, typically 10–12 years of age; however, no strict age criteria were defined

Papers including other populations of children, e.g. with co-morbidities, such as Duchenne muscular dystrophy

Exposure or issue

Any reviews addressing ‘school screening for AIS’

Papers addressing other screening programs, that did not exclusively address screening tests for adolescent idiopathic scoliosis, e.g. general health examinations; screening programs for other types of scoliosis were also excluded (e.g. adult scoliosis)

Outcomes

Primary outcome: any recommendation that stated to recommend screening or not to recommend screening, i.e. a “yes” or “no” with regard to the authors` recommendations; also any secondary outcomes, if analyzed in the retrieved reviews

Not specified

Search strategy

Given the fact that the USPSTF formulated their recommendations based on evidence available between 1994 through 2002, our search was limited, where applicable, to the time period from 01 January 2003 to the most current available date. The databases were last searched between 30 May and 19 August 2013. An updated PubMed and reference list search was also conducted on retrieved papers on 02 November 2013. The search was limited to articles with at a minimum an abstract in English. Databases searched as well as the order of searching are shown in Box 2.

Box 2 Databases searched and the order of searching

Databases of systematic reviews, guideline registries and databases with separate indexing of systematic reviews and guidelines

The Centre for Reviews and Dissemination databases—DARE, HTA, NHSEED, Cochrane Database of Systematic Reviews (CDSR), Joanna Briggs Institute, Campbell Library, Cochrane Effective Practice and Organisation of Care (EPOC) Group, the AHRQ databases and resource lists from USPSTF, AHRQ Evidence-based Practice Centers (EPC Reports) and National Guideline Clearinghouse, PEDro, INAHTA, TRIP

Websites of institutions

USPSTF, CTFPHC, NHMRC, UK Screening Portal/ UK NSC Policy Database, Scottish Intercollegiate Guideline Network (SIGN), National Institute for Clinical Excellence, UK (NICE)

General bibliographic databases

MEDLINE through PubMed, Web of Science and SportsDiscus through EBSCO, Google Scholar

Gray literature—registered protocols, reviews in progress, guidelines in development and registered titles

PROSPERO, CDSR, the USPSTF registry of the topics in progress, the CTFPHC protocols, HSR Project Database, NICE, AHRQ EPC Reports database (for the EPC Reports in Progress), HSRProj Database, the NHMRC website and the Systematic Review Data Repository (SRDR) database, the Conference Proceedings Citation Index—Science from the Web of Knowledge

  1. For abbreviations not explained here see Table 2

The search terms “scoliosis” and “screening” were used to search the registries of systematic reviews, guideline databases and portals of screening recommendations. In PubMed the MeSH terms “scoliosis” and “mass screening” as well as the free terms “scoliosis” and “screening” were used together with the filters “review” and “meta-analysis”. For other bibliographic databases corresponding search terms and limits were used. Additionally, the reference lists of papers included for full text analysis as well as other publications in this area were handsearched. Authors of published papers and reviews in progress were not contacted, as our aim was to assess published reviews.

Study selection

We independently conducted the searches as well as the initial selection of studies by their title and/or abstract. Full papers were then examined for eligibility. Disagreements were resolved by discussion. The PRISMA search flow for the selection of included studies is shown in Fig. 1.

Fig. 1
figure 1

PRISMA flow diagram of included studies. 1 CRD York (DARE, HTA, NHSEED), n = 72, Cochrane CDSR, n = 35, Joanna Briggs Institute, n = 1, Campbell Library, n = 0, PEDro, n = 17, TRIP, n = 14, PubMed/MEDLINE, n = 109, Web of Science, n = 203, SportsDiscus, n = 13, GoogleScholar, n = 104; 2 registries, websites of institutions: Cochrane EPOC Group, n = 0, USPSTF, n = 3, AHRQ EBP Reports, n = 0, National Guideline Clearinghouse, n = 25, INAHTA, n = 1, CTFPHC, n = 1, SIGN, n = 0, NICE, n = 15, UK NSC, n = 5, NHMRC, n = 4, HSRProj, n = 2, PROSPERO, n = 2; hand searching of reference lists, n = 2; 3 some full text papers were excluded for more than one reason

Methodological assessment

The “Assessment of Multiple Systematic Reviews”, AMSTAR measurement tool [41] was used to assess the methodological quality of included reviews. The AMSTAR comprises 11 quality criteria (Table 3). The items are scored “yes”, “no”, “cannot answer”, or “not applicable”. The maximum score is 11. Scores 0–4, 5–8 and 9–11 indicate low-, moderate- and high-quality reviews, respectively [42]. The appraisal was conducted independently by MP and JB-S, using guides for scoring AMSTAR questions [41, 42]. Disagreements were resolved by discussion and compromise.

Further, the quality analysis of the USPSTF review [30] was expanded with a search for information addressing AMSTAR questions in related USPSTF documents, referenced in the appraised paper [27, 28, 43, 44]. When appraising the UK NSC [1] and NHMRC [3] reviews, the supplementing documentation was also evaluated [4548].

Overlap across studies

The reference lists of the included reviews were compared and Venn diagrams were created to examine potential overlaps across included papers within different reviews (Figs. 2, 3, 4, 5, Table A3—supplementary material).

Fig. 2
figure 2

Venn diagram illustrating the overlap of papers included in the USPSTF report and the more recent systematic reviews. Numbers and percentages inside the ellipses show the overlap of studies included in the four reviews. Percentages outside the ellipses illustrate the percentage of all 109 studies included in each of the four reviews. Total n = 109 (100 %). USPSTF—the US Preventive Services Task Force; MaHTAS Health Technology Assessment Section, Ministry of Health Malaysia; UK NSC the UK National Screening Committee; 1 this review also included the systematic review by Fong et al.

Fig. 3
figure 3

Venn diagram illustrating the overlap of papers included in the NHMRC report and the more recent systematic reviews. Numbers and percentages inside the ellipses show the overlap of studies included in the four reviews. Percentages outside the ellipses illustrate the percentage of all 113 studies included in each of the four reviews. Total n = 113 (100 %). NHMRC National Health and Medical Research Council, Australia; MaHTAS Health Technology Assessment Section, Ministry of Health Malaysia; UK NSC the UK National Screening Committee; 1 quality assessment for four key articles documented in the report; 2 Fong et al. is included in the UK NSC review

Fig. 4
figure 4

Venn diagram illustrating the overlap of older reviews included in current recommendations from NHMRC, USPSTF, and Italian guidelines. Numbers and percentages inside the circles show the overlap of studies included in the four reviews. Percentages outside the circles illustrate the percentage of all 40 studies included in each of the three reviews. Total n = 40 (100 %). NHMRC – National Health and Medical Research Council, Australia; USPSTF U.S. Preventive Services Task Force; Italian guidelines—“Italian guidelines on rehabilitation treatment of adolescents with scoliosis or other spinal deformities”; 1 quality assessment of four articles is documented in the report, the remaining papers are described narratively

Fig. 5
figure 5

Venn diagram illustrating the overlap of papers included in the more recent systematic reviews. Numbers and percentages inside the circles show the overlap of studies included in the three reviews. Percentages outside the circles illustrate the percentage of all 40 studies included in each of the three reviews. Total n = 102 (100 %). UK NSC the UK National Screening Committee; MaHTAS Health Technology Assessment Section, Ministry of Health Malaysia; 1 this review also included the systematic review by Fong et al.; 2 UK NSC review cites Fong et al.

Findings

Search results

From a total of 224 papers, six articles met the criteria for inclusion within the analysis: two quantitative systematic reviews [1, 32] one of which included a meta-analysis [32 ] and four systematic analyses of evidence which were part of [3, 16, 18] or supplementing [30] recommendation documents. Two further publications were duplicates [35, 49]. One supplement [48] was excluded as it duplicated the content of the resource review [3]. The included papers are characterized in Table 2. Excluded publications are listed, with reasons for exclusion, in Table A2 (supplementary material).

Table 2 Summary and characteristics of the included reviews

Gray literature

A protocol [50] as well as an information statement was found [2] together with a reference to one systematic review being in press [51]. However, as the paper was not published at the time of writing this manuscript, it was not included in the analysis. The review is, however, listed in Table A2 (appendices)—the list of excluded papers.

Quality analysis

Three papers [3, 16, 32] were of moderate and three [1, 18, 29] of low quality, respectively. The review by Sabirin et al. [16] and the meta-analysis by Fong et al. [32] received the highest scores of 6 points. The NHMRC [3] and the UK NSC [1] reviews received scores of 5 and 4, respectively. The Italian guidelines [18] as well as the USPSTF review [30] received the lowest scores of 2 points. Table 3 provides details of the AMSTAR quality assessment, with explanations regarding the scoring decisions.

Table 3 AMSTAR ratings for included reviews

Overlap across included reviews

Included reviews differed substantially with regard to both the numbers as well as the choice of included papers. The USPSTF evidence analysis [30] included 10 studies (with 14 % of reports included in all reviews), while Sabirin et al. [16] and Fong et al. [32] included 28 and 36 papers (41 and 52 % of reports included in all analyzed reviews), respectively. The USPSTF and the NHMRC recommendations that are based on 10 and 13 papers, respectively, had only three papers overlap (Figs. 2, 3).

The Italian review covering a similar time period is based on 21 papers, of which only one overlaps with the USPSTF and none with the NHMRC recommendations, respectively (Fig. 4).

Further, the 2011 UK NSC review, referring to 53 publications, overlaps with only two papers in the MaHTAS review but does not overlap with any papers included in the meta-analysis by Fong et al. (although this paper is cited). In brief the three papers, developed within similar time frames, only have 2 or 2 % of included studies that overlap (Fig. 5).

Table A3 (supplementary material) shows how the studies included in the analyzed reviews were contrasted in order to produce the Venn diagrams.

Discussion

Brief summary of results

Searching process and selection of the reviews

Comprehensive literature searches and selection of retrieved papers (Box 2, Fig. 2, Table A2—supplementary material) resulted in including six systematic reviews [1, 3, 16, 18, 30, 32], of which five [1, 3, 16, 18, 30] were conducted in the process of systematically developing recommendations. Full text analyses of a further 34 papers resulted in excluding 15 reviews addressing the subject matter, but not fulfilling the minimal criteria for a systematic review. Nineteen full texts were also excluded for other different reasons (Fig. 1, Table A2—supplementary material). Therefore, to our knowledge this study presents the first comprehensive and in-depth analysis of systematic reviews regarding screening programs for adolescents for idiopathic scoliosis.

Quality and recency of the reviews

Overall, the quality of systematic reviews ranged from the comparatively recent (2009) moderate quality (AMSTAR score 6) Fong et al. [32] and Sabirin et al. [16] through to the outdated (2002) moderate quality NHMRC, reviews [3], to the poor quality recent (2011) UK NSC [1] and outdated (2004, nonetheless used for recent (2012) recommendations [24, 25] ) USPSTF [30] and Italian (2005) [18], reviews.

The reviews which supported the recommendation for school screening [16, 18, 32] as well as those recommending against screening [1, 3, 30] are based on different papers selected for inclusion (Figs. 2, 3, 4, 5) [and thus on different evidence or assumptions of the absence of evidence (Table 2)]. Further, regardless of the different time limits used within searches, the reviews also differed significantly with regard to the databases selected, and other resources, searched (Table 2). Moreover, significant heterogeneity was found within the reviews as follows: different research designs were considered (prospective trials and retrospective observational studies, systematic reviews, editorials), which were, except in the Fong et al. [32] meta-analysis, analyzed separately (Table 2). Conclusions were based on different criteria as follows: the set of criteria for appraising—feasibility, effectiveness and appropriateness of a screening program [1, 3], accuracy of screening tests [16, 18, 32], treatment effectiveness as a criterion justifying the need for screening [16, 18, 30] and cost-effectiveness [1, 16].

Methodology issues

The reviews included within this study were heterogeneous, both with regard to the research questions asked as well as the methodology used for their development. Therefore, we conducted an umbrella review, as described by Grant and Booth [52]Footnote 1 and also by Ioannidis [53]. No primary studies were considered. It is important to note that the term “umbrella review” is also used by some authors [e.g. 54] to describe a systematic review of high-quality systematic reviews which is not the case in this evaluation.

Discussion and critique of included reviews

Higher quality systematic reviews

Three of the systematic reviews were found to be of moderate quality; Fong et al. [10], Sabirin et al. [16] and NHMRC [3].

Fong et al. [32] (2010) reported that only 17 % of the primary studies included within their meta-analysis of retrospective cohort studies found screening to be ineffective. The authors advocate for school screening, but recommended that the forward bend test should not be used alone within screening programs and that large, retrospective cohort studies are needed to better study the reliability of screening tests. As regards the studies included for analysis, the results of the Fong review only overlaps partially with the Sabirin et al. [16] study, but shows no overlap with the UK NSC 2011 review [1] (Fig. 5) and the older reviews reported by the USPSTF [30] (Fig. 2) and NHMRC [3] (Fig. 3). The conclusions from the Fong study were in part based on different evidence as well as different assumptions to the other reviews. Whilst the AMSTAR score for this meta-analysis is only of moderate quality (6), the report does meet the essential criteria of a good quality quantitative systematic review (Tables 2 and 3).

MAHTaS (2009) systematic review by Sabirin et al. [16] included a methodological assessment of individual studies as well as a cost–utility analysis. The authors recommended continuation of the screening program in Malaysia, under specific conditions: in high-risk group of 12-year-old girls, using combination of screening tests, and with well-secured training, manpower, good referral system, treatment and funding. The review was clearly reported (AMSTAR score of 6 (Table 3); however, we encountered difficulties with regard to the assessment of question 8 within AMSTAR: unfortunately, only the results of 8 out of 24 papers appraised, utilizing the CASP tool were reported. Furthermore, studies of different types designs were included and analyzed separately (Table 2), which in some instances may diminish the internal validity of the review as well as the strength of the conclusions.

The NHMRC (2002) document [3] recommends against screening for scoliosis. All the criteria for conducting a systematic review are described in detail. Moreover, unlike the other reviews evaluated, a defined methodology for the quality assessment of systematic reviews was provided. Nonetheless, the meta-analysis included in the systematic review of evidence regarding screening for scoliosis [55], and 8 out of 12 included primary studies, were only discussed descriptively (Table 2), without conducting a critical appraisal of their quality. Moreover, several other eligible primary studies were not included for analysis (Fig. 4).

The NHMRC report has not been updated since 2002, while the AMSTAR does not reflect the recency of the analyzed reviews. Also, the NHMRC review [3] was not developed according to the subsequently (2009) improved NHMRC evidence hierarchies [56].

In fact, most of the six reviews analyzed matched neither the improved (2009) NHMRC [56], nor the new (2011) Oxford Centre for Evidence-Based Medicine (OCEBM) levels of evidence hierarchy (Levels of Evidence 2 [57]), with the exception of Fong et al., which can be classified as a level 3 evidence in the OCEBP classification (Box 3).

Box 3 Definitions of evidence hierarchy for diagnostic accuracy and screening interventions described by different organizations

Diagnostic and monitoring tests/diagnostic accuracy

The new (2011) Oxford Centre for Evidence-Based Medicine (OCEBM) levels of evidence hierarchy (Levels of Evidence 2 [56] ) states that “systematic review of cross sectional studies with consistently applied reference standard and blinding” are step 1 (level 1) of evidence and consequently individual studies are level II evidence;

The improved (2009) Australian NHMRC “Additional Levels of Evidence and Grades for Recommendations for Developers of Guidelines” document [55], providing hierarchy of evidence for different types of research questions, defines level II evidence as “a study of test accuracy with: an independent, blinded comparison with a valid reference standard, among consecutive persons with a defined clinical presentation”, and systematic reviews of level II studies are level I evidence;

Both classifications list case–control studies (level IV—OCEBM, level III-3—NHMRC) but not retrospective cohort studies.

Screening interventions (both OCEBM and NHMRC)

Systematic reviews of prospective, randomized controlled trials (RCTs)—level I evidence;

Individual RCTs—level II evidence;

Lower quality reviews

The UK NSC review (2011) [1], prepared as an update of the 2004 USPSTF analysis [30], sustains the recommendations against screening. The report scored 4 with AMSTAR (low quality review, Tables 2 and 3). The authors addressed several criteria regarding the mandate for a screening program, including the cost-effectiveness of the programs, as well as the effectiveness of conservative and surgical treatment. The observational, experimental studies as well as the systematic reviews included within this paper discussed diverse aspects of scoliosis management and are described broadly and narratively. Eligibility criteria for paper inclusion are not reported. The lists of 470 retrieved reports [4547] supplement the review, which factually discusses 53 papers but no list of excluded studies is provided. The review reports on search limits from 2002 to 2011, but single older studies are also included. Thus, it would be difficult to judge how the studies described in the review were chosen. Further, while the hierarchy of evidence level is suggested—the designs of the discussed papers are reported, crucially the included studies are not critically appraised. Without knowledge of the methodological quality of included papers it is very difficult to come to a conclusion as to whether the results of this review can be applied to practice and whether the recommendations are truly valid and evidence-based.

The USPSTF resource review for the 2012 USPSTF recommendations against screening [24, 25]—the 2004 “Brief Evidence Update” [30], recommending against school screening—has been shown to be of poor quality, with an AMSTAR score of 2. The “critical” “key question”—“Is there new evidence that scoliosis treatment leads to better health outcomes if applied at an early stage”, was answered by analyzing one RCT, three reports of a large retrospective observational studies, one case series and a meta-analysis of studies from between 1975 and 1995 on the effectiveness of conservative treatments [55]. No list of excluded studies was provided, and crucially no methodological quality appraisal of their included studies was conducted. Further, the search period is outdated (1994–2002) and the literature search was not comprehensive (Fig. 4). Consequently, this review scored a low value of 2 with AMSTAR (Table 3). Whilst the USPSTF did elaborate on the development process for their recommendations, including the process of the evidence review [27, 28, 43, 44], this development process does not seem to have been followed in their study (Tables 2 and 3). Nonetheless, when appraising the review, we considered the USPSTF methodology descriptions current for that period of time, described elsewhere [27, 28, 43, 44] (Table 3). Otherwise, the review would have attained an even lower score. As the review was published in 2004, the present USPSTF methodology of guideline development, improved subsequently in 2007 and 2008 [5860], was not analyzed. It also means that the current (2012) USPSTF recommendations against scoliosis screening are based on the outdated methodology of the USPSTF guideline development.

Finally, in the USPSTF “review of the process” paper the authors say that comprehensive systematic reviews are “long, detailed reports of interest to a minority of readers and of limited value to busy clinicians” [44]. It is difficult to agree with this notion, as researchers, clinicians, commissioners and service users routinely make use of conclusions/recommendations from systematic reviews (e.g. Cochrane reviews with separately published summaries). In fact, systematic reviews are considered to be equivalent to research studies that produce the highest level of evidence according not only to OCEBM [57] and NHMRC [56] but to numerous other organizations. Additionally, the detailed guidelines for reporting explicit systematic reviews of different types of studies have been in use (such as QUORUM (1999) [61], then PRISMA (2009) [38], MOOSE (2000) [62], and reporting guidelines from the US Institute of Medicine (2011) [63]).

The authors of the Italian guidelines (2005) [18], recommending for school screening, reported on a systematic analysis of the literature. It achieved an AMSTAR score of 2 (Table 3). A classification of the levels of evidence is provided in this paper, including expert opinions. Within these recommendations the evidence for scoliosis screening was classified as “E2: fair scientific consensus” [18]. In brief, the document provides a recommendation on screening for scoliosis driven from a consensus of a panel of experts, rather than on a systematic analysis of available evidence.

Strengths and limitations of the paper

The database search was supplemented with an inspection of dedicated registries as well as websites of organizations (e.g. International Network of the Agencies of Health Technology Assessment). This resulted in retrieving important publications (UK NSC [1], NHMRC [3] and MaHTAS [16]) not indexed in major databases (as reported in detail in Fig. 3) and omitted in recent reviews [4, 64, 65]. Reference lists of publications that were retrieved were also hand searched. Potential authors of unpublished data were not contacted, as the aim of this study was to appraise published systematic reviews rather than performing a meta-analysis of primary studies. The findings from the grey literature search revealed that another systematic review was currently in press [51]. Another limitation to this paper was the fact that the papers included within this review were not fully congruent with regard to the research objectives, questions posed, strategies of review development and reporting. We believe that selection bias was unlikely in this study as only reviews meeting the specified inclusion criteria were selected.

Finally, as to our knowledge this is the first evidence-based appraisal of systematic reviews addressing school screening for scoliosis, the findings of this review could not be compared to similar reports.

Conclusions

The results found within this paper appear on the surface to be very controversial. However, further detailed inquiry based on in-depth evaluations of the methodological quality of included papers within this reviews provide much needed clarification. The most recent and current reviews provided by Fong et al. in 2010 and Sabirin et al. in 2009 both of which scored a moderate quality 6 made recommendations to continue school screening programs. These reviews were based on papers that were current, and most crucially were evaluated for their methodological quality so that readers can be assured of the reliability and validity of their results and conclusions.

The results presented within our evaluation suggest that many of the recommendations made against school screening are based on research papers that are over 10 years old and are, therefore, outdated. More to the point and of utmost importance, most of these recommendations did not evaluate the methodological quality of the papers they included within their reviews. Thus it is impossible to know whether these results and consequently their recommendations are truly valid and can reliably and justifiably be applied to clinical practice. Until such time, therefore, we suggest that the recommendations based on low-quality reviews need to be interpreted with caution.

Implications for research and practice

There is an urgent need for high-quality primary studies to be conducted on school screening for scoliosis to provide a definitive answer to this very important question. Although very difficult to conduct and not unanimously recommended in cases of school screening programs (Box 3, Fong et al. [32]), RCTs are indicated in the latest classifications of the Hierarchy of Evidence as the second highest level of evidence from primary diagnostic studies which investigate important patient outcomes [66]. The first or highest level of evidence is systematic reviews of such papers. In practice, however, we acknowledge that an RCT comparing a group undergoing school screening to one not undergoing screening would be very difficult if not impossible to conduct. Whilst Fong recommended retrospective studies, the Cochrane Back Review Group does not currently recognize such research designs to be of sufficiently good quality to be included within a Cochrane back review due to the high degree of error resulting from bias and confounding variables [67]. What we would recommend, however, are PROSPECTIVE cohort studies with a control group, with participants matched for age, gender and other key variables.

In brief, once a number of prospective studies with a control group have been conducted then well-conducted systematic reviews which are crucially needed can be carried out. Standards similar to those found within the Cochrane collaboration and other similar organizations would need to be followed to provide the much needed reliable and trustworthy evidence for researchers, clinicians, commissioners and most importantly for the patients and families concerned.

Protocol registration

This paper reports on a section of an overview of systematic reviews regarding nonsurgical management of idiopathic scoliosis, including screening and is registered at PROSPERO, CRD York, CRD42013003538.