Introduction

The treatment of children with spinal manipulative therapy (SMT) by chiropractors is controversial [1,2,3]. A recent study of Ontario chiropractors suggests that 5.5% of patients who consulted a chiropractor in the previous month are between the ages of 0 and 14 years [4]. A cross-sectional survey of 140 chiropractors in Alberta indicates that all respondents treated children between 0 and 18 years of age and 13% of all visits in the previous month were with children [5]. Worldwide, the estimated 12 month median utilization and interquartile range (IQR) of chiropractic care is 8.1% (IQR 3.8–20.00), with a lifetime median utilization of 11.1% (IQR 4.0–21.6) for patients 18 years old or less [6]. Children visit chiropractors for a variety of reasons, including health promotion, asthma, otitis media, allergies, infantile colic, tonsillitis, ADHD, and enuresis, but most commonly for musculoskeletal (MSK) problems [6,7,8,9,10,11,12].

Although the effectiveness of chiropractic care for the management of pediatric MSK and non-MSK conditions is debatable, most of the controversy surrounds the safety of SMT in children [3, 13]. A survey of Canadian pediatricians suggests that serious adverse events of SMT in children may be rare [14]. Previous systematic reviews identified case reports of serious adverse events of SMT including death and temporary paraplegia [13, 15]. However, these systematic reviews did not assess the methodological quality of the included studies. This is problematic because of the high risk of bias associated with such reports [16, 17]. Although previous systematic reviews reported on the type and frequency of adverse events, they did not describe the incidence of adverse events, or determine whether SMT increases the risk of adverse events compared to other interventions [13, 15, 18]. Therefore, little evidence is available to understand the risk of adverse events associated with SMT in children [13, 15, 18]. There is a need to update these reviews to inform the current policy debate about the safety of SMT in children.

To assist in informing this public debate, the College of Chiropractors of British Columbia called for a review of the evidence on the safety of SMT for children under the age of 10 years [19]. At the request of the College of Chiropractors of British Columbia, we conducted an independent rapid review of observational studies and randomized controlled trials (RCTs) to investigate the type and risk of adverse events in children under the age of 10 years (0–9 years) who receive SMT from any health care provider. Specifically, we aimed to: 1) describe the reported adverse events; 2) report the incidence of adverse events; 3) determine whether SMT is associated with an increased risk in adverse events compared to other interventions used to manage children for any health condition, or the promotion of health and wellness.

Methods

Rapid reviews are used by health decision-makers (clinicians, patients, managers, and policy makers) who need timely access to health information to plan, develop and implement health policies [20, 21]. Rapid reviews are a valuable method to provide actionable and relevant evidence to make informed decisions in a short amount of time [20, 21]. They follow the key principles of knowledge synthesis used in systematic reviews, including clear objectives, a priori definition of eligibility criteria, a systematic search for relevant evidence, assessment of validity of findings and a systematic presentation and synthesis of results [20]. However, certain components of the systematic review process are simplified or narrowed to produce information in a timely manner, such as a focused research question, limited databases searched, and one reviewer for screening, critical appraisal and data extraction using standardized established procedures [21]. We followed the methodology recommended by the World Health Organization [20].

Protocol and registration

We reported our review according to the PRISMA and PRISMA harms checklists (Additional file 3) [22, 23]. We registered our review with the International Prospective Register of Systematic Reviews (PROSPERO) on August 1, 2019 (CRD42019145581).

Eligibility criteria

Participants

We included studies of children between 0 and 9 years of age [24] who received spinal manipulation or mobilization for the prevention or management of any health condition (i.e., MSK or non-MSK disorders) or for the promotion of health and wellness. We did not restrict our review to studies that reported adverse events, rather we considered any study that focused on the treatment of children between 0 and 9 years old with SMT.

Interventions

SMT includes spinal manipulation and spinal mobilization provided by any type of provider. Spinal manipulation includes techniques incorporating a high-velocity, low-amplitude impulse or thrust applied at or near the end of a joint’s passive range of motion [25, 26]. Spinal mobilization includes techniques incorporating a low-velocity and varying amplitude oscillatory movement within a joint’s passive range of motion [26,27,28]. Spinal manipulation and mobilization involve manual and mechanically-assisted procedures.

Comparators

We considered all control interventions tested in cohort studies and RCTs to determine the relative risk of adverse events. This may include, but are not limited to placebo, sham manual therapies, wait listing, usual care, no interventions, medication and other manual therapies.

Outcomes

We investigated adverse events including indirect harms (Table 1). We defined adverse events as any unfavorable sign, symptom, or disease temporally associated with the treatment, whether or not caused by the treatment [29]. We used predefined categories to rate them as mild, moderate, severe or serious [29]. We also considered indirect harms, where the use of an intervention delays a diagnosis or treatment, and such delay holds a potential harm [14]. We used the classification adapted by a multi-disciplinary team of content experts and providers of SMT [29]. We critically appraised studies reporting on adverse events and then classified the severity and nature of all reported adverse events according to the definitions provided in Table 1. To be eligible for inclusion, case reports and case series had to describe that the presence (or absence) of adverse events was investigated.

Table 1 Classification of adverse events [14, 29]

Study designs

Eligible study designs included: case report, case series, case-control study, cohort study or RCT. We used case reports, case series, case-control studies, cohort studies and RCTs to describe the adverse events reported in the literature (aim 1). We used cohort studies and RCTs to determine the incidence of adverse events associated with SMT (aim 2), and to determine the relative risk of adverse events associated with SMT compared to other interventions (aim 3).

We excluded guidelines, letters, editorials, commentaries, unpublished manuscripts, dissertations, government reports, books and book chapters, conference proceedings, meeting abstracts, lectures and addresses, consensus development statements, guideline statements, cadaveric, laboratory or animal studies, qualitative studies, systematic reviews and meta-analyses.

Information sources

We developed our search strategy in consultation with a health sciences librarian, and a second librarian reviewed the strategy to ensure accuracy. We systematically searched three databases that thoroughly index the manual therapy literature published by various health professions from January 1, 1990 to August 1, 2019: MEDLINE (U.S. National Library of Medicine, through Ovid Technologies Inc.), Cumulative Index to Nursing and Allied Health (CINAHL, through EBSCOhost), and Index to Chiropractic Literature (ICL, Chiropractic Library Collaboration). Search terms consisted of subject headings specific to each database (e.g., MeSH in MEDLINE) and free text words relevant to our objectives and study design [see Additional file 1]. We restricted our search to papers published in English.

Study selection

We used a two-phase screening process to identify eligible studies. In phase one screening, we reviewed titles and abstracts and classified articles as relevant, possibly relevant or irrelevant. During phase two screening, we reviewed the full text of possibly relevant articles for final determination of eligibility.

A trained investigator (MC) conducted all of the screening. Prior to phase one screening, we validated the quality of screening by MC. Ten percent of all eligible articles were randomly selected and the titles and abstracts of these articles were screened independently by a second experienced investigator (PC). A 95% level of agreement was required between two reviewers before moving to full phase one screening. Once the 95% agreement was achieved, one reviewer (MC) completed phase one and two screening.

Risk of Bias in individual studies

The lead author (MC) critically appraised the internal validity of relevant articles using the Scottish Intercollegiate Guidelines Network (SIGN) criteria for RCTs, cohort studies and case-control studies [30, 31]. The SIGN methodology provides the reviewer with a list of standardized criteria to determine the risk of bias related to selection bias, measurement bias and confounding. When evaluating an RCT, the reviewer assessed methods of randomization, concealment, blinding, comparability of baseline characteristics, contamination, outcome measurement, loss to follow-up, intention-to-treat and between site differences (for multi-center RCTs). In cohort studies, the items focus on the source population, participation rate, drop-out rate, outcome measurement, blinding, exposure measurement, confounders and statistical analysis. We did not identify any case-control studies; thus, we do not describe the SIGN criteria for this study type.

There are no SIGN criteria for case reports or case studies. Therefore, we adapted the critical appraisal tool proposed by Murad et al. to assess the quality of case reports and case series. We modified the tool by creating a series of critical appraisal criteria and notes similar to the SIGN criteria [see Additional file 2] [17]. The adapted Murad tool allowed us to critically appraise patient selection, exposure and outcome measurement, alternative causes, challenge-rechallenge phenomenon, dose-response, and length of follow-up.

We also included a quality control step in the critical appraisal of studies. The investigator who assessed the risk of bias of the studies (MC) presented a summary of the critically appraised papers to three experienced methodologists (PC, SM, CC) who validated the outcome of the appraisals. Disagreements regarding the internal validity of papers were resolved through discussion. We restricted our synthesis to studies agreed judged to have a low risk of bias. The lead author created risk of bias tables for all eligible studies (including low and high risk of bias studies), which were validated by the other investigators (PC, SM, CC). Studies were rated as high quality, acceptable, low quality or unacceptable.

Data items

Information extracted from each RCT included participant characteristics (age, indications for treatment or condition treated, and location); sample size; type and description of intervention; type and description of comparison group; follow-up period; method of outcome ascertainment; and number and description of adverse events. Information extracted from each cohort study included source population; sample size; participant characteristics (age, indications for treatment or condition treated); exposure/description of intervention; method of outcome ascertainment; confounders; follow-up period; and number and description of adverse events. Information extracted from case reports and case series included participant characteristics (age, indications for treatment or condition treated); type and description of intervention; method of outcome ascertainment; follow-up period; and number and description of adverse events.

Data extraction

The lead author (MC) extracted data from high and acceptable quality studies and built evidence tables stratified by research objective (Tables 2, 3, 4 and 5). Data extraction was validated by one of three reviewers (PC, CC, SM). We contacted the study authors when clarification or additional information/data was necessary to build the evidence tables [32, 33]. Evidence tables summarized the information relevant to each objective and we used this information to create the summary statements. We used the terminology used by authors to describe adverse events when building our evidence table. For example, authors may have labelled adverse events as “side effects” or adverse reactions [34, 35]. We considered these terms were synonymous to adverse events in our synthesis.

Table 2 Evidence Table: Randomized Controlled Trialsa
Table 3 Evidence Table: Cohort Studiesa
Table 4 Evidence Table: Case Series
Table 5 Evidence Table: Case Reports

Statistical analyses

When data were available, we computed the incidence (and 95% confidence intervals) of adverse events and relative risk (and 95% confidence intervals) from RCTs and cohort studies. Incidence was measured by calculating the number of adverse events in a group divided by the total number of participants in the same group [36]. Relative risk was measured by dividing the incidence of adverse events in the intervention group by the incidence of adverse events in the comparison group [36]. Confidence intervals (CI) were calculated using incidence of adverse events in each group, total number of participants in each group, and α = 0.05.

Evidence synthesis

We used best evidence synthesis methodology to synthesize evidence from high and acceptable quality studies [37]. The evidence synthesis provides conclusions based on the best available evidence or may conclude that there is insufficient evidence to make any conclusions [37].

Reporting of outcomes

For the two RCTs included in our review, we checked the clinical trials registry for evidence of selective reporting of outcomes or protocol changes. Sawyer et al. was not found in the registry, as it was published in 1999. We retrieved the protocol by Miller et al. (#NCT01513304) which listed a daily crying diary as the primary outcome (no further information about secondary outcomes was provided).

Results

Study selection

Our search retrieved 1812 citations (Fig. 1). We removed 69 duplicates and screened 1743 articles. Interrater agreement for phase 1 screening was 95.4% between MC and PC. We screened 215 full-text articles (phase 2). Of those, 33 articles met the inclusion criteria and were eligible for critical appraisal. Reasons for exclusion during phase 2 screening were ineligible publication type (n = 24), population ≥ 10 years old (n = 33), intervention did not include SMT (n = 43), and outcomes did not include adverse events (n = 73).

Fig. 1
figure 1

Flow diagram of study selection

Risk of Bias within studies

We critically appraised 33 studies. Of those, eight had a low or unacceptable quality and were excluded from the evidence synthesis [38,39,40,41,42,43,44,45]. These included seven case reports or case series, and one cohort study. High risk of bias studies had the following methodological limitations: 1) inadequate outcome ascertainment (n = 4) and 2) inadequate exposure ascertainment (n = 8) (Tables 6, 7 and 8). Twenty-five studies with high or acceptable quality were included in our evidence synthesis [32,33,34,35, 46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66]. Nevertheless, these studies had some methodological limitations including inadequate reporting of adverse events measurement (Tables 6, 7 and 8). We contacted authors from two studies to inquire about the measurements of adverse events [32, 33]. Sawyer et al. clarified that events were measured from reports by the chiropractor at each visit. Similarly, Saedt et al. indicated that adverse events were reported by the manual therapist after each treatment, and further defined the various vegetative responses reported as adverse events. For example, hyperextension (of the trunk or cervical spine) is a reaction an infant can make when they feel discomfort and perspiration is a slight sweating reaction of the infant to the intervention.

Table 6 Risk of Bias table: Low risk of bias - RCT
Table 7 Risk of Bias table: Cohort Studies
Table 8 Risk of Bias table: Case Report & Case Series

Study characteristics

We included two RCTs, [32, 61] one cohort study [33] and 22 case reports/series [34, 35, 46,47,48,49,50,51,52,53,54,55,56,57,58,59,60, 62,63,64,65,66] (Tables 2, 3, 4 and 5). All spinal manipulations and mobilizations were provided by chiropractors. One RCT tested the effectiveness of spinal mobilization for the management of infants with unexplained persistent crying [61]. The second RCT investigated the effectiveness of high-velocity, low-amplitude SMT for the management of children with acute otitis media [32]. In the cohort study, spinal mobilization was used to treat infants with upper cervical dysfunction [33]. In five case reports or case series the intervention was spinal mobilization, [34, 53, 55, 62, 66] seven provided instrument-assisted SMT, [35, 47, 48, 56, 58, 60, 65] and ten provided high-velocity, low-amplitude SMT modified for the age and development of the patient [46, 49,50,51,52, 54, 57, 59, 63, 64].

Description of adverse events

Adverse events were described in five studies; with one study describing a severe adverse event (case report), one describing an indirect harm (case report), and three studies describing mild adverse events (one RCT, one cohort study, one case series).

Regarding the severe adverse event, acute fractures of the posterior 7th and 8th ribs occurred in a 21-day-old female treated for fussiness and colic, after the use of a spring-activated device on the infant’s back [48]. An indirect harm occurred in a 4-month-old male who presented to a chiropractor with head tilt. He was treated with spinal manipulation and a diagnosis of congenital spinal cord astrocytoma was delayed. After the second visit, the child was hospitalized and parents reported the child was difficult to arouse from sleep, limp, pallor, moaning, poor feeding and fever [51].

Three studies described mild adverse events. In an RCT of children between the ages of 6 months and 6 years with acute otitis media, one parent reported mid-back soreness and one reported irritability during a course of 10 treatments of high-velocity, low-amplitude SMT to the cervical region [32]. In a cohort study of infants less than 27 weeks old with indications of upper cervical dysfunction, adverse events reported by the chiropractor included vegetative responses such as flushing, hyper-extension, perspiration, and gastro-esophageal reflux, and short breathing pattern changes after mild mobilization techniques focusing on C1 [33]. In a case series, four male and female 5 to 8-week-old infants presenting with colic were treated with cervical and thoracic pediatric SMT. Parents reported adverse events of poor feeding, mild distress, and increased crying [50]. The parent of a 6-week-old female infant reported a head tilt after cervical SMT [50]. A 17-week-old male infant started to cry immediately after pelvic SMT which resolved after a corrective SMT, with no residual complaints [50]. The mother of a 12-week-old female infant reported crying during and following a cervical spine SMT treatment for suboccipital strain [50].

No adverse events associated with SMT were reported in one RCT, [61] six case series [34, 35, 47, 49, 56, 66] and 13 case reports [46, 52,53,54,55, 57,58,59,60, 62,63,64,65].

Incidence of adverse events

We computed the incidence of adverse events associated with SMT using data from two RCTs and one cohort study (Tables 2, 3, 4 and 5). In a RCT of infants less than 8 weeks old treated for unexplained persistent crying, the incidence of increased crying in the low force SMT group was 0% compared to 2.94% (95% CI: 0.52, 14.92) in the no SMT group [61]. In another RCT of children between the ages of 6 months and 6 years old with otitis media, the incidence of mild adverse events (mid-back soreness and irritability) was 22.22% (95% CI: 6.32, 54.74) after full spine high-velocity, low-amplitude SMT (focusing on the cervical region) compared to 9.09% (95% CI: 1.62, 37.74) in the placebo group (increased crying) [32]. In a cohort study of infants less than 27 weeks old treated for upper cervical dysfunction with mild mobilization techniques of C1, the incidence of clinician recorded mild adverse events ranged from 0.3% (gastro-esophageal reflux) to 17.8% (flushing) [33].

Association between SMT and adverse events

In one small RCT of children with acute otitis media treated with high-velocity, low-amplitude SMT primarily to the cervical region (nSMT = 9; nplacebo = 11), the relative risk of mild adverse events associated with SMT compared with placebo was 2.44 (95% CI: 0.26, 22.8) [32].

Discussion

A limited body of evidence of adequate methodological quality is available to describe and quantify the risk of adverse events associated with SMT in children under 10 years of age. Although serious adverse events are reported in the literature, the risk of serious adverse events remains unknown in this population [48, 51]. Most studies report mild and transient adverse events (e.g., increased crying, soreness, irritability). Our review suggests that the risk of mild adverse events ranges from 0.3% (95% CI: 0.06, 1.82) in infants < 27 weeks old treated with mild mobilization of C1 for upper cervical dysfunction to 22.22% (95% CI: 6.32, 54.74) in children aged between 6 months and 6 years treated for otitis media. We observe that the methods used to measure adverse events are of questionable validity and reliability.

Our review improved on the methodology of previous reviews by evaluating the internal validity of case reports and case series. This is important because the quality of these designs is highly variable and therefore even the description of an observation can be biased [67, 68]. In comparison to Vohra et al. [13], our study collected one additional case report published in 2012, which reported rib fractures in a 21-day-old infant [48]; this study was included in other reviews [15, 18]. Our review did not include 11 of the 13 studies included in the review by Vohra et al. [13]; one case study was in German, four studies (one RCT, one case report and two case series) did not differentiate a population less than 10 years old, and six studies (four case reports and two case series) were published prior to 1990. In comparison to Todd et al. [15], our review did not include 28 of the 31 studies included in their review. One case report was not in English, seven studies (two RCTs, one cohort study, three case series and one case report) did not distinguish between or include participants less than 10 years old, eight studies (two RCTs, four case series and one case report) did not provide SMT or were unclear whether every participant received SMT, and 12 were systematic or narrative reviews. A number of adverse events in this review were summarized from other systematic or narrative reviews and not directly from the original report in the literature. In comparison to Humphreys et al., [18] our review did not include three case series because one case series did not distinguish participants under the age of 10 years and two case series did not make it clear that the participants were receiving SMT. None of the previously published reviews included the RCT by Miller et al. (2012), which reported no adverse events in the SMT group and one mild adverse event in the comparison group, [61] nor the cohort study by Saedt et al. (2018) that reported multiple mild adverse events [33]. None of these reviews appraised the methodological quality of studies included in their evidence synthesis. Finally, our review improves on the quality of the other systematic reviews because we computed incidence rates and relative risks where possible.

Methodological differences between previous reviews and our review are important to note because variations in methodology can lead to different conclusions [69, 70]. In their review, Vohra et al. report nine serious adverse events and 20 cases of delayed diagnosis associated with SMT [13]. Similarly, the review by Todd et al. reported 15 serious adverse events and 775 mild to moderate adverse events following manual therapy [15]. Our results differ from the findings of those reviews because we excluded studies of poor methodological quality, and those where the use of SMT was unclear [13, 15]. Moreover, our review is up to date and includes recently published studies. We improved on the methodology of previous reviews [13, 15, 18] by excluding case reports where the exposure (i.e. SMT) and outcome (i.e. adverse event) were not adequately measured. Serious and severe adverse events following SMT may be inappropriately reported in the literature [13, 15] because most included studies that were not designed nor powered to measure these rare events. Therefore, we recommend future epidemiological studies be designed to specifically estimate the incidence of adverse events within well-defined populations of individuals treated with SMT.

Strengths of our study include adhering to PRISMA and PRISMA harms reporting checklists (Additional file 3), [22, 23] a protocol established prior to completion and registration with PROSPERO, a clear research question, a robust literature search strategy reviewed by two librarians, screening inter-rater reliability comparison, critical appraisal of eligible studies and a review process by senior scientists at each step of the rapid review. We also included all original research study types (RCTs, cohort studies, case-control studies, case series and case reports) to adequately inform each of our research aims and provided a full electronic search strategy for at least one database including limits used, so our search could be repeated [23]. Our study has limitations. We may have missed studies in our search. A recent study demonstrated that searching MEDLINE alone captured 92% of papers per systematic review regarding adverse effects of surgical interventions but only 65% of those for non-surgical similar effects [71]. The authors recommended that for non-surgical interventions, the search filter should include specific and generic adverse events terms and clearly specify the intervention. Consistent with this recommendation, our search filter included specific terms related to adverse events and harms, as well as SMT related terms. We applied this filter to three databases that capture the majority of manual therapy studies, and that also complies with the WHO rapid review methodology guidelines. We only included studies published in English; however, the majority of studies are published in English [72]. Finally, we only included participants less than 10 years old. While this was specific to our research question, many studies do not categorize participants based on age, which led to a large number of studies being excluded from our review.

On September 30th, 2019, the College of Chiropractors of British Columbia released a public notice reporting on the results of the board regarding SMT treatment of children under the age of 10 years [73]. They determined that the treatment of children with SMT does not pose a significant risk to the public and the College of Chiropractors of British Columbia is not pursuing regulatory action at this time [73]. The results of our rapid review were not the only documents reviewed in their analysis. We did not assess the efficacy or effectiveness of SMT for the management of children under the age of 10 years. Therefore, the results of our review cannot be used to make inferences about the risk-benefit ratio of SMT in this population. We recommend that the future development of public and regulatory policies about the use of SMT to treat children include a review of the literature on the efficacy and/or effectiveness of SMT.

Very little is known about the risk of severe and serious adverse events related to SMT in children below the age of 10 years. In one of the few population-based active surveillance studies involving 2500 Canadian pediatricians and pediatric subspecialties, providing coverage to about 7 million children less than 18 years of age, 12 cases of serious adverse events were reported over a two-year period. Of these, eight were adjudicated for risks associated with complementary and alternative medicine use, of which one was possibly related to SMT provided by a chiropractor in a 13-year-old patient. No serious adverse events involving SMT in patients less than 10 years of age were reported [14]. This suggests that these severe and serious adverse events are rare in the general population and studying this issue would require a large sample size. We recommend the implementation of a population-based active surveillance program to measure the incidence of severe and serious adverse events following SMT treatment in this population. Research is needed to determine the incidence of adverse events associated with SMT in children. Moreover, future research must improve on the clarity and definition of SMT and use a standardized and valid tool to measure adverse events in children [14]. Epidemiological studies are also urgently needed to determine whether SMT is associated with an increased risk of severe and serious adverse events. Due to their rarity, it is unlikely that a large enough number of severe and serious adverse events can be captured in RCTs designed to determine the effectiveness of SMT. Therefore, we recommend the design and conduct of population-based case-control (including case-crossover studies), or cohort studies to measure the association between SMT and severe or serious adverse events. It is likely that this type of research can only be done using high-quality population-based administrative databases.

Conclusion

Most studies report mild and transient adverse events (e.g., increased crying, soreness, irritability) in children under 10 years old receiving SMT. The risk of moderate and severe adverse events is unknown in children treated with SMT. It is unclear whether SMT increases the risk of adverse events in children < 10 years old. Research is urgently needed to determine the incidence of adverse events associated with SMT in children.