Introduction

Drooling, or sialorrhea, is a well-recognised health issue in children with disabilities, especially those with cerebral palsy (CP). It can be defined as the unintentional spill of saliva from the mouth [1], even if several other definitions have been reported [2,3,4,5,6]. Although sialorrhea is normal in infants, it is considered pathological after the age of 4 years old [7]. In addition, severe sialorrhea can give rise to a number of limiting physical and psychosocial complications such as social isolation and low self-esteem [1, 8].

Although sialorrhea severity varies daily, and sometimes hourly or depending on daily life circumstances, there is a need to quantify its frequency and its impact on children’s and their caregivers’ quality of life [9]. Various interventions have been described to reduce or eliminate sialorrhea. These include surgery, botulinum toxin (BoNT-A and BoNT-B), anticholinergic medications, and oral-motor therapies [1]. This challenging condition should always be addressed by a multidisciplinary team, specifically by professionals with experience in disability and in children with special needs [10]. However, there currently is a lack of knowledge among paediatricians on how to adequately quantify sialorrhea. In fact, Parr et al. found that very few paediatricians in the UK use standardised methods to measure sialorrhea and the effectiveness of medications or their adverse effects [9]. Hence, the aim of this review was to appraise the measurement properties of drooling measures validated in the paediatric population.

Methods

Search strategy

Supervised by R.O., E.S. performed a systematic electronic literature search of the following databases: PubMed, Scopus, Cochrane Library, and CINAHL (EBSCO). Search terms combined text words and Medical Subject Headings (MeSH), as shown in Supplementary Table 1. MeSH terms included three components: terms referring to drooling/sialorrhea, target population and assessment methods.

Study eligibility

Following the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) checklist [11] (Supplementary Table 2) and after removing duplicates, all full-text articles were screened by two independent researchers; any discrepancies were solved in a consensus meeting. The articles were included if they reported objective or subjective outcome measures of sialorrhea that were appropriate for use in children aged 0–18 years with or without special needs, that were freely-available, and written in English. No date limit was set, to avoid excluding potentially useful evaluation methods and questionnaires. Exclusion criteria were absence of statistical numerical results within the study except for those studies describing an outcome measure for the first time, those only assessing salivary production and those evaluating post-therapeutic outcomes.

Data collection and assessment

Included studies were assessed independently by two researchers. Sialorrhea outcome measures identified in all selected papers were classified depending on two domains: quantitative measures versus parent or proxy reports with quality of life evaluation. Articles were reviewed for the evaluation of qualitative features, such as domain assessed, time needed for questionnaire administration, population, and age of population. Scoring and its interpretation were also extracted. If the article was deemed worthy of inclusion but was lacking specific information, its corresponding author could be contacted for clarifications.

The COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) checklist (July 2019 version) [12] was used to evaluate the methodological quality of each outcome measure described in the included studies. The COSMIN checklist was developed by authors based on previous COSMIN checklist versions [13, 14] and on the COSMIN Risk of Bias checklist for PROMs [1516]. A 4-point rating scale (very good, adequate, doubtful, inadequate) was used to assess each standard recommended by the checklist in each article. As the COSMIN checklist does not provide an overall rating score, we used the “worst-score counts principle” [14] to obtain one.

Data on validity, reliability, and responsiveness (described in Supplementary Table 3) of all measures were also collected, though data collection on construct validity, content validity, and internal consistency was not applicable for quantitative outcome measures. In addition, the quantitative results for each study have been rated against the Terwee et al. [17] criteria.

A positive rating was assigned to sensitivity and specificity when equal or over 0.80 [18], to criterion validity if the correlation with the gold standard was at least 0.70 [17], to reliability when the intraclass correlation coefficient (ICC) or weighted Kappa was at least 0.70 in a sample size of at least 50 patients [17], and to measurement error if authors provided convincing arguments that it was acceptable. A positive rating was given to internal consistency when factor analysis was applied and Cronbach’s alpha was between 0.70 and 0.95 [17]. For responsiveness, the area under the receiver operating characteristic (ROC) curve (AUC) of at least 0.70 or Guyatt’s responsiveness ratio (RR) of at least 1.96 was considered adequate [17]. A gold standard for measuring sialorrhea was considered “gold” only when it was the original long version to which a shortened instrument was compared to. Feasibility was rated as adequate if the test needed up to 10–15 min to be completed and if the questionnaire was self-administered [18]. The primary purpose (predictive, discriminative, or evaluative) of tools evaluating sialorrhea was also assessed [17, 19].

Results

The initial literature search yielded 891 articles. Duplicates (n = 430) were excluded and the remaining 461 “full-text” manuscripts were evaluated. Agreement between the two independent researchers reviewing the articles was high (Cohen’s Kappa > 0.8). Of the 461, 21 studies met the inclusion criteria, as shown in Fig. 1. Only one author (van der Burg) [8] had to be contacted to clarify the exact number of questionnaires developed in his study. Overall, 19 sialorrhea assessment tools were identified (Table 1): 5 quantitative/semi-quantitative outcome measures [20,21,22,23,24] and 14 questionnaires measuring severity and/or impact of sialorrhea on patients’ quality of life [1, 6, 8, 25,26,27,28,29,30,31,32,33,34,35]. Extensive description and explanation of each tool can be found in the Supplementary material.

Fig. 1
figure 1

Diagram of literature search and article selection

Table 1 Measures of sialorrhea

Qualitative and quantitative features

The assessment tools differed in sialorrhea quantification methodology and purpose of assessment.

Sialorrhea was quantified by counting bibs changed daily [20], weighing bibs [21], collecting saliva with a cup [22], direct/standardised observation of sialorrhea episodes over a 5 to 10-min period [23, 24], and through subjective scales or questionnaires [1, 6, 8, 25,26,27,28,29,30,31,32,33,34,35].

The primary purpose of assessment was predictive for bib count [20] and bib weight [21], discriminative for the Modified drooling questionnaire [30], the 5-min Drooling Quotient (DQ5) [24], the Drooling Infants and Preschoolers Scale (DRIPS) [25], and the Blasco Index for the assessment of drooling [1], evaluative for all the others [6, 8, 22, 23, 26,27,28,29, 31, 33,34,35].

Responsiveness data were available only for the Drooling Impact Scale (DIS) [31] and the French version of the Drooling Impact Scale (DIS-F) [32], while other measures had been used in several clinical trials to measure longitudinal changes in sialorrhea after treatment. Data related to target population, sample size, and feasibility are listed in Table 2.

Table 2 Outcome measures, structure, and scoring

Among scales and questionnaires, there was an adequate feasibility for the DRIPS [25], which is self-administered and performed in 15 min, and for the Modified drooling questionnaire [30] that needs a mean administration time of 10 min. Administration time was reported also for the Teacher Drooling Scale (TDS) [27], requiring a full school day observation.

Validity and reliability

With regard to measurement properties, data for both reliability and validity were available for the DQ5 [24], the modified drooling questionnaire [30], the DIS [31], the French version of Drooling Impact Scale (DIS-F) [32], the Brazilian Portuguese language version of DIS [33], the TDS [27], the DQ [23], and the DRIPS [25]. The Drooling Impact Questionnaire (short version) [6, 36], the questionnaire to evaluate impact of drooling on daily living (questionnaires 1 and 2) [8], bib count [20], bib weight [21], the Drooling Severity and Frequency Scale (DSFS) [26], the Visual Analogue Scale (VAS) [29], and the Daniel Drooling Impact Score Questionnaire (DDISQ) [34], reported only data on validity. The remaining outcome measures did not have any measurement properties tested [1, 22, 28, 35]. The different aspects of reliability (i.e. Inter-rater, Intra-rater, test retest) and validity available for each sialorrhea measure are shown in Tables 3 and 4.

Table 3 Outcome measures, validity and responsiveness
Table 4 Outcome measures, reliability

Among instruments with validity and reliability data, the DQ5 [24] and the modified drooling questionnaire [30] had an overall positive score in terms of quantitative results and methodological quality. Specifically, for the DQ5 [24], most measurement properties in the checklist were rated positively with an overall score of ‘very good’. The 5-min Drooling Quotient during activities (DQ5A) was more discriminative for drooling severity than the 5-min Drooling Quotient at rest (DQ5R), with a cut-off point of 18 indicating a constant drooling. Criterion validity had been calculated for the DQ5, showing a positive strong correlation between the DQ5 [24] and the DQ [23]. For inter-rater reliability, the DQ5 showed a higher correlation between the scores of the observers.

The modified drooling questionnaire [30] was rated as ‘adequate’ in terms of content validity. Reliability was rated ‘very good’, as it showed a higher correlation between observers’ scores; a cut-off of 24 discriminates between mild and severe drooling.

For the DIS [31], the DIS-F [32], and the Brazilian Portuguese language version of DIS [33], although most items of measurement properties in the checklist were rated positively, the overall score was rated as ‘doubtful’, due to lack of clarity on how missing items were handled. For both TDS [27] and DQ [23], measurement analysis was considered unsatisfactory. The overall score given to the measurement properties tested in the DRIPS [25] ranged from ‘adequate’ to ‘very good’.

The quality scores using ‘worst score counts’ [14] criteria are reported in Table 5. Data on validity and responsiveness of studies are summarised in Table 3; data on reliability are summarised in Table 4.

Table 5 Outcome measures, quality appraisal

Discussion

The paucity of reviews in the medical literature about sialorrhea measurements in children has not allowed a robust use of assessment tools by paediatric experts in disability. Our review has highlighted that although there is a wide range of approaches in the clinical practice to assess children’s saliva management, very few sialorrhea outcome measures are currently available to guide medical decision-making. Clinical evaluation of children with sialorrhea includes a thorough anamnestic collection and physical examination. Paediatric history should focus on age of sialorrhea onset, chronicity, precipitating factors, associated symptoms, developmental history, use of medications as well as family, perinatal history, or past pathologic data. Data acquisition can be expedited by questionnaire administration, resulting in multiple benefits. In fact, this is a reasonable and time-sparing procedure for clinicians to measure sialorrhea severity and its impact on both quality of life and routine daily life. It also allows planning intervention programs and periodically measure outcomes of each intervention. Questionnaire administration can also facilitate a comprehensive evaluation and improve clinician familiarity with sialorrhea assessment. The measures described in this review could be categorised in two main groups: the first aimed at discriminating children depending on severity of sialorrhea and the second aimed not only at evaluating severity, but also sialorrhea impact on children and parents’ lives. Moreover, treatment of sialorrhea can be considered effective not only if its severity decreases, but also if it lessens its impact on the caregiver and improves the child’s quality of life.

Among all assessment instruments that we analysed, only few of them have a description of psychometric properties. Nevertheless, some of the measures reporting their internal attributes can be properly used to assess sialorrhea. Specifically, the DIS [31], the DIS-F [32], and the modified drooling questionnaire [30] can be used as valid and reliable measures of drooling severity and social acceptability in children with developmental disabilities and CP dealing with sialorrhea. Moreover, the DIS [31] and the DIS-F [32] were the only evaluative tools with responsiveness data, being useful for detecting clinically important changes over time. Instead, the modified drooling questionnaire [30] can be used as a discriminative tool, and is also the first questionnaire validated in the Indian paediatric population with CP.

Furthermore, clinicians may undertake an accurate classification of sialorrhea through a quantitative measure: specifically, the physician can objectively assess sialorrhea frequency using the DQ5A in children with developmental disability and moderate-to-profuse sialorrhea [24]. Discriminative properties for the DQ5 in children with infrequent and slight drooling and population groups other than children with developmental disabilities have not been studied yet. Moreover, among questionnaires, the DRIPS [25] can be used by clinicians to monitor sialorrhea, due to the presence of charts created with a reference cohort of children with typical development.

The integration of patient-reported outcomes into clinical care is becoming a standard practice [37]. For children who drool, the subjective opinion of parents provides insight on drooling severity and its relevance, while quantitative methods can help to corroborate subjective findings. For these reasons and as previously reported by van Hulst et al. [24], sialorrhea evaluation should cover quantitative measures and parent or proxy reports in both clinical and research contexts.

Strengths and limitations

The present review provides insights into the current evidence on the available outcome measures of sialorrhea in children. It also describes important measurement properties that enable dedicated healthcare professionals to choose the best available outcome measure. Strength of this review is the use of a rigorous and stringent methodology. As suggested by the COSMIN checklist [12], the ‘‘worst score counts’’ principle [14] was used to obtain a methodological quality score for each measurement property. A poorer score on any item was considered to represent a fatal flaw. Publication bias is a frequent limitation in most systematic reviews: although many efforts were made to seize all studies, some potentially relevant studies might have been excluded. Specifically, language restriction was an important limitation because it led to the exclusion of a substantial number of potentially relevant studies.

Future research

Further studies investigating the properties of sialorrhea outcome measures are needed in order to obtain more robust data. Outcome measures should be also evaluated in different population groups. An electronic format of these same tools should be also provided, to obtain real-time data in case face-to-face consultations are not deliverable.

Conclusions

The measures included in this systematic review varied in the evaluation methods and domains assessed, and measurement properties were often not available. Our findings suggest that a combination of both quantitative measures and parent/proxy questionnaires might provide an adequate measurement of sialorrhea in children. Given the high rates of moderate and severe sialorrhea in different paediatric conditions with disability, the use of valid and reliable measures of sialorrhea might improve physicians’ confidence in its evaluation, support clinical decision-making, enhance efficacy of follow-up after treatments, and optimise research quality.