Introduction

Pediatric traumatic brain injury (pTBI) is a global health problem affecting millions of children and adolescents worldwide [1]. It can have a profound impact on multiple aspects of a young individual’s life, including their physical, cognitive, emotional, and social well-being [2]. These outcomes are directly related to health-related quality of life (HRQoL), which remains understudied in pTBI patients [3].

To date, only generic HRQoL measures have been available for use in the pTBI context. However, such generic measures, while useful for comparisons between different disease populations [4], have been found to be too general to sensitively capture the outcome of a specific health condition [5]. Disease-specific HRQoL instruments, on the other hand, provide a more comprehensive assessment by capturing the individual aspects of life affected by a particular disease [6]. Until now, HRQoL after pTBI has been assessed using either generic instruments such as the Pediatric Quality of Life Inventory [7] as a self-report measure, or, more recently, the disease-specific Traumatic Brain Injury Quality of Life Communication Item Bank [8] using the parent perspective, which covers however functional communication only. Given the advantages of disease-specific measures over generic ones, the ability of children to evaluate their own HRQoL [9], and the fact that parent–child assessments of HRQoL are not always congruent [10,11,12], age-appropriate TBI-specific patient-reported outcome measures (PROM) are needed.

To fill this gap, the 35-item self-report Quality of Life after Brain Injury for children and adolescents (QOLIBRI-KID/ADO) questionnaire was developed as the first instrument to assess HRQoL in German-speaking children and adolescents after pTBI [13, 14]. It corresponds to the 37-item adult version (but omits items on satisfaction with sexual life and finances) [15], covers six domains (Cognition, Self, Daily Life & Autonomy, Social Relationships, Emotions, and Physical Problems) that are particularly relevant for monitoring after (p)TBI [15], and aims to assess TBI-specific HRQoL in the pediatric context and to measure the burden of pTBI across the lifespan.

One of the notable advantages of short scales is their economy and ease of administration. These features are particularly important when working with pediatric populations, as their attention span and fatigue tolerance may be limited in general and after pTBI in particular [2]. The development and use of a short overall scale (i.e., QOLIBRI-OS-KID/ADO) may therefore facilitate the collection of information and allow for administration after more severe pTBI without placing additional burden on the young patients. The six-item adult version has already demonstrated robust psychometric properties [16] and high sensitivity to different patient groups [17], making it an attractive tool in both clinical and research settings. The applicability of the overall scale in adults underscores the potential of a similar measure for children and adolescents. By utilizing a short and valid instrument such as the six-item QOLIBRI-OS-KID/ADO questionnaire, clinicians can efficiently assess the comprehensive aspects of HRQoL dimensions in pTBI patients in a condensed form (i.e., by rating overall satisfaction within each domain), thereby optimizing resources and minimizing patient burden.

The availability of reference values from comparable general populations facilitates the interpretation of questionnaire scores obtained from individual patients. A comparison of the level of HRQoL after pTBI with the level of HRQoL in the general pediatric population will provide a more realistic assessment of the post-injury burden.

With these considerations in mind, this study aims to investigate the psychometric properties and the applicability of the recently developed German six-item QOLIBRI-OS-KID/ADO questionnaire. The objectives are to examine the psychometric properties within the framework of classical test theory and to establish reference values based on a German general population sample of children and adolescents.

Methods

Participants

pTBI sample

Individuals after pTBI were recruited based on administrative databases of 12 medical centers in Germany and Austria between February 2021 and February 2022. Medical records of potential participating children and adolescents were retrieved from the databases and screened for eligibility. Participants were eligible to participate if they were 8–17 years of age, had been diagnosed with pTBI three months to ten years prior to enrollment, had information on pTBI severity (either based on Glasgow Coma Scale score [18] or medical reports), and were able to understand and respond to questions in German. Children and adolescents with pre-injury epilepsy, very severe polytrauma, severe premorbid mental illness, or life-threatening health conditions were not eligible to participate. Mail invitations were sent to the parents or legal guardians of children and adolescents who met the inclusion criteria. These invitations explained the purpose of the study, the setting, and contact information for study participation. Written informed consent was obtained from all participants and/or legal guardians at the time of study enrollment if parents consented to participate. Interviews with children and adolescents were conducted face-to-face, with the assistance of trained investigators, at separately arranged appointments (online or on-site). Parents completed paper–pencil versions of the questionnaires. A priori power analyses indicated that the primary objective of the project (i.e., to develop the long version of the TBI-specific HRQoL instrument for children and adolescents and to test its psychometric properties) would require a sample of approximately 140 individuals per age group [19]. A total of 297 children and adolescents were included in the analyses (Fig. 1a).

Fig. 1
figure 1

Sample attrition for the a pTBI sample and for the b general population sample

General population sample

Individuals from general population were recruited via an online survey between March 2022 and April 2022 with the assistance of two international market research agencies based in Germany (Dynata: https://www.dynata.com; respondi: https://www.respondi.com; last accessed 05.07.2024). The agencies used their databases to identify and contact parents of children and adolescents aged 8–17 years. Parents were first informed of the purpose of the data assessment and the privacy policy and were asked to provide consent to assessing of sensitive information (i.e., their children’s health data). They were asked if their child had experienced pTBI or any other life-threatening condition. If they responded affirmatively, study participation was terminated. Otherwise, they continued to answer sociodemographic questions and were then asked about their child’s availability. If the child was available, they were invited to participate, or the survey could be continued later. As an incentive, participants received tokens or certificates. The planned sample size was N = 2000 with an even distribution of children and adolescents to obtain robust reference values, taking into account that stratification by other characteristics such as gender may be necessary.

To ensure the data quality, participants who provided inconsistent or unusable information (e.g., conflicting statements of being both completely satisfied and completely bothered), those with contradictory responses (e.g., reporting no health problems but providing comments in the text box), and those who completed the survey in less than five minutes were excluded. A total of 1997 children and adolescents were included in the analyses (Fig. 1b).

Instruments

The Quality of life after brain injury for children and adolescents – overall scale (QOLIBRI-OS-KID/ADO) is a six-item PROM that assesses the overall satisfaction and corresponds to the adult version [16]. The items relate to five out of the six domains (i.e., Physical problems, Cognition, Emotions, Autonomy, Social aspects) of the 35-item version, the QOLIBRI-KID/ADO [13, 14], supplemented by a question on Future prospects. Answers are to be rated on a five-point Likert-type scale (from “Not at all” to “Very”) with a two-week recall period. The scale is accompanied by smileys, which make it easier to rate, especially for younger children. The total score is calculated as the mean of the item responses, transformed into a scale ranging from 0 (lowest possible HRQoL) to 100 (highest possible HRQoL). The latter is done by subtracting 1 from each item score and multiplying by 25. Supplementary material provides information on the development of the questionnaire and the final wording (Development of the QOLIBRI-OS-KID/ADO and Table S1). For administration to the general population, the reference to injury was removed from the instructions. Table 1 summarizes information on further instruments administered in the study.

Table 1 Overview of further instruments administered in the study

Sociodemographic and health-related information

In both samples, sociodemographic information included age, gender, and school attendance. In the pTBI sample, health-related information included parent-reported health status (presence of chronic health conditions, neurological disorders, and/or developmental health conditions), TBI severity (mild, moderate, severe), time since injury in years, and functional recovery status assessed by the King’s Outcome Scale for Childhood Head Injury (KOSCHI) [27] with the following categories: (3a/b) lower/upper severe disability, (4a/b) lower/upper moderate disability, and (5a/b) good/intact recovery.

In the general population sample, health status information was reported by parents on nine health domains: central nervous system disease, alcohol and/or psychotropic drug abuse, active or uncontrolled systemic disease, psychiatric disturbances, severe sensory disturbances, use of psychotropic or other drugs, intellectual disability or other neurobehavioral disturbances, problems before, during, and after childbirth, and other. A participant was considered to have at least one chronic condition if at least one domain was endorsed.

Statistical analyses

To ensure comparability of the results of the QOLIBRI-OS-KID/ADO with its long version, we largely followed the analysis scheme of the QOLIBRI-KID/ADO study [13]. For descriptive analyses at the item level, we calculated mean (M), standard deviation (SD), skewness (SK), and frequency of item responses. Skewness was classified as absent (− 0.5 and 0.5), moderate (± 0.5 and ± 1), and high (> ± 1) [28]. For the QOLIBRI-OS-KID/ADO total score, we reported the M, SD, median (Mdn) and quartiles (Q1 and Q3).

Differential item functioning (DIF) analyses using a combination of ordinal logistic regression and item response theory (LORDIF) were conducted to investigate whether the aggregation of data from children (aged 8–12 years) and adolescents (aged 13–17 years) is appropriate. For each item, two LORDIF models were compared separately in the pTBI and the general population samples: Model 1 including ability (i.e., the level of the latent trait, HRQoL, measured by an item), and Model 2 including ability, age category, and the interaction between age category and ability. DIF was considered absent if a non-significant difference (p > 0.01) was found and the effect measured by McFadden’s pseudo R2 was < 0.05 [29]. In the absence of DIF, psychometric analyses can be conducted using aggregated data from children and adolescents.

The reliability was examined in both samples using Cronbach’s α and McDonald’s ω (for both characteristics, values ≥ 0.70 were considered desirable [30]). The reporting of McDonald’s ω, which is calculated based on the factorial structure of a measure and thus accounts for variation in the correlation strength between items and the latent factor [31], was chosen to further support the internal consistency results. We calculated Cronbach’s α while omitting each of the six items to ensure that the scale reliability did not increase when one of the items was removed. Associations between the items and the total score were explored using corrected item-total correlations (CITC > 0.40) [32]. Test–retest reliability was examined in a subsample of patients who completed the questionnaire on two occasions, 10 to 20 days apart. For this purpose, the intraclass correlation coefficient using a two-way random effects model (i.e., ICC(2,1)) was calculated on the total score level, with ICC > 0.60 [33] considered desirable. The standard error of measurement (SEm) [34] and the minimal detectable change (MDC) were calculated based on the ICCs to provide information on the score variability and the “true” change in the QOLIBRI-OS-KID/ADO total score.

Factorial validity was examined using confirmatory factor analysis (CFA) with the weighted least square (WLS) estimator in both the pTBI and the general population samples. The one-factor solution was considered to describe the data well if the goodness-of-fit indices met the following criteria: non-significant χ2-value (p ≥ 0.05) [35], ratio of χ2 and degrees of freedom (df) ≤ 2 [35], comparative fit index (CFI) and Tucker-Lewis index (TLI) > 0.95 [36], root mean square error of approximation (RMSEA) < 0.05 and narrow confidence interval (CI) for an excellent to close fit [37] (fair fit at values of 0.06–0.07, mediocre fit at values of 0.08–0.10, and poor fit at values above 0.10 [36]), and standardized root mean square residual (SRMR) < 0.05 [36]. Scaled measures were reported for all indices except SRMR (not available). Given the potential limitations of interpreting fit indices using the WLS estimator [38], we additionally reported fit indices obtained from the models using robust maximum likelihood (MLR).

Pearson correlation analyses were utilized to investigate convergent and divergent validity through correlations between the QOLIBRI-OS-KID/ADO total score and the questionnaires measuring symptom burden in the pTBI sample. We hypothesized strong positive correlations between the QOLIBRI-OS-KID/ADO and its long version and the PedsQL, assessing generic HRQoL, as evidence of convergence. Correlations between the GAD-7, the PHQ-9, and the PCSI-SR8/13 were anticipated to be small to medium and negative, indicating some overlap between the constructs and suggesting that higher symptom burden is associated with lower HRQoL and vice versa, but measuring different constructs as an evidence of divergence. Determination coefficient (R2) was computed to evaluate the amount of explained variance.

Hypotheses were tested using one-tailed t-tests in the pTBI sample. We presumed that girls, adolescents (aged 13–17 years), participants with inferior recovery (i.e., KOSCHI < 5b), and participants with greater symptom burden (i.e., GAD-7 and PHQ-9 scores > 4, and PCSI-SR8/13 scores ≥ M + 1SD), would report lower HRQoL than the boys, children (aged 8–12 years), participants with superior recovery, and those reporting no symptom burden, respectively (see Table 1, column “Cut-off applied” for more details on the cut-offs for the test instruments). These assumptions, also tested for the QOLIBRI-KID/ADO, were based on findings from previous studies [13]. Cohen’s d was calculated to determine effect size, with values of 0.20, 0.50, and 0.80 representing small, medium, and large effects, respectively [39]. Results were reported only for groups with n > 30.

Prior to establishing reference values, we conducted a multi-group confirmatory factor analyses (CFA) [40, 41] to compare HRQoL construct assessment between the pTBI and the general population sample and determine measurement invariance (MI). We estimated three models with increasing constraints and then compared them using the χ2 difference test. We first fitted the configural model, then assumed equality of thresholds, followed by equality of both thresholds and loadings. If the difference test yields a non-significant result (p ≥ 0.05), it suggests that the HRQoL construct is assessed equally in both groups and that the difference between samples is solely due to the HRQoL disparity. Thus, the QOLIBRI-OS-KID/ADO can be utilized in both general population and TBI samples, and providing reference values is appropriate. Furthermore, we estimated a multiple linear regression to identify significant interactions between gender, age, and presence of chronic health conditions and the QOLIBRI-OS-KID/ADO total score for possible stratification of the reference values (see Supplementary Material, Regression analyses).

Finally, the reference values were calculated using percentiles, allowing for a simple and straightforward interpretation of the patient’s scores in relation to a comparable general population without a history of pTBI. The distribution of QOLIBRI-OS-KID/ADO scores in the general population sample was divided into percentiles (i.e., 2.5%, 5%, 16%, 30%, 40%, 50%, 60%, 70%, 85%, 95%, and 97.5%), with values below the 16th percentile indicating below-average HRQoL [42] as compared to the general population and thus a clinically relevant impairment.

We used R version 4.3.0 [43] for all analyses, employing the packages Table 1 [44], psych [45], lordif [29], and lavaan [46]. Unless otherwise stated, the significance level was set at 5%.

Results

Participants

Overall, 297 children and adolescents (54.2% male; mean age 12.6 ± 2.67 years) after pTBI participated in the study. Most sustained a mild TBI (79.8%), with 7% having visible intracranial imaging abnormalities. Injury occurred on average 5.36 ± 2.70 years prior to study entry with falls being the most common cause (72.1%). Most participants had fully recovered (86.9%). The mean QOLIBRI-OS-KID/ADO score was 82.3 ± 13.7 points. A significant difference was observed when comparing HRQoL between participants surveyed in different settings (i.e., online vs. offline), with offline respondents (40.4% of the sample) being slightly more satisfied (ΔM = 3.19, d = 0.23, p = 0.049) than those interviewed online (59.6%). A subsample of n = 57 participants completed the QOLIBRI-OS-KID/ADO twice, with a mean test–retest interval of 12.12 ± 2.38 days.

Overall, 1997 children and adolescents (49.5% male; mean age 12.4 ± 2.85 years) from the German general population participated in the study. Most had no chronic health problems according to parental reports (87.5%). The mean QOLIBRI-OS-KID/ADO score was 77.6 ± 16.7 points. The HRQoL of the general population sample was significantly lower than that of the pTBI sample (ΔM = − 4.66, d = − 0.29, p < 0.001). When comparing the subgroup of participants from the general population sample without chronic health conditions to the pTBI sample, the effect decreased slightly but remained significant (ΔM = − 4.04, d = − 0.26, p < 0.001). For details, see Table 2.

Table 2 Sociodemographic and health-related data

Item characteristics, reliability, and DIF analyses

At the item level, participants in both samples tended to indicate a rather high level of HRQoL, as the mean scores per item were around 4, corresponding to the response category “Quite (satisfied)”. As a result, the items were moderately to highly skewed.

Reliability coefficients were α = 0.75 and ω = 0.81 (pTBI sample) and α = 0.85 and ω = 0.86 (general population sample). The exclusion of none of the items increased the initial Cronbach’s α of the scale. The CITCs were > 0.40 in both samples. The DIF analyses suggested no meaningful differences in the items between children and adolescents. Although two items (i.e., “Autonomy” and “Future Prospects”) exhibited significant p-values in the general population sample, McFadden’s R2 values were less than 0.01, suggesting that these differences were negligible. The QOLIBRI-OS-KID/ADO data from children and adolescents were considered acceptable for joint analyses. For details, see Table 3.

Table 3 Item characteristics and results of reliability and DIF analyses

The test–retest reliability in the pTBI sample was ICC(2,1) = 0.68, CI95% [0.51, 0.80], with a SEm of 6.35 points. A change of MDC = 17.59 points in the total score was considered a “true” change in HRQoL.

Validity

The one-dimensional factorial structure showed a good fit in both samples, with most indices being within acceptable ranges. The scaled fit statistics for the pTBI sample from the model using the WLS estimator were χ2(9) = 20.01, p = 0.018, χ2/df = 2.22, CFI = 0.98, TLI = 0.97, RMSEA[90% CI] 0.06 [0.03, 0.10], and SRMR = 0.05. The robust fit measures additionally derived from the model using the MLR estimator were CFI = 0.96, TLI = 0.94, RMSEA[90% CI] 0.07 [0.03, 0.11], SRMR = 0.04. The scaled fit statistics for the general population sample were χ2(9) = 119.77, p < 0.001, χ2/df = 13.31, CFI = 0.99, TLI = 0.98, RMSEA[90% CI] 0.08 [0.07, 0.09], and SRMR = 0.03. Robust fit measures were CFI = 0.98, TLI = 0.97, RMSEA[90% CI] 0.07 [0.05, 0.09], SRMR = 0.03 Fig. 2 provides an overview on standardized model parameters for the models using the WLS estimator.

Fig. 2
figure 2

Results of the CFA (standardized parameter estimates; model using the WLS estimator) for a the pTBI sample and b for the general population sample

Convergent validity analyses showed that the correlation coefficients between the QOLIBRI-OS-KID/ADO and the QOLIBRI-KID/ADO ranged from 0.45 (Emotions scale) to 0.82 (Total score). The respective amount of variance explained varied from 20% to 67%. The association with the PedsQL measuring generic HRQoL ranged from 0.52 (Social Functioning) to 0.74 (Total score), with variance explained between 27% and 55%, respectively.

Divergent validity analyses results revealed hypothesized negative associations between the QOLIBRI-OS-KID/ADO total score and instruments measuring symptom burden (i.e., GAD-7, PHQ-9, and PCSI-SR8/SR13) in the pTBI sample. The coefficients ranged from -0.29 (anxiety) to -0.56 (post-concussion symptoms), explaining 8% to 31% of the variability of the scores. For details, see Table 4.

Table 4 Pearson correlations between the QOLIBRI-OS-KID/ADO total scores and other instruments

Adolescents (ΔM = − 5.26, d = − 0.39, p < 0.001), participants with parent-reported post-pTBI health problems (ΔM = − 5.21, d = − 0.39, p < 0.001), those with higher levels of anxiety (ΔM = − 6.88, d = − 0.51, p < 0.001), depression (ΔM = − 8.77, d = − 0.67, p < 0.001), and post-concussion symptoms (ΔM = − 15.55, d = − 1.30, p < 0.001) had significantly lower HRQoL compared to the corresponding reference groups. All other comparisons were not significant or did not reach a sufficient number of individuals in at least one group (Table 5).

Table 5 Results of hypothesis testing between predefined groups for the QOLIBRI-OS-KID/ADO total score in the TBI sample

Measurement invariance

The results of the MI analyses showed no significant differences between the models with increasing restrictions (p > 0.05), suggesting that the construct of HRQoL was measured equally in both samples. In view of these results, it seems appropriate to use the data obtained from the general population sample as a reference for the QOLIBRI-OS-KID/ADO. For details, see Table 6.

Table 6 Results of measurement invariance testing (general population sample vs. TBI sample)

Reference values

Based on the results of the regression analyses, the reference values were further stratified by gender and age, but excluding those sustaining chronic health conditions due to insufficient number of cases (see Supplementary Material, Regression analyses and Table S2). Table 7 provides reference values for the QOLIBRI-OS-KID/ADO total score. The following example illustrates the application of the reference values. A 15-year-old female patient has a QOLIBRI-OS-KID/ADO total score of 60. This means that her score is between the 5% and 16% percentiles, and she is in the under-average clinically relevant HRQoL range. Accordingly, compared to girls aged 13–17 from the general population, her HRQoL is below average and there is an indication for further diagnostics. In this case, a longer version of the instrument could be used to identify HRQoL domains that are particularly affected (e.g., cognition or emotions). Depending on the results, further diagnostics, such as neuropsychological testing or screening for depression or anxiety, may be performed to gain more information about what may have negatively impacted the patient’s HRQoL. If the QOLIBRI-OS-KID/ADO total score is in the normal range, there is no indication for further diagnostics.

Table 7 Reference values for the QOLIBRI-OS-KID/ADO from the general population sample stratified by gender and age

Another interpretation would be to determine a cut-off for the assessment of clinical relevance. In this case, QOLIBRI-OS-KID/ADO scores less than or equal to 62 indicate impaired HRQoL in females aged 13–17 years. The score of 89 from our example is above the cut-off of 62, indicating that the patient did not report clinically relevant impairment.

An interactive app for the reference values can be found online (https://reference-values.shinyapps.io/Tables_Reference_values/, tab QOLIBRI-OS-KID/ADO; last access on 05.07.2023).

Discussion

The aim of this study was to investigate the psychometric properties of the QOLIBRI-OS-KID/ADO, a short scale assessing disease-specific HRQoL after pTBI, and to provide reference values from a general population sample. Based on the results of the psychometric analyses, we recommend the use of the QOLIBRI-OS-KID/ADO, especially when a brief overview of TBI-specific HRQoL is needed. The established reference values serve as a basis for screening decisions (e.g., further administration of a longer version of the instrument to gain more detailed insight into impaired HRQoL domains or omission of further steps in the absence of an indication) and provide information on the degree of overall HRQoL impairment after pTBI. Some of the central findings are discussed below.

The reliability of the QOLIBRI-OS-KID/ADO was satisfactory, indicating that the items are internally consistent, and the test results are repeatable within a period of 20 days. A score change of approximately 18 points represents a true change in HRQoL between two measurement occasions. Future studies should further focus on the clinically meaningful change to evaluate the course of recovery and the effectiveness of therapies and treatments after pTBI.

The DIF analyses suggested that the instrument is applicable to children and adolescents, allowing for aggregated analyses, item-level comparison, and longitudinal HRQoL monitoring. The psychometric properties of the QOLIBRI-OS-KID/ADO are similar to those of the QOLIBRI-KID/ADO [13], and the correlation between the total scores indicates strong construct overlap. The overlap with generic HRQoL as measured by the PedsQL was slightly lower. Similar associations between generic and disease-specific HRQoL measures have also been found for other pediatric health conditions (e.g., allergies [47], renal disease [48], or epilepsy [49]). This finding indicates that the constructs are related—also in terms of convergent validity—but not completely identical, suggesting that generic HRQoL measures cannot serve as substitutes for disease-specific PROMs.

Negative associations between the QOLIBRI-OS-KID/ADO and the instruments measuring symptom burden suggest that higher burden is associated with reduced disease-specific HRQoL and vice versa, which is consistent with findings for the QOLIBRI-KID/ADO [13, 14]. Finally, the goodness of fit of the one-factor model indicates that the assessment of HRQoL occurs unidimensionally and that the use of the QOLIBRI-OS-KID/ADO total score is reasonable. The use of both WLS and ML estimators in the CFA analyses supports this finding.

The QOLIBRI-OS-KID/ADO covers the HRQoL domains specified in the 35-item form in a general way and may therefore be less able to detect all differences between predefined sociodemographic and clinical groups than the long version. This may explain why no differences in sex, time since injury, and TBI severity were observed in the known group analyses in the pTBI sample. However, significant differences between age groups and between children and adolescents with and without post-injury health problems indicate that, as hypothesized, adolescents and those reporting health problems report lower HRQoL than comparison groups. This finding suggests that the QOLIBRI-OS-KID/ADO reflects the ability of the 35-item version to discriminate between predefined groups [13]. In addition, significantly lower QOLIBRI-OS-KID/ADO scores were observed in children and adolescents with higher levels of symptom burden compared to the low symptom burden groups. This reflects the findings from the previous studies suggesting decreased overall HRQoL in children with post-pTBI affective disorders [50] and post-concussion symptoms in both the acute [51] and post-acute phases [52]. Similar to the results of the long version [14], we did not observe significant differences in HRQoL between mild and moderate-to-severe pTBI groups. This finding may be explained by the small number of children and adolescents with greater TBI severity, which also reflects the prevalence of more severe TBI in pediatric populations [53], and thus lower power to detect the differences. Given that the adult version of the instrument showed a clear ability to discriminate between different groups of TBI severity at three, six, and 12 months after injury in a large pan-European study [17], further evidence is warranted to support the utility of the QOLIBRI-OS-KID/ADO across all severity levels in children and adolescents.

We found significantly lower HRQoL in the general population sample compared to the pTBI sample, although the effect was small. This may seem surprising at first, but this finding can be further explained. First, the pTBI sample included children and adolescents in the post-acute phase after mild injury. Therefore, the children and adolescents may have mostly recovered from the injury and/or adapted to its consequences, also in terms of HRQoL [54]. Second, by removing the reference to a specific health condition, as was done in adapting the QOLIBRI-OS-KID/ADO for administration to the general population, a broader or different interpretation of life and health domains to which responses can be related may be possible, resulting in greater variability in responses. Finally, due to the large size of the general population sample, the test may have been overpowered, detecting small effects that may be less relevant from a content-related perspective. In fact, the average difference between the samples was 4 points on a scale of 0–100. Future analyses using a matched sample adjusted for age, sex, and presence of chronic health conditions may shed more light on these differences.

Irrespective of gender and age, a cut-off of 62 on the QOLIBRI-OS-KID/ADO total score can be considered clinically relevant. For boys aged 8–12 years and 13–17 years, scores below 66 and 67, respectively, indicate impaired HRQoL, and for girls in both age groups, 67 and 62, respectively. With these scores in mind, possible deficits can be identified, and more detailed and personalized diagnostics can be applied to better understand which HRQoL domains are affected and need therapeutic interventions. Compared to the previously established cut-off value of 52 for the adult version of the questionnaire [55], the critical HRQoL score in children and adolescents is higher. However, the adult cut-off value was methodologically derived from the norm-based Mental Component Summary score of the Short Form Survey [56] using regression analyses and not from the comparable general adult population sample.

Strengths and limitations

The main strength of this study is that it provides a first short scale assessing disease-specific HRQoL after pTBI in the German-speaking language context, complemented by reference values obtained from a German general population. The study fills the gap of economic PROMs and provides interpretation guidelines in the field of pTBI. However, some limitations should be mentioned.

Overall, the number of children and adolescents after pTBI enrolled in the study represents 9% of the total number of invitations sent. Participation in the study was voluntary, and the decision to enroll should have been made by the parents. For privacy reasons, we were unable to perform analyses on known characteristics between those who enrolled and those who did not. Therefore, we can only assume that there may have been several reasons for non-response (ranging from change of address due to relocation, unwillingness to participate, or concerns about re-traumatizing the children through repeated explicit confrontation with the traumatic event). However, the planned number of participants in the project has been reached [14]. These issues have been addressed in more detail elsewhere [14].

The children and adolescents included in the present study mostly suffered from mild pTBI (ca. 80%) and were in the post-acute stage of injury (5.36 ± 2.70 years). Therefore, further validation in a more acute sample and in moderate to severe pTBI is warranted. The same applies to the levels of recovery, which could not be compared due to the small number of cases in the less favorable recovery groups: good recovery (KOSCHI 5a: 7.1%) and upper moderate disability (KOSCHI 4b: 5.1%). Further research is needed to better understand the applicability of the QOLIBRI-OS-KID/ADO total score to differentiate between different levels of recovery/disability.

Furthermore, the significant difference in HRQoL between online and offline interviewed patients can be explained by the higher prevalence of children and adolescents with pTBI within the first year (8.4% vs. 0%) and of moderate and severe TBI (23.4% vs. 14.8%) in the online group.

The general population sample was recruited online, which has both advantages and disadvantages. The main advantage is the accessibility of the relatively large number of participants. The potential disadvantages are a certain bias (e.g., recruitment of individuals with Internet access), non-standardized setting of data collection, and difficulty in verifying individuals [57]. Nevertheless, we have tried to maintain data quality by recruiting participants from internationally recognized market research agencies, controlling for inconsistent responses, and excluding “speeders” from the data set. While the planned number of cases of N = 2000 was almost reached also after data cleaning, the number of survey starts that ended with a termination was 61% (response rate 39%), indicating a possible selection bias and that only those interested completed the questionnaires. However, comparative analyses between respondents and non-respondents were not possible because the research agencies were not allowed to disclose information about those who had withdrawn.

The subsample with chronic health conditions was not sufficient to stratify the reference values. Therefore, the reference values provided represent the ideal norm. Expanding the reference values with information from subgroups with chronic health conditions would provide additional insight into HRQoL in the general population and allow for more nuanced comparisons with pTBI patients.

Conclusions

The QOLIBRI-OS-KID/ADO is a short, reliable, and valid PROM assessing disease-specific HRQoL after pTBI. Combined with reference values obtained from a general population sample, it provides a cost-effective screening for clinical practice and research. Future studies should focus on its applicability in more acute and severe samples and on the long-term validity, especially in combination with the adult version, which would provide further insight into the longitudinal effects of pTBI. Finally, translation into other languages would allow multinational studies to fill the knowledge gap regarding pTBI-specific HRQoL internationally.