Abstract
Purpose
One of the challenges for mental health research is the lack of an agreed set of outcome measures that are used routinely and consistently between disciplines and across studies in order to build a more robust evidence base for how to better understand young people’s mental health and effectively address diverse needs.
Methods
This study involved a scoping review of reviews on consensus of the use of mental health and wellbeing measures with children and young people. We were particularly interested to identify if there are differences in measures that are recommended for children and young people with care experience including those with developmental disabilities.
Findings
We identified 41 reviews, of which two had a focus on child welfare settings, three on childhood trauma and 14 focused on children and young people with developmental disabilities. Overall, our review highlights a lack of consensus and a diversity of measures within the field. We identified 60 recommended measures, of which only nine were recommended by more than one review.
Conclusions
Our review highlights the need for greater agreement in the use of mental health outcome measures. While our review highlights that there is value in identifying measures that can be used with any child or young person, researchers need to take into account additional considerations when working with children and young people with care experience and those with developmental disabilities, to ensure measures are accessible and sensitive to their life experiences.
Introduction
There is a growing recognition of a lack of agreement and consistency in the use of mental health measures in research and practice in relation to children and young people, which makes it difficult to compare research findings (Krause et al., 2021). For example, over 280 measures for depression have been developed since 1918 (Santor et al., 2006). How researchers assess mental health outcomes varies, with different measures reflecting the different mental health outcome domains that researchers decide to assess. Most commonly, mental health is defined by looking at mental health problems, although there has also been an increase in positive mental health measures, utilising concepts such as wellbeing or quality of life (Losada-Puente et al., 2019). While it is important that there are robust measures for all aspects of mental wellbeing, common mental health problems and severe mental illness, the lack of consistency in the measures used means that important opportunities to compare findings across studies, settings and time are being missed. Additionally, the increase in the number of measures available has led to concerns about the quality of measures that are used and the absence of independent evaluation of their psychometric properties (Addington et al., 2015; Howe et al., 2020), as well as about whether measures accurately reflect the different mental health outcome domains claimed (Krause et al., 2022).
The need for greater consensus has been highlighted by a number of initiatives. In 2005, the COSMIN initiative (COnsensus-based Standards for the selection of health Measurement INstruments) was set up by a multi-disciplinary team of international researchers to provide guidance on assessing and selecting suitable outcome measures. More recently the International Alliance of Mental Health Research Funders, the National Institute of Mental Health and the Wellcome Trust (Farber et al., 2020) have suggested a set of common data items and measures which should be routinely used in mental health research. These include: Age; Sex at Birth; WHO Disability Assessment Schedule (WHODAS) 2.0 (for adults); Patient Health Questionnaire (PHQ-9) (for adults); Generalised Anxiety Disorder Assessment (GAD-7) (for adults); and the Revised Children’s Anxiety and Depression Scale (RCADS-25) (for youth). The International Consortium for Health Outcomes Measurement (ICHOM) has recommended a standard set of outcomes specifically for child and youth anxiety, depression, obsessive compulsive disorder, and post-traumatic stress disorder (Krause et al., 2021) which are: the RCADS-25; the Obsessive Compulsive Inventory for Children (OCI-CV); the Children’s Revised Impact of Events Scale (CRIES); the Columbia Suicide Severity Rating Scale (C-SSRS); the KIDSCREEN-10; the Children’s Global Assessment Scale (CGAS); and the Child Anxiety Life Interference Scale (CALIS).
One of the reasons for a lack of agreement on which measures to use is that different measures are often validated to be administered in specific groups and populations and within defined settings. Specific populations of interest include children and young people with care experience (also referred to as looked after children) including those with developmental disabilities. While there are increasing concerns about the mental health and wellbeing of children and young people in general (Frith, 2016), there are concerns in particular about looked after children (Bazalgette et al., 2015) and children with developmental disabilities, such as autism or ADHD (Sayal et al., 2018; Lecavalier et al., 2014). Children and young people in care are always, first and foremost, children and young people and there is a risk of othering those with care experience and/or developmental disabilities when viewing them as an entirely distinct and different group. Yet, some of their experiences will be unique and it is important for researchers to be aware of this. Young people who are looked after have consistently been found to have much higher rates of mental health difficulties than the general youth population, with almost half of looked after children (and three quarters of those living in residential group care) meeting the criteria for a psychiatric disorder in the UK (Fleming et al., 2021; McKenna et al., 2023). There are many reasons for this, including the adversities experienced by children before coming into state care, such as abuse, neglect, exploitation and poverty, along with the difficulties children may experience during their time in care, which can both add to and exacerbate their needs.
Given the accumulation of experiences, it is important to understand trajectories and outcomes of poor mental health and wellbeing, and recovery, for these young people to inform policy and practice. Reviews of the extant literature and research (Luke et al., 2014; NICE, 2015, 2021) have highlighted a number of challenges to ensure that the needs of looked after young people are better understood and addressed. The NICE (2015, 2021) Guidelines on Looked After Children and Young People concluded that further work was needed to develop robust methods for evaluating services. This included, for example, developing standardised, validated and reliable measures and robust tools to evaluate quality of life outcomes for use with all looked after children and young people from birth to 25 years, regardless of where they live.
Children and young people with developmental disabilities have also been found to be at greater risk of mental health difficulties due to an interplay of differences in individual functioning and environmental risk factors such as higher prevalence of bullying, experiences of stigma, lack of social inclusion and school exclusion (Sterzing et al., 2012; Honey et al., 2011; Emerson & Hatton, 2007). Additionally, there is now a growing recognition of the presence of developmental disabilities within the care population but this is often overlooked in mental health research with this group, despite potential implications for how such young people should be cared for, and supported (Banerjee et al., 2021). Recognising diversity in individual functioning and life experiences for children with care experience, including children with developmental disabilities raises the question of which measures are, can and should be used across populations, which need to be adapted and what population or domain-specific measures are needed.
Aims and Objectives
The overall aim of the scoping review is to explore variability in the use of mental health outcome measures and to identify measures that have been recommended to be routinely used in research with children and young people in different contexts, with a specific focus on children with care experience including those with developmental disabilities. The scoping review is part of a bigger project (blinded for peer review) and findings from this review informed the development of a Delphi study to identify and agree a common core set of measures to be used in mental health research with young people who are care experienced. Additionally, as part of our project we conducted participatory work with young people and adults with care experience, including some with developmental disability, to help us think about how we should define and understand mental health, given the criticism that young people in out of home care are rarely asked about their perspectives on their own health (Smales et al., 2020). The scoping review was the first step of the process and aimed to map the literature on the development and implementation of agreed outcome measures.
Defining Mental Health Outcome Measures
For this review, outcome measures were defined as psychometrically validated measures of mental health. We aimed to take a broad conceptualisation of mental health and included related concepts such as wellbeing and quality of life, to capture clinical definitions, as well as broader social perspectives on mental health (Berghs et al., 2021). Additionally, it was important for the review team not to equate developmental disability with mental health problems and we decided not to include tools that facilitate a diagnosis of developmental disabilities such as autism or ADHD. The focus of this review is not on diagnosis, but on outcome measures that can be used in research to assess and understand young people’s mental health, identify risks or capture change over time.
Methods: A Review of Reviews
Reviews of reviews are helpful in areas of research and practice that are rapidly growing and have an extensive evidence base that makes the synthesis of primary studies too burdensome (Smith et al., 2011). An initial database search combining terms for measures, mental health and children in PsycInfo showed over 70,000 results of primary studies. The search results were then filtered to include systematic reviews published in the last 10 years. A further preliminary database search showed that there were several existing reviews that explored the use of mental health measures in research with children and young people with a focus on different age groups, settings and outcome domains. Thus, we made the decision to conduct a review of existing reviews to map recommendations for different populations and outcome domains.
Research Questions
Our primary research question for the scoping review was: What outcome measures are currently used to assess the mental health and wellbeing of children and young people in research? Our aim was to map measures recommended by existing reviews for use in research with children and young people.
Sub-questions of interest were:
- How is mental health and wellbeing defined and what typologies and dimensions underlie existing measures?
- What outcome measures are used for children and young people in care and care-leavers? What outcome measures are used for children and young people with developmental disabilities?
- What are the age groups for which outcome measures have been designed and used?
Search Strategy
The Joanna Briggs Institute (https://jbi.global/) recommends using PCC (Population – Concept – Context) to develop search strategies for scoping reviews, and the PCC format guided the development of our search strategy. The process was supported by an expert support librarian, who was a member of the research team (RJ). The search strategy was developed in PsycInfo, where the subject headings were likely to be the most detailed for mental health related terms, and the sensitivity of the search was tested using a set of papers already identified as relevant. The search strategy was then translated to Medline, Embase and ERIC. A combination of subject headings and keyword (free text) searches were used. The search was conducted between March and April 2021. An overview of our search terms can be found below (Table 1) and details of the full strategy with truncations and search filters can be obtained from the first author on request.
Eligibility Criteria
The following eligibility criteria were developed to guide the screening process:
- Is it a published review?
- Is it a review of measures?
- Is it a review of mental health measures?
- Does it focus on children and young people (0–26)?
- Is it available in English or German?
- Has it been published in the last 10 years (2011 to 2021)?
The age range was chosen to reflect current policy and practice recommendations, reflecting an understanding that the period of transition to adulthood can take several years after young people leave school. Additionally, mid-twenties have been identified as an age when most mental health conditions will have manifested (Kessler et al., 2007). Reviews that included studies with children/young people, as well as adult populations, were only included if they specifically referred to children or young people as a distinct group in their assessment and recommendation of measures.
We excluded specific populations such as children and young people with diabetes or terminal illness. Reviews were defined as following a systematic and transparent search process and included scoping, systematic and narrative reviews. The focus on English and German publications reflects the languages spoken by the research team; however, we recognise that the exclusion of other languages adds bias to the review.
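The eligibility criteria above amount to a conjunction of checks applied to each record. The following is a minimal sketch of that logic, not the screening tool used by the review team. The `Record` structure and all of its field names are hypothetical, and the exclusion of specific clinical populations (such as diabetes or terminal illness) is not modelled.

```python
from dataclasses import dataclass

# Hypothetical record structure; the field names are illustrative
# assumptions, not the authors' screening form.
@dataclass
class Record:
    is_published_review: bool
    reviews_measures: bool
    mental_health_focus: bool
    min_age: int                          # youngest age covered
    max_age: int                          # oldest age covered
    treats_youth_as_distinct_group: bool  # relevant for mixed-age reviews
    language: str                         # e.g. "English"
    year: int                             # publication year

ALLOWED_LANGUAGES = {"English", "German"}
AGE_RANGE = (0, 26)
YEAR_RANGE = (2011, 2021)

def is_eligible(r: Record) -> bool:
    """Return True only if every eligibility criterion is met."""
    overlaps_age_range = r.min_age <= AGE_RANGE[1] and r.max_age >= AGE_RANGE[0]
    # Reviews that also cover adults qualify only if children or young
    # people are assessed as a distinct group.
    includes_adults = r.max_age > AGE_RANGE[1]
    youth_focus = overlaps_age_range and (
        not includes_adults or r.treats_youth_as_distinct_group
    )
    return (
        r.is_published_review
        and r.reviews_measures
        and r.mental_health_focus
        and youth_focus
        and r.language in ALLOWED_LANGUAGES
        and YEAR_RANGE[0] <= r.year <= YEAR_RANGE[1]
    )
```

In practice several of these judgements (for example, whether a review treats young people as a distinct group) require a human reviewer; the sketch only shows how the individual decisions combine.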
Screening Process and Data Extraction
Overall, 25,438 results were identified across the four databases and, after removing 3,544 duplicates, 21,894 were screened against our eligibility criteria. A total of 21,387 were excluded after screening all titles and abstracts. Of the remaining 507 records, we were unable to retrieve the full text for one, so 506 were assessed for full-text eligibility. A team of six researchers conducted the screening (PJ, LP, CMC, JD, GD, PMC). 20% of results were assessed by two reviewers at the title and abstract stage as a standardisation exercise, before moving to single reviewer screening. All full-text records were assessed independently by two reviewers. Conflicts were resolved by a third reviewer, and particularly difficult decisions were taken after discussions with the whole review team. An overview of the screening process can be found in the PRISMA diagram (Fig. 1).
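The screening counts reported above can be checked with simple arithmetic. The sketch below uses only the figures stated in the text; the function name `prisma_flow` and the dictionary keys are illustrative choices, not part of the PRISMA guidance.

```python
def prisma_flow(identified, duplicates, excluded_at_screening, not_retrieved):
    """Derive the intermediate counts of a screening flow."""
    screened = identified - duplicates                 # titles/abstracts screened
    sought_full_text = screened - excluded_at_screening
    assessed_full_text = sought_full_text - not_retrieved
    return {
        "screened": screened,
        "sought_full_text": sought_full_text,
        "assessed_full_text": assessed_full_text,
    }

# Figures as reported in the text.
flow = prisma_flow(
    identified=25_438,
    duplicates=3_544,
    excluded_at_screening=21_387,
    not_retrieved=1,
)
print(flow)
# → {'screened': 21894, 'sought_full_text': 507, 'assessed_full_text': 506}
```

The derived values agree with the 21,894 records screened and the 506 records assessed at full text.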
The main reasons for exclusion at the full-text stage were reviews which did not meet our definition of being a systematic review of measures. This included reviews which focused on one or two specific measures and where the selection of those measures was not transparent or systematic. It also included reviews that reported the frequency of use of measures, but failed to provide an assessment of the psychometric properties or the acceptability or utility of identified measures. Additionally, 138 studies were identified as not primarily relating to mental health outcomes. This included reviews which focused only on physical health or physical functioning (e.g. mobility), IQ-tests or standardised diagnostic assessments.
41 reviews were deemed to meet our eligibility criteria. The data extraction process followed several steps. Firstly, we extracted information about each included review, including information about authors, country of origin, methods and the number of included studies and measures. Secondly, we identified recommended measures across the 41 reviews, and information about each measure and the context of its use (recommended for which purpose, setting, context and age group) was collated by two members of the research team (PJ and LP). To answer sub-questions of interest we used a framework of four mental health typologies to group reviews and measures (Slade, 2002). These were (i) condition-specific measures, (ii) behaviour associated with poor mental health such as self-harm or substance misuse, (iii) general mental health and (iv) positive mental health. We paid particular attention to reviews that discussed measures in relation to children with care experience and children with disabilities, comparing whether different measures were recommended or whether other differences were noticeable, such as in the use of outcome domains.
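One way to picture the extraction step is as a record per recommended measure, tagged with one of the four typologies and then grouped. The sketch below is an assumption about how such data could be organised, not the extraction form used in this review; the class and field names are hypothetical and the example entries are placeholders rather than extracted data.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum
from typing import Tuple

class Typology(Enum):
    CONDITION_SPECIFIC = "condition-specific measures"
    BEHAVIOUR = "behaviour associated with poor mental health"
    GENERAL = "general mental health"
    POSITIVE = "positive mental health"

# Hypothetical extraction record; fields mirror the information the text
# says was collated (purpose, setting and age group).
@dataclass
class ExtractedMeasure:
    name: str
    typology: Typology
    purpose: str
    setting: str
    age_range: Tuple[int, int]

def group_by_typology(measures):
    """Group measure names under their typology."""
    groups = defaultdict(list)
    for m in measures:
        groups[m.typology].append(m.name)
    return dict(groups)

# Placeholder entries for illustration only.
examples = [
    ExtractedMeasure("Measure A", Typology.GENERAL, "screening", "community", (4, 16)),
    ExtractedMeasure("Measure B", Typology.POSITIVE, "outcome", "clinical", (8, 18)),
    ExtractedMeasure("Measure C", Typology.GENERAL, "outcome", "clinical", (3, 16)),
]
grouped = group_by_typology(examples)
```

Keeping the typology as an enumerated field, rather than free text, makes it straightforward to compare how recommendations are distributed across the four groups.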
Results
The results section will provide an overview of included reviews, discuss findings in relation to outcome domains being used, identify recommended measures and lastly highlight findings in relation to children with care experience and children with developmental disabilities.
Overview of Included Reviews
An overview of the 41 included reviews can be seen in the tables below, which are presented in accordance with the four typologies (condition-specific measures, behaviour, general mental health, positive mental health). Tables include a description of the methods and aims of each included review, alongside a summary of key findings and recommendations made by the authors (including use of measures with specific age ranges, populations or in specific settings). 21 reviews did recommend specific measures as part of their findings, while 20 reviews felt unable to provide a recommendation. Those reviews often noted that the choice of measure depends on specific research questions, aims, settings and groups of interest. Notably, most endorsed measures were recommended to be used in clinical or mental health specific settings, with none of the reviews exploring use of measures in community settings (such as schools). This seemed to be because authors felt that there was not enough evidence on the use of measures with diverse populations (Eklund et al., 2018). Additionally, few measures were identified that could be used in early childhood (see Tables 2, 3, 4, 5, and 6).
Dimensions of mental health
The included reviews were based on different concepts of mental health. These included (i) nine reviews of symptom and condition-specific measures (e.g. depression, anxiety, psychosis), which focused on the presence of symptoms, were often closely related to diagnostic criteria and used in clinical settings; (ii) nine reviews that focused on behaviour associated with poor mental health, including substance use, aggression, disruptive behaviour, self-harm and suicide; (iii) ten reviews that focused on general mental health measures, combining an assessment of multiple dimensions such as cognition, social and emotional development and functioning in different environments; and (iv) ten reviews that utilised positive mental health perspectives, assessing wellbeing, quality of life and resilience through concepts such as life satisfaction, participation, sense of belonging in combination with consideration of the impact of environmental factors such as relationships, or housing. Both general mental health measures and positive mental health measures included examples of one-dimensional measures, providing an overall score across domains, as well as multi-dimensional ones considering individual domains alongside each other. Three of the reviews had a wider scope and reviewed measures across typologies (Becker-Haimes et al., 2020; Krause et al., 2021; Newton et al., 2017). 
Examples of measures for each typology included the Revised Children's Anxiety & Depression Scale as a condition-specific measure for depression and anxiety; the Columbia Suicide Severity Rating Scale, which evaluates severity of behaviour and ideation; the Paediatric Symptom Checklist, which involves an assessment of psychosocial problems, as well as overall functioning including school and peer relationships; and KIDSCREEN as an example of a measure of wellbeing that includes questions about physical and psychological wellbeing, mood and emotions, autonomy, home life, relationships, social support and school.
Interestingly, we had initially thought that measures of wellbeing and quality of life would reflect a more positive perspective to mental health. However, during the review and data extraction process we became aware that authors were highlighting that some wellbeing and quality of life measures are often applied in studies that take a deficit-view to highlight limitations or difficulties (Davis et al., 2018; Mierau et al., 2020). Thus, wellbeing and quality of life measures were often found to be used within narratives that focus on psychopathology, rather than identifying what helps children and young people to be well (Losada-Puente et al., 2019).
Overview of recommended measures
Overall, 60 measures were recommended by 21 reviews. Interestingly, a number of reviews had the same areas of interest (e.g. measures of anxiety or risk of suicide) but came to different conclusions and recommendations. This appeared to be because of different foci in relation to the exact purpose of the measures or their use with different age-groups or populations and different priorities in the assessment of the measures. For example, some reviews had a stronger consideration of predictive values when making recommendations in relation to the identification of early risk (Harris et al., 2019), while others focused on the sensitivity of measures in relation to using them as screening tools or to capture change over time (Newton et al., 2017). Reviews assessing the use of measures in schools or clinical practice tended to include a stronger consideration of their utility and acceptability to children and young people (McConachie et al., 2015; Rosanbalm et al., 2016). Yet, it was still striking how little consistency there was across reviews on which measures to use. For example, we identified 15 different recommended measures in relation to the assessment of anxiety. Similarly, Bear et al. (2020) identified 15 different measures in their systematic review of outcome measures of anxiety and depression in young people.
To narrow down the list of recommended measures we looked at which measures were recommended by more than one review. Only nine measures were recommended more than once and these are presented in the table below (Table 7). An overview of the full 60 measures including information on recommended populations, settings and number of items, can be found in Appendix 1. The Revised Children's Anxiety & Depression Scale (RCADS, long and short version) was the most recommended measure with four reviews recommending it as a measure for anxiety and depression and it is also included in the set of measures recommended by ICHOM and the Wellcome Trust. Reviews recommended it for the age range of 6 to 18 years, within clinical and community settings. Strengths that were noted included its use in different cultural contexts, but Krause et al. (2021) noted that they did not find evidence of its sensitivity to change.
Two measures were recommended by three reviews. The Paediatric Symptom Checklist (PSC) was recommended as a general mental health measure, assessing internalising, externalising and general mental distress for the ages 4 to 16 years (Becker-Haimes et al., 2020; Zima et al., 2019; McCrae & Brown, 2017). The Screen for Child Anxiety Related Emotional Disorders (SCARED) was recommended as another assessment and outcome measure of anxiety, with reviews highlighting its strong psychometric properties (Becker-Haimes et al., 2020; Lecavalier et al., 2014).
All other measures were recommended by two reviews. This included a further two anxiety measures. The Anxiety Disorders Interview Schedule (ADIS) was recommended specifically to be used with autistic children for the ages of 6 to 18 years, to detect treatment effect (no longer meeting diagnostic criteria) and for characterization of research participants. However, Lecavalier et al. (2014) noted that administration burden makes it unsuitable as a repeat measure.
The Spence Children’s Anxiety Scale (SCAS) was recommended to be used in community mental health settings, as well as with autistic children and young people for the ages 8 to 15 years (Becker-Haimes et al., 2020).
KIDSCREEN was recommended as a wellbeing and quality of life measure for young people between 8 and 18 years. Identified strengths included its sensitivity to change over time, as well as accessibility and ease of use in practice, having been developed with input from children, young people and their families (Davis et al., 2018; Krause et al., 2021). The Pediatric Quality of Life Inventory (PedsQL) was recommended as another quality of life measure for the ages 8–18 years. Limitations included its poor quality when used with younger children (Mierau et al., 2020), as well as its high cost (Davis et al., 2018).
The Strengths and Difficulties Questionnaire (SDQ) was recommended as a general mental health measure for the ages 3–16 years, with Becker-Haimes et al. (2020) emphasising evidence for its use as a routine measure of progress over time.
Lastly, the Columbia Suicide Severity Rating Scale (C-SSRS) was recommended to evaluate the severity of suicidal behaviour and ideation. Krause et al. (2021) noted that the recent self-report version of the C-SSRS had not been validated for use with children and young people, but that the clinician-rated C-SSRS had strong evidence of good internal consistency, inter-rater reliability, and sensitivity to change in adolescent samples.
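The step of narrowing the recommended measures to those endorsed by more than one review is a simple tally. The sketch below illustrates the technique with placeholder review and measure names; it is toy data, not the extraction from the 41 included reviews, and the function name is an illustrative choice.

```python
from collections import Counter

def recommended_more_than_once(recommendations):
    """Map each measure recommended by more than one review to its count.

    `recommendations` maps a review identifier to the measures it recommends.
    """
    counts = Counter(
        measure
        for measures in recommendations.values()
        for measure in set(measures)  # count each review at most once per measure
    )
    return {measure: n for measure, n in counts.items() if n > 1}

# Toy input with placeholder names (not data from this review).
toy = {
    "review_1": ["measure_A", "measure_B"],
    "review_2": ["measure_A"],
    "review_3": ["measure_A", "measure_C", "measure_C"],
    "review_4": ["measure_B"],
}
shortlist = recommended_more_than_once(toy)
# shortlist == {"measure_A": 3, "measure_B": 2}
```

Applied to the review's own extraction table, a tally of this kind would correspond to the nine measures reported here out of the 60 recommended overall.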
Recommendations in Relation to Children and Young People with Care Experience and Those with Developmental Disabilities
We identified two reviews with a focus on children and young people with care experience in relation to general mental health measures and measures of wellbeing (McCrae & Brown, 2017; Rosanbalm et al., 2016) and three that focused on trauma related experiences in relation to general mental health (Atazadeh et al., 2019; Eklund et al., 2018) and resilience (Satapathy et al., 2020).
In relation to developmental disabilities, we included five reviews, which focused on symptom or condition-specific measures, which all related to autism and anxiety (Kreiser & White, 2014; Lecavalier et al., 2014; Wigham & McConachie, 2014; Tulbure et al., 2012; Grondhuis & Aman, 2012), three reviews focused on the assessment of specific behaviours, which included aggression and self-harm in autism (Hanratty et al., 2015; Howe et al., 2020; Matson & Cervantes, 2014), one discussed medication management and symptom changes in ADHD (Hall et al., 2016), three focused on quality of life and general mental health outcomes in relation to disabilities as a general concept (Davis et al., 2018; Janssens et al., 2016; Losada-Puente et al., 2019), and two focused on autism and quality of life (Ikeda et al., 2014; McConachie et al., 2015). This shows that while developmental disabilities include a very diverse group of children and young people, there appears to have been greater focus on autism over other disabilities.
Only one of the reviews that focused on children and young people with care experience made recommendations, which included the SDQ and the PSC, which were also recommended to be used with young people in mental health settings. Satapathy et al. (2020), in their review on resilience measures, further discussed that the Child and Youth Resilience Measure and Connor-Davidson Resilience Scale included small samples of children from welfare homes. Three anxiety measures (RCADS, SCARED, SCAS) were recommended for young people in the general population as well as autistic youth, with the ADIS being specifically recommended for autistic children and young people (Lecavalier et al., 2014; Grondhuis & Aman, 2012). Additionally, Lecavalier et al. (2014) highlighted that one study had evaluated the use of RCADS with autistic children (Hallett et al., 2013), which has since been repeated adding further support for its use with autistic children and young people (Sterling et al., 2015). The KIDSCREEN (long and short versions) was recommended for use with children and young people in clinical care, youth with disabilities and children with ADHD. The PedsQL was recommended to be used with young people in mental health services, as well as autistic youth, young people with ADHD and intellectual disabilities (Mierau et al., 2020).
All reviews on children and young people with care experience focused on general mental health or positive mental health measures. This reflected a view that holistic assessments would help capture the complexity of experiences in this population. Additionally, in relation to outcome domains, all reviews on children and young people with care experience and some of the reviews that focused on children and young people with developmental disabilities highlighted the value of measures that included an assessment and questions on strengths alongside difficulties or deficits, as well as assessments that included a consideration of environmental factors alongside individual ones (Davis et al., 2018; McConachie et al., 2015; McCrae & Brown, 2017). Authors argued that, for both populations, environmental factors often contribute to and sustain poor mental health, and that it is important for researchers and practitioners to understand and capture poor mental health as a response to trauma and experiences of social exclusion or stigma (Davis et al., 2018; Ikeda et al., 2014; McConachie et al., 2015; McCrae & Brown, 2017). Similarly, two of the reviews which involved children and young people in the assessment process of measures with a focus on autism (McConachie et al., 2015) and developmental disabilities (Janssens et al., 2016) highlighted discrepancies between what was being measured and what children, young people or their families identified as important to them, as well as highlighting the importance of measures being accessible. This included a dominant focus on deficits and difficulties, overlooking the strengths and abilities of children and young people.
Discussion
Of the 60 recommended measures we identified, only nine were recommended by more than one review, which adds to the evidence for the lack of consensus on the use of mental health measures in research with children and young people. Across the included reviews, there was an evident tension between the need for measures that are validated for use in particular settings, age groups and populations and the need for measures that can address defined research aims and questions (such as measuring change over time or having predictive power). Reviews which focused on developmental disabilities emphasised that many measures were not designed with children and young people with disabilities in mind, which was also true in relation to children and young people with care experience and those who have experienced adversities (McCrae & Brown, 2017; Satapathy et al., 2020). Yet, authors argued that instead of developing new measures it can be more helpful to adapt and develop existing measures to build on existing knowledge. This allows researchers to make comparisons, while remaining aware of specific needs and circumstances, particularly as mental health tools can subsequently be validated for their use with children or young people with developmental disabilities (Biederman et al., 2005; Sterling et al., 2015) or those with care experience. Similarly, Krause et al. (2021) argue for the piloting of existing measures in new populations and contexts to adapt or exchange them in light of new evidence and knowledge.
Only two of the nine measures that were recommended by more than one review were recommended to be used with young children under 6 years of age (proxy versions). These measures (the PSC and SDQ) were also recommended to be used with children and young people in care. None of the nine measures that were recommended more than once were recommended for both children and young people with care experience and children and young people with developmental disabilities, neglecting the intersectionality of both (Gajwani & Minnis, 2023). A focus on autism over other developmental disabilities was noticeable and when considering intersections of care experience and developmental disabilities it will be important for future research to consider other conditions such as FASD and ADHD (Gajwani & Minnis, 2023).
Alongside a lack of consensus on which measures to use, our review also identified a lack of consensus on how to assess or review existing measures and how to report psychometric properties. Reviews differed in their reporting of psychometric properties. For example, some reviews reported only psychometric data from the original studies in which measures were developed, while others synthesised information from subsequent independent studies. This made it difficult to include and compare information on psychometric data. There are existing guidelines on the reporting of psychometric properties, including frameworks by the American Psychological Association (Gehrig, 2019), and our findings point to a poor use of those frameworks. The importance of having consensus in how we assess outcome measures is furthermore highlighted by the COSMIN initiative, which provides guidelines and standards, and to which a number of the reviews in this study referred. Alongside reliability, validity and responsiveness, COSMIN advocates for a consideration of interpretability, which is also sometimes referred to as acceptability or utility. There was less consideration of utility and acceptability within our review of measures, compared to reliability, validity and responsiveness. Similarly, in their review of psychosocial interventions for maltreated children and young people, Macdonald et al. (2016) found that researchers often fail to consider issues of accessibility and acceptability. While high quality and evidence-based research relies on reliable and valid outcome measures, researchers have started to pay attention to their acceptability as well. This reflects the importance of children and young people understanding the questions and items asked and feeling that those reflect their experiences.
Thus, alongside psychometric assessments researchers have started to involve service users and experts by experience to evaluate and adapt assessment and treatment processes (Krause et al., 2021; Macdonald et al., 2016). Equally, the reviews focusing on developmental disabilities also highlighted the importance of involving children and young people (Davis et al., 2018). Reviews with a focus on autism discussed the significance of adapting self-report items and questions to ensure measures are accessible and inclusive (Ikeda et al., 2014; McConachie et al., 2015). The same is likely to apply to children and young people with care experience. Questions around family life and relationships need to be able to capture the diverse experiences of children and young people who might have experienced multiple placement changes, family conflict and for whom the concept of ‘family’ might be ambiguous or sensitive. Research with children and young people in care has highlighted that other significant people such as teachers, sports coaches or friends can be their closest relationships (Frederick et al., 2023), and it might be important to be more inclusive in the assessment process, asking young people who they trust and who the key people in their life are. As McCrae and Brown (2017) suggest: “Perhaps more of an issue than choosing screening tools with valid scientific properties is ensuring that instruments meet the needs of children and families.” (p. 784). Involving children and young people with care experience in the process of adapting and assessing measures is an important next step (Smales et al., 2020). This will also help us to understand children and young people’s experiences of assessment processes and how far these processes help researchers and practitioners to understand their experiences and facilitate engagement (Bradford & Rickwood, 2012; Tsang et al., 2012).
Additionally, in relation to children and young people with care experience and their families it is important to understand that the process of conducting assessments is a relational one. Children might find it difficult to engage in overly restrictive processes and may mistrust professionals due to past experiences (McCrae & Brown, 2017; Macdonald et al., 2016). Similarly, McConachie et al.’s (2015) work with autistic young people and professionals stressed how the use of measures that take a deficit view can impact negatively on the relationship and engagement between children and young people and the professionals who undertake a problem-focused assessment with them. Previous research has shown that clinical definitions of mental health can often be restrictive and not fully supported by the experiences of young people themselves or research (Macdonald et al., 2016; Zhang & Selwyn, 2019). This highlights the importance of thinking not only about which measures are used, but also about whether what is being measured matters to children and young people, how measures are used and how the assessment process impacts on children and young people.
Conclusion
It is hoped that this review adds to the ongoing consideration and development of approaches to more effectively and consistently measure the mental health outcomes of young people, including those who are care experienced and those who have developmental disabilities. Research designs which enable links across settings and countries will facilitate comparison, although there should be some caution about what is appropriate to compare. It should also be acknowledged that these are not all of the outcomes that may be important, but by seeking to use an internationally agreed set of mental health and well-being measures in research involving young people there is a greater likelihood of building a comprehensive understanding of the diversity and totality of needs, and how to meet these needs effectively. While a tension remains between having recommended outcome measures to enable consistency in the application of questions, items and scores, and ensuring that measures are sensitive to the contexts of different populations and settings, we agree with Krause et al. (2021) that it will be important to create greater consensus and to understand mental health measures as evolving tools that are co-owned and co-produced with those who should benefit from them, while upholding the value of reliability, validity and responsiveness.
Data Availability
Details of the search and review process can be obtained by contacting the corresponding author.
References
Addington, J., Stowkowy, J., & Weiser, M. (2015). Screening tools for clinical high risk for psychosis. Early Intervention in Psychiatry, 9, 345–356.
Ashra, H., Barnes, C., Stupple, E., & Maratos, F. A. (2021). A systematic review of self-report measures of negative self-referential emotions developed for non-clinical child and adolescent samples. Clinical Child and Family Psychology Review, 24, 224–243.
Atazadeh, N., Mahmoodi, H., & Shaghaghi, A. (2019). Posttrauma psychosocial effects in children: A systematic review of measurement scales. Journal of Child and Adolescent Psychiatric Nursing, 32, 149–161.
Banerjee, T., Khan, A., Ajmal, S., & Arora, R. (2021). 36 Overview of health needs of looked after children: observations from initial health assessment. BMJ Paediatrics Open, 5. https://doi.org/10.1136/bmjpo-2021-RCPCH.24
Bazalgette, L., Rahilly, T., & Trevelyan, G. (2015). Achieving emotional wellbeing for looked after children. London: National Society for the Prevention of Cruelty to Children (NSPCC). Available at: Achieving emotional wellbeing for looked after children: a whole system approach (fwsolutions.net).
Bear, H. A., Edbrooke-Childs, J., Norton, S., Krause, K. R., & Wolpert, M. (2020). Systematic review and meta-analysis: Outcomes of routine specialist mental health care for young people with depression and/or anxiety. Journal of the American Academy of Child & Adolescent Psychiatry, 59, 810–841.
Becker-Haimes, E. M., Tabachnick, A. R., Last, B. S., Stewart, R. E., Hasan-Granier, A., & Beidas, R. S. (2020). Evidence Base Update for Brief, Free, and Accessible Youth Mental Health Measures. Journal of Clinical Child and Adolescent Psychology, 49, 1–17.
Bennett, S. D., Coughtrey, A. E., Shafran, R., & Heyman, I. (2017). Measurement Issues: The measurement of obsessive compulsive disorder in children and young people in clinical practice. Child and Adolescent Mental Health, 22, 100–112.
Bentley, N., Hartley, S., & Bucci, S. (2019). Systematic Review of Self-Report Measures of General Mental Health and Wellbeing in Adolescent Mental Health. Clinical Child and Family Psychology Review, 22, 225–252.
Berghs, M., Atkin, K., Graham, H., Hatton, C., & Thomas, C. (2021). Implications for public health research of models and theories of disability: a scoping study and evidence synthesis. UK: National Institute for Health Research. Available at: Implications for public health research of models and theories of disability: a scoping study and evidence synthesis (whiterose.ac.uk).
Biederman, J., Monuteaux, M., Kendrick, E., Klein, K., & Faraone, S. (2005). The CBCL as a screen for psychiatric comorbidity in paediatric patients with ADHD. Archives of Disease in Childhood, 90, 1010–1015.
Birmaher, B., Khetarpal, S., Brent, D., Cully, M., Balach, L., Kaufman, J., & Neer, S. M. (1997). The screen for child anxiety related emotional disorders (SCARED): Scale construction and psychometric characteristics. Journal of the American Academy of Child & Adolescent Psychiatry, 36(4), 545–553.
Bradford, S., & Rickwood, D. (2012). Psychosocial assessments for young people: A systematic review examining acceptability, disclosure and engagement, and predictive utility. Adolescent Health, Medicine and Therapeutics, 3, 111–125.
Carnevale, T. (2011). An integrative review of adolescent depression screening instruments: Applicability for use by school nurses. Journal of Child and Adolescent Psychiatric Nursing, 24, 51–57.
Carter, T., Walker, G. M., Aubeeluck, A., & Manning, J. C. (2019). Assessment tools of immediate risk of self-harm and suicide in children and young people: A scoping review. Journal of Child Health Care, 23, 178–199.
Crowe, L. M., Beauchamp, M. H., Catroppa, C., & Anderson, V. (2011). Social function assessment tools for children and adolescents: A systematic review from 1988 to 2010. Clinical Psychology Review, 31, 767–785.
Davis, E., Young, D., Gilson, K. M., Swift, E., Chan, J., Gibbs, L., Tonmukayakul, U., Reddihough, D., & Williams, K. (2018). A Rights-Based Approach for Service Providers to Measure the Quality of Life of Children with a Disability. Value in Health, 21, 1419–1427.
Deighton, J., Croudace, T., Fonagy, P., Brown, J., Patalay, P., & Wolpert, M. (2014). Measuring mental health and wellbeing outcomes for children and adolescents to inform practice and policy: a review of child self-report measures. Child and Adolescent Mental Health, 8, 1–14.
Ebesutani, C., Bernstein, A., Nakamura, B. J., Chorpita, B. F., Weisz, J. R., & Research Network on Youth Mental Health*. (2010). A psychometric analysis of the revised child anxiety and depression scale—parent version in a clinical sample. Journal of Abnormal Child Psychology, 38, 249–260.
Ebesutani, C., Reise, S. P., Chorpita, B. F., Ale, C., Regan, J., Young, J., & Weisz, J. R. (2012). The revised child anxiety and depression scale-short version: Scale reduction via exploratory bifactor modeling of the broad anxiety factor. Psychological Assessment, 24(4), 833.
Eklund, K., Rossen, E., Koriakin, T., Chafouleas, S. M., & Resnick, C. (2018). A systematic review of trauma screening measures for children and adolescents. School Psychology Quarterly, 33, 30–43.
Emerson, E., & Hatton, C. (2007). Mental health of children and adolescents with intellectual disabilities in Britain. The British Journal of Psychiatry, 191(6), 493–499.
Erford, B. T., Bardhoshi, G., Haecker, P., Shingari, B., Schleichter, J., & Atalay, Z. (2018). Selecting Assessment Instruments for Problem Behavior Outcome Research With Youth. Measurement and Evaluation in Counseling and Development, 52, 52–68.
Farber, G., Wolpert, M., & Kemmer, D. (2020). Common measures for mental health science. Laying the foundation. National Institute of Mental Health, Wellcome Trust and International Alliance Mental Health Research Funders. Available at: CMB-and-CMA-July-2020-pdf.pdf (wellcome.org).
Fleming, M., McLay, J. S., Clark, D., King, A., Mackay, D. F., Minnis, H., & Pell, J. P. (2021). Educational and health outcomes of schoolchildren in local authority care in Scotland: A retrospective record linkage study. PLoS Medicine, 18(11), e1003832.
Frederick, J., Spratt, T., & Devaney, J. (2023). Supportive Relationships with Trusted Adults for Children and Young People Who Have Experienced Adversities: Implications for Social Work Service Provision. The British Journal of Social Work, bcad107.
Frith, E. (2016). CentreForum Commission on Children and Young People’s Mental Health: State of the Nation. CentreForum.
Gajwani, R., & Minnis, H. (2023). Double jeopardy: Implications of neurodevelopmental conditions and adverse childhood experiences for child health. European Child & Adolescent Psychiatry, 32(1), 1–4.
Gehrig, E. (2019). American psychological association guidelines on psychometric assessment validity and reliability requirements. Available at: APA_val_rel_requirements.pdf (ttisi.com).
Goodman, R. (1997). The strengths and difficulties questionnaire: A research note. Journal of Child Psychology and Psychiatry, 38(5), 581–586.
Grondhuis, S. N., & Aman, M. G. (2012). Assessment of anxiety in children and adolescents with autism spectrum disorders. Research in Autism Spectrum Disorders, 6, 1345–1365.
Hall, C. L., Valentine, A. Z., Groom, M. J., Walker, G. M., Sayal, K., Daley, D., & Hollis, C. (2016). The clinical utility of the continuous performance test and objective measures of activity for diagnosing and monitoring ADHD in children: A systematic review. European Child and Adolescent Psychiatry, 25, 677–699.
Halle, T. G., & Darling-Churchill, K. E. (2016). Review of measures of social and emotional development. Journal of Applied Developmental Psychology, 45, 8–18.
Hallett, V., Ronald, A., Colvert, E., Ames, C., Woodhouse, E., Lietz, S., & Happe, F. (2013). Exploring anxiety symptoms in a large-scale twin study of children with autism spectrum disorders, their co-twins and controls. Journal of Child Psychology and Psychiatry, 54(11), 1176–1185.
Hanratty, J., Livingstone, N., Robalino, S., Terwee, C. B., Glod, M., Oono, I. P., Rodgers, J., Macdonald, G., & McConachie, H. (2015). Systematic review of the measurement properties of tools used to measure behaviour problems in young children with autism. PLoS ONE, 10, e0144649.
Harris, I. M., Beese, S., & Moore, D. (2019). Predicting future self-harm or suicide in adolescents: A systematic review of risk assessment scales/tools. BMJ Open, 9, e029311.
Hock, R., Priester, M. A., Iachini, A. L., Browne, T., Dehart, D., & Clone, S. (2015). A Review of family engagement measures for adolescent substance use services. Journal of Child and Family Studies, 24, 3700–3710.
Honey, A., Emerson, E., & Llewellyn, G. (2011). The mental health of young people with disabilities: impact of social conditions. Social Psychiatry and Psychiatric Epidemiology, 46, 1–10.
Howe, S. J., Hewitt, K., Baraskewich, J., Cassidy, S., & McMorris, C. A. (2020). Suicidality among children and youth with and without autism spectrum disorder: a systematic review of existing risk assessment tools. Journal of Autism and Developmental Disorders, 50, 3462–3476.
Ikeda, E., Hinckson, E., & Krageloh, C. (2014). Assessment of quality of life in children and youth with autism spectrum disorder: A critical review. Quality of Life Research, 23, 1069–1085.
Janssens, A., Rogers, M., Gumm, R., Jenkinson, C., Tennant, A., Logan, S., & Morris, C. (2016). Measurement properties of multidimensional patient-reported outcome measures in neurodisability: A systematic review of evaluation studies. Developmental Medicine and Child Neurology, 58, 437–451.
Jellinek, M. S., Murphy, J. M., Robinson, J., Feins, A., Lamb, S., & Fenton, T. (1988). Pediatric symptom checklist: Screening school-age children for psychosocial dysfunction. The Journal of Pediatrics, 112(2), 201–209.
Kessler, R. C., Amminger, G. P., Aguilar-Gaxiola, S., Alonso, J., Lee, S., & Ustun, T. B. (2007). Age of onset of mental disorders: A review of recent literature. Current Opinion in Psychiatry, 20, 359.
Krause, K. R., Chung, S., Adewuya, A. O., Albano, A. M., Babins-Wagner, R., Birkinshaw, L., Brann, P., Creswell, C., Delaney, K., Falissard, B., Forrest, C. B., Hudson, J. L., Ishikawa, S.-I., Khatwani, M., Kieling, C., Krause, J., Malik, K., Martínez, V., Mughal, F., … & Wolpert, M. (2021). International consensus on a standard set of outcome measures for child and youth anxiety, depression, obsessive-compulsive disorder, and post-traumatic stress disorder. The Lancet Psychiatry, 8, 76–86.
Krause, K. R., Jacob, J., Szatmari, P., & Hayes, D. (2022). Readability of commonly used quality of life outcome measures for youth self-report. International Journal of Environmental Research and Public Health, 19(15), 9555.
Kreiser, N. L., & White, S. W. (2014). Assessment of social anxiety in children and adolescents with autism spectrum disorder. Clinical Psychology: Science and Practice, 21(1), 18–31.
Kwan, B., & Rickwood, D. J. (2015). A systematic review of mental health outcome measures for young people aged 12 to 25 years. BMC Psychiatry, 15, 279.
Lecavalier, L., Wood, J. J., Halladay, A. K., Jones, N. E., Aman, M. G., Cook, E. H., Handen, B. L., King, B. H., Pearson, D. A., Hallett, V., Sullivan, K. A., Grondhuis, S., Bishop, S. L., Horrigan, J. P., Dawson, G., & Scahill, L. (2014). Measuring anxiety as a treatment endpoint in youth with autism spectrum disorder. Journal of Autism and Developmental Disorders, 44, 1128–1143.
Losada-Puente, L., Araújo, A. M., & Muñoz-Cantero, J. M. (2019). A systematic review of the assessment of quality of life in adolescents. Social Indicators Research, 147, 1039–1057.
Luke, N., Sinclair, I., Woolgar, M., & Sebba, J. (2014). What works in preventing and treating poor mental health in looked after children. NSPCC.
Macdonald, G., Livingstone, N., Hanratty, J., McCartan, C., Cotmor, E. R., Cary, M., Glaser, D., Byford, S., Welton, N. J., & Bosqui, T. (2016). The effectiveness, acceptability and cost-effectiveness of psychosocial interventions for maltreated children and adolescents: an evidence synthesis. Health Technology Assessment (Winchester, England), 20, 1–163.
Matson, J. L., & Cervantes, P. E. (2014). Assessing aggression in persons with autism spectrum disorders: An overview. Research in Developmental Disabilities, 35, 3269–3275.
McConachie, H., Parr, J. R., Glod, M., Hanratty, J., Livingstone, N., Oono, I. P., Robalino, S., Baird, G., Beresford, B., Charman, T., Garland, D., Green, J., Gringras, P., Jones, G., Law, J., le Couteur, A. S., Macdonald, G., McColl, E. M., Morris, C., … Williams, K. (2015). Systematic review of tools to measure outcomes for young children with autism spectrum disorder. Health Technology Assessment, 19, 1–506.
McCrae, J. S., & Brown, S. M. (2017). Systematic Review of Social-Emotional Screening Instruments for Young Children in Child Welfare. Research on Social Work Practice, 28, 767–788.
McKenna, S., O’Reilly, D., & Maguire, A. (2023). The mental health of all children in contact with social services: A population-wide record-linkage study in Northern Ireland. Epidemiology and Psychiatric Sciences, 32, e35.
Mierau, J. O., Kann-Weedage, D., Hoekstra, P. J., Spiegelaar, L., Jansen, D., Vermeulen, K. M., Reijneveld, S. A., van den Hoofdakker, B. J., Buskens, E., van den Akker-van Marle, M. E., Dirksen, C. D., & Groenman, A. P. (2020). Assessing quality of life in psychosocial and mental health disorders in children: A comprehensive overview and appraisal of generic health related quality of life measures. BMC Pediatrics, 20, 329.
Newton, A. S., Soleimani, A., Kirkland, S. W., & Gokiert, R. J. (2017). A systematic review of instruments to identify mental health and substance use problems among children in the emergency department. Academic Emergency Medicine, 24, 552–568.
NICE. (2015). Guideline on Looked After Children and Young People. National Institute for Health and Care Excellence.
NICE. (2021). Guideline on looked after children and young people. London: National Institute for Health and Care Excellence.
Pilowsky, D. J., & Wu, L. T. (2013). Screening instruments for substance use and brief interventions targeting adolescents in primary care: A literature review. Addictive Behaviors, 38, 2146–2153.
Posner, K., Brown, G. K., Stanley, B., Brent, D. A., Yershova, K. V., Oquendo, M. A., Currier, G. W., Melvin, G. A., Greenhill, L., Shen, S., & Mann, J. J. (2011). The Columbia–suicide severity rating scale: Initial validity and internal consistency findings from three multisite studies with adolescents and adults. American Journal of Psychiatry, 168(12), 1266–1277.
Rosanbalm, K. D., Snyder, E. H., Lawrence, C. N., Coleman, K., Frey, J. J., van den Ende, J. B., & Dodge, K. A. (2016). Child wellbeing assessment in child welfare: A review of four measures. Children and Youth Services Review, 68, 1–16.
Rose, T., Joe, S., Williams, A., Harris, R., Betz, G., & Stewart-Brown, S. (2017). Measuring mental wellbeing among adolescents: a systematic review of instruments. Journal of Child and Family Studies, 26, 2349–2362.
Santor, D. A., Gregus, M., & Welch, A. (2006). Eight decades of measurement in depression. Measurement, 4, 135–155.
Satapathy, S., Dang, S., Sagar, R., & Dwivedi, S. N. (2020). Resilience in Children and Adolescents Survived Psychologically Traumatic Life Events: A Critical Review of Application of Resilience Assessment Tools for Clinical Referral and Intervention. Trauma, Violence, & Abuse, 1524838020939126.
Sayal, K., Prasad, V., Daley, D., Ford, T., & Coghill, D. (2018). ADHD in children and young people: prevalence, care pathways, and service provision. The Lancet Psychiatry, 5(2), 175–186.
Silverman, W. K., & Albano, A. M. (1996). Anxiety disorders interview schedule for DSM-IV: Child version. Oxford University Press.
Slade, M. (2002). What outcomes to measure in routine mental health services, and how to assess them: A systematic review. Australian & New Zealand Journal of Psychiatry, 36(6), 743–753.
Smales, M., Savaglio, M., Webster, S., Skouteris, H., Pizzirani, B., O’Donnell, R., & Green, R. (2020). Are the voices of young people living in out-of-home care represented in research examining their health?: A systematic review of the literature. Children and Youth Services Review, 113, 104966.
Smith, V., Devane, D., Begley, C. M., & Clarke, M. (2011). Methodology in conducting a systematic review of systematic reviews of healthcare interventions. BMC Medical Research Methodology, 11(1), 1–6.
Spence, S. H. (1997). Spence children’s anxiety scale. Journal of Anxiety Disorders.
Sterling, L., Renno, P., Storch, E. A., Ehrenreich-May, J., Lewin, A. B., Arnold, E., Lin, E., & Wood, J. (2015). Validity of the revised children’s anxiety and depression scale for youth with autism spectrum disorders. Autism, 19, 113–117.
Sterzing, P. R., Shattuck, P. T., Narendorf, S. C., Wagner, M., & Cooper, B. P. (2012). Bullying involvement and autism spectrum disorders: Prevalence and correlates of bullying involvement among adolescents with an autism spectrum disorder. Archives of Pediatrics & Adolescent Medicine, 166(11), 1058–1064.
Tsang, K. L., Wong, P. Y., & Lo, S. K. (2012). Assessing psychosocial well-being of adolescents: A systematic review of measuring instruments. Child: Care, Health and Development, 38, 629–646.
Tulbure, B. T., Szentagotai, A., Dobrean, A., & David, D. (2012). Evidence based clinical assessment of child and adolescent social phobia: A critical review of rating scales. Child Psychiatry and Human Development, 43, 795–820.
Varni, J. W., Seid, M., & Rode, C. A. (1999). The PedsQL™: measurement model for the pediatric quality of life inventory. Medical care, 37, 126–139.
Wigham, S., & McConachie, H. (2014). Systematic review of the properties of tools used to measure outcomes in anxiety intervention studies for children with autism spectrum disorders. PLoS ONE, 9, e85268.
Zhang, M. F., & Selwyn, J. (2019). The subjective well-being of children and young people in out of home care: Psychometric analyses of the “Your Life, your Care” survey. Child Indicators Research, 1, 24.
Zima, B. T., Marti, F. A., Lee, C. E., & Pourat, N. (2019). Selection of a child clinical outcome measure for statewide use in publicly funded outpatient mental health programs. Psychiatric Services (Washington, D.C.), 70, 381–388.
Funding
The review was funded as part of a larger project by the Medical Research Council.
Ethics declarations
Conflicts of Interest
The authors declare no conflicts of interest.
Appendix
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Jacobs, P., Power, L., Davidson, G. et al. A Scoping Review of Mental Health and Wellbeing Outcome Measures for Children and Young People: Implications for Children in Out-of-home Care. Journ Child Adol Trauma 17, 159–185 (2024). https://doi.org/10.1007/s40653-023-00566-6
DOI: https://doi.org/10.1007/s40653-023-00566-6