Introduction

Pediatric type 2 diabetes mellitus (T2DM) is a relatively new disease that is escalating due to the global rise in childhood obesity [1]. While close to 420 million adults have T2DM around the world, an increasing number of children are being diagnosed with T2DM and are transitioning to adult care, and this trend is likely to increase exponentially [2]. In the USA, there has been a 30.5% increase in the prevalence of pediatric T2DM between 2001 and 2009 [3].

On a mechanistic level, many adults present with risk factors for T2DM in the early years of life, and while genetics likely play a role in this diabetes risk, environmental and epigenetic factors lead the way in the development of diabetes in youth and young adults [4].

Pediatric T2DM often occurs among those who have existing medical conditions, e.g., transplant patients, and is associated with the development of comorbidities early in the course of the disease. These comorbidities include dyslipidemia, hypertension, non-alcoholic fatty liver disease, nephropathy, and obstructive sleep apnea and will likely contribute to adverse long-term cardiovascular outcomes in this population [3, 5]. While pediatric T2DM has been reported most frequently among certain ethnic groups (e.g., Indigenous North American communities, South Asians, and Latinos), a global spread of the condition has been reported [2]. This highlights the need for guidance on management developed for various populations. The increasing burden of T2DM has major implications for the quality of life, longevity, and healthcare system use in these patients globally [6]. Due to the relative novelty of this disease in pediatric populations and the importance of early management to avoid future complications, it is imperative that high-quality clinical practice guidelines (CPGs) are available for clinicians. CPGs are statements consisting of clinical recommendations that aid healthcare providers in decision-making and also inform policy-related and system-level decisions [7]. Systematically developed and evidence-based CPGs support best practices and have the potential to optimize patient care. Guidance for clinicians on managing pediatric T2DM has emerged in the past decade, although its quality has not been evaluated. Assessing the quality of CPGs for pediatric T2DM management will help determine areas requiring improvement in future guidelines and updates, in addition to providing information about overall quality to clinicians who consult these guidelines.

The objectives of this systematic review are to (1) evaluate the quality of published CPGs for the management of pediatric T2DM using the Appraisal of Guidelines for Research and Evaluation II (AGREE II) tool and (2) assess the change in quality of guidelines within an organization over time when applicable.

Methods

This systematic review is registered with PROSPERO (registration number: CRD42016034187), and detailed methods are available in the published protocol [8]. This systematic review is reported according to the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines and a checklist is available as a supplementary material (Additional file 1) [9].

Eligibility criteria

We included CPGs from national and international diabetes and pediatric associations that were published as standalone guidelines for T2DM in children and adolescents (2–18 years of age). We also included guidelines targeting T2DM in adults and pediatric type 1 diabetes if a section addressing T2DM management in children and adolescents was available. In order to determine the changes in quality of reporting over time, we retrieved the two most recent guidelines from each organization when available. For this review, we excluded all CPGs that were developed exclusively for use within a single institution and those that were not published or were under development at the time of our appraisal. We excluded individual systematic reviews and primary studies that provided recommendations on the management of T2DM. Guidelines addressing drug-induced and genetic forms of diabetes were also excluded [8].

Search strategy

We searched MEDLINE, Embase, Cumulative Index to Nursing and Allied Health Literature (CINAHL), Turning research into practice (Trip) database, and the National Guideline Clearinghouse (guideline.gov) from inception through January 2016 to identify eligible CPGs. The database search was updated in August 2017. As we were aware of the Diabetes Canada guidelines launch in 2018, we updated the included guidelines accordingly. The complete search strategy used to retrieve guidelines is available as supplementary material (Additional file 2). Additionally, we extended our search to include guidelines that were not published in indexed journals. Four authors identified national and international diabetes and pediatric associations around the world and conducted a comprehensive search of their webpages for eligible guidelines. Earlier versions of CPGs were also obtained from the organization’s webpage, when available. We further extended our search by contacting the organizations in July 2017 to inquire about the two latest CPGs issued for pediatric T2DM, when it was unclear if they had earlier versions of guidelines from the webpage. The search was not limited by language, and we used Google Translate to appraise eligible guidelines that were not available in English. We judged the accuracy of the translation by back-translating the guidelines to the original language and assessed the publication for overall similarity. Our review required the use of Google Translate for guidelines in the following languages: Portuguese, Japanese, Italian, Greek, Latvian, Lithuanian, Dutch, Polish, Slovenian, and Thai. The overall accuracy of Google Translate has been examined by a 2011 study, finding acceptable scores (> 70 out of 100) for Portuguese, Italian, Greek, Dutch, and Slovenian [10]. However, the same study found lower scores (< 50 out of 100) for Japanese, Latvian, Lithuanian, and Polish and a poor score (0 out of 100) for Thai [10]. Another study on the use of Google Translate specifically during data extraction for systematic reviews examined agreement between reviewers, showing good agreement for Portuguese and fair agreement for Japanese, but did not examine other languages translated for our review [11]. Since we appraised the overall guideline based on broad criteria rather than extracting specific data points, we expected greater accuracy than when conducting data extraction for systematic reviews.

Study selection and data extraction

Trained independent reviewers conducted title and abstract screening and full-text review of eligible studies in duplicate. Disagreements between reviewers were resolved by discussion to consensus, and an expert Pediatric Endocrinologist was consulted to resolve persisting disagreements. Two authors independently extracted general study characteristics (title, issuing society/association, country, year of publication) and details of management recommendations for T2DM including diagnostic criteria, treatment recommendations, screening tests for comorbidities, and frequency of screening.

Quality appraisal

The AGREE II tool was used to evaluate the quality of CPGs included in this review [12]. The 23-item instrument evaluates guideline methodology and quality of reporting across six domains: (1) Scope and purpose, (2) Stakeholder involvement, (3) Rigour of development, (4) Clarity of presentation, (5) Applicability, and (6) Editorial independence. Each domain consists of a number of statements pertaining to the topic, and individual appraisers rank each statement on a 7-point Likert scale, varying from strongly disagree to strongly agree (1 to 7, respectively). Domain scores from individual appraisers are summed and then scaled to a percentage of the maximum possible score for that domain. The maximum and minimum possible scores are determined based on the number of statements pertaining to each domain and the number of appraisers, and the following formula is used to calculate the scaled domain score: \( \frac{\mathrm{Obtained}\ \mathrm{score}-\mathrm{Minimum}\ \mathrm{possible}\ \mathrm{score}}{\mathrm{Maximum}\ \mathrm{possible}\ \mathrm{score}-\mathrm{Minimum}\ \mathrm{possible}\ \mathrm{score}} \). Therefore, higher scores indicate higher quality. The domain scores are independent and are not aggregated into a single quality score. Additionally, the instrument requires appraisers to provide an overall judgment of guideline quality and whether the guideline can be recommended for use with possible responses of “yes,” “yes with modifications,” and “no.”

The AGREE II developers recommend at least two but preferably four appraisers in order to increase the reliability of the assessment [13]. In the current systematic review, four trained independent appraisers evaluated eligible CPGs using the AGREE II tool. When guidelines included a pediatric T2DM section but also addressed adult diabetes or pediatric type 1 diabetes, the overall guideline methods were used to assess stakeholder involvement, rigour of development, and editorial independence whereas the remaining domains were assessed specifically for the pediatric T2DM chapter.

Data synthesis

All statistical analyses were conducted using SPSS Statistics Version 23 (IBM SPSS Statistics for Macintosh, Version 23.0, Armonk, NY). We calculated agreement between reviewers for guideline eligibility using Cohen’s kappa statistic and inter-rater reliability using the intraclass correlation coefficient (ICC). We summarized general characteristics of CPGs using descriptive statistics.

For each included CPG, we calculated a quality score for each of the six AGREE II domains by summing up the appraisers’ scores for individual items and scaling the total as a percentage of the maximum possible score for that domain [12]. The overall scores of the included CPGs are presented for each AGREE II domain using summary statistics (mean, SD). The AGREE II domains were evaluated independently, such that the appraisal scores for the domains were not aggregated. We conducted paired t-tests to compare mean domain scores for the earlier and latest versions of guidelines, when available, to determine change in quality over time. The level of significance was set at alpha = 0.05.

Results

Following screening of 6207 citations, we reviewed 153 articles and included 21 unique CPGs published by national and international organizations [14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34]. Reasons for exclusion during full text review were non-CPG articles (i.e., systematic reviews, narrative reviews, non-guideline reports) and lack of a pediatric T2DM section within guideline documents. Figure 1 shows the flow diagram for study inclusion. We also identified earlier versions for 7 of 21 included guidelines [35,36,37,38,39,40,41]. The kappa for full-text review was 0.87 (95% confidence interval 0.77 to 0.96), indicating excellent inter-rater agreement [42].

Fig. 1
figure 1

Flow diagram for selection process of pediatric type 2 diabetes guidelines

Characteristics of included guidelines

Table 1 outlines the basic characteristics of the included guidelines, and Table 2 summarizes the screening and management recommendations from each guideline. The abstraction of diagnostic criteria of T2DM assumed that HbA1c is not used alone for the diagnosis of pediatric patients.

Table 1 Basic characteristics of included guidelines
Table 2 Screening, diagnostic criteria and management recommendations for pediatric type 2 diabetes

The guidelines reported on diabetes management in the context of an education program that provides the knowledge and skills needed to manage diabetes, manned by qualified personnel. Though recommendations varied slightly, most guidelines recommended screening every 2 years for T2DM by means of fasting or random glucose tests in the presence of multiple risk factors (i.e., obesity, first-degree relative with T2DM, high-risk ethnic group, signs or symptoms of insulin resistance). Management recommendations included lifestyle changes such as physical activity and limiting screen time, as well as metformin and insulin therapy if needed. Target HbA1c varied but generally was ≤ 7.0% (Table 2).

Most of the CPGs were developed or last updated after 2012. Only one of the included guidelines from the Latvian Diabetes Association was published earlier than 2010, before the development of the AGREE II tool [19]. Of the 21 guidelines, 10 (47.6%) originated from Europe. The majority of the CPGs were developed by diabetes societies or associations (81.0%), whereas the remaining CPGs were developed by government organizations. In terms of guideline scope, 5 of the 21 (23.8%) guidelines were specific to pediatric populations while the remainder of the included guidelines mainly addressed adult T2DM and contained a section for pediatric T2DM.

Guideline quality scores

The mean scores for individual domains are presented in Table 3. The ICC statistics for the domains varied from 0.85 to 0.95 indicating good to excellent inter-rater reliability for appraisals (Additional file 3). The lowest mean scores were in the “Editorial Independence” domain (mean 44%, SD 33.83) and the “Rigour of Development” domain (mean 46%, SD 29.73), indicating low quality overall for reported guideline development methodology. Scores for “Editorial Independence” were mainly due to lack of reporting and managing conflicts of interest, and less due to involvement of a funding body in guideline development. The domain with the most acceptable score was “Clarity of Presentation” (mean 72%, SD 18.97), as most guidelines distinctly highlighted recommendations for treatment. The mean score for “Scope and Purpose” was 69% (SD 20.22), “Stakeholder Involvement” was 58% (SD 26.41), and “Applicability” was 48% (SD 27.69).

Table 3 Quality of included guidelines on six domains using AGREE II tool

The CPG published by the American Academy of Pediatrics and the Scottish Intercollegiate Guidelines Network (SIGN) were the highest ranked for “Scope and Purpose.” The SIGN guideline also ranked highest quality for “Rigour of Development” and “Stakeholder Involvement.” The American Diabetes Association scored highest for “Applicability.” The American Academy of Pediatrics had the highest score for “Clarity of Presentation,” while the German Diabetes Association guidance had the highest score for “Editorial Independence.” Overall, the highest scoring guidelines across domains included the American Academy of Pediatrics, American Diabetes Association, Diabetes Canada (formerly known as the Canadian Diabetes Association), National Institute for Health and Care Excellence (NICE), SIGN, and both international organizations (International Diabetes Federation and International Society for Pediatric and Adolescent Diabetes).

Table 4 shows the comparison of mean scores in each domain for the two most recent versions of CPGs from the American Diabetes Association (2000 and 2016), Brazilian Diabetes Society (2014–2015 and 2015–2016), Diabetes Canada (2013 and 2018), Italian Diabetes Society (2007 and 2014), Singapore Ministry of Health (2006 and 2014), and Society for Endocrinology, Metabolism, and Diabetes of South Africa (2012 and 2017), and the International Society for Pediatric and Adolescent Diabetes (2009 and 2014). The comparison of mean domain scores indicated that more recent versions of guidelines had significantly higher scores than the earlier versions for “Scope and Purpose” (mean 54.43, SD 13.84 for earlier version; mean 71.14, SD 19.66 for latest version; p = 0.005) and “Stakeholder Involvement” (mean 43.00, SD 13.98 for earlier version; mean 65.57 SD 23.80 for latest version; p = 0.01). There was no significant difference in the mean scores for any other domains between the earlier and most recent versions of the CPGs (p values > 0.05). However, despite the lack of statistical significance, there was an increasing trend in mean scores, such that the more recently published versions of guidelines had higher scores for each domain in comparison to the earlier versions.

Table 4 Comparison of AGREE II mean domain scores to examine change in overall quality

Discussion

Main findings

Pediatric T2DM is a relatively new disease, and these children will live with this disease for decades longer that adults with T2DM. Therefore, clear evidence-based guidance on the management of diabetes and its comorbidities are paramount to mitigate long-term effects and maintaining good health outcomes. In this review, our goal was to evaluate the current guidelines for management of pediatric T2DM using the AGREE II tool. Findings from this review indicate that the overall quality of CPGs on the management of pediatric T2DM is moderate to low, and can be improved, given that the overall mean scores are below 50% for three of the six domains of the AGREE II tool.

Importantly, when comparing earlier versions of CPGs with the most recent versions, we found significant improvement in the “Scope and Purpose” and “Stakeholder Involvement” mean scores, but no significant change in the mean scores for the remaining domains. Although progress in defining overall purpose and objectives for guidelines and increased involvement of stakeholders indicate improvement over time, there remains a need for improvement in reporting of methodology, appropriate reporting and handling of competing interests, and description of implementation approaches for guideline recommendations. It is important to note, however, that there was an increasing trend in all of the domain scores, such that most recently published versions of the included CPGs had higher scores for each domain in comparison to the earlier versions. Results may not have shown statistical significance given the small sample size (n = 7) and especially within the domains that already had somewhat high scores in the earlier versions (i.e., Clarity of Presentation).

The lowest scoring domains were “Rigour of Development,” “Editorial Independence,” and “Applicability.” Low scores in these three domains are especially significant, considering that they directly evaluate the methodological quality, management of conflicts of interest, and details of CPG implementation and monitoring. If guidelines are not effectively developed and reported, then this can impact clinical practice and patient outcomes. Subsequently, if guidelines are developed with no guidance for monitoring or auditing implementation, similar issues can arise.

For “Rigour of Development,” it may be that guidelines are using limited evidence-based approaches in their development methods or are not fully reporting methodological details. The body of evidence for the management for pediatric T2DM is still developing due to its relatively novelty; however, guideline developers should still aim for an evidence-based approach by utilizing available indirect evidence and explicitly stating when clinical expertise was used in the absence of relevant evidence.

However, the concern with a number of CPGs in this area is the lack of reporting and transparency with regard to guideline development methodology. The RIGHT statement is a recently developed resource for the reporting of CPGs [43] that is endorsed by the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) Network, an international initiative to improve quality of reporting in published research. With the application of the RIGHT statement in the CPG development process, improvements in reporting can lead to higher quality of the pediatric T2DM guidelines, especially in areas of “Rigour of Development” and “Editorial Independence.”

For the generally low “Applicability” domain score observed in the included guidelines, it may be that guideline developers intend for the implementation details such as organizational barriers and cost implications to be published separately from the main body of the guideline. However, plans to publish this information should be indicated in the main guideline document. Additionally, conducting implementation analyses with the involvement of health policy experts can aid in the translation of evidence into practice. For instance, involving policy-level work early in the development of the guideline can help create opportunities and collaborations to mobilize public health campaigns for prevention efforts around pediatric T2DM [44]. Early collaboration with public health experts, patients, and parent representatives, followed by involvement of these individuals on guideline panels can also improve “Stakeholder Involvement” and increase the quality of guidelines.

The majority of high-quality guidelines were developed by organizations in countries with potentially more resources and funding for research (e.g., UK, USA, Canada). Additionally, we did not identify any available guidelines for a number of countries. In areas where they may be a lack of funding and resources for guideline development, organizations can also utilize the ADAPTE guideline adaptation process for their institutions and countries, which involves updating and adapting existing high-quality guidelines to local settings rather than to undergo de novo development [45]. This may be useful for pediatric T2DM, since published CPGs from NICE, American Academy of Pediatrics, Diabetes Canada, and others are recommended for use after undergoing quality assessment (Table 3). Also for individual institutions and local organizations considering the development of CPGs for T2DM, it may be favorable to utilize the ADAPTE process to modify a guideline specific to their local context and population.

Strengths and limitations

This review has several strengths. We conducted a comprehensive search of diabetes and pediatric organizations globally in order to identify eligible CPGs for the review. Four trained assessors evaluated the quality of included guidelines, increasing the reliability of the appraisals. Additionally, we used the AGREE II tool, which has established reliability and validity, to conduct quality assessment [13, 46].

A limitation of this review was the use of Google Translate to assess guidelines that were not available in English. We utilized this method because it was unfeasible due to time and resources to identify four independent bilingual appraisers that were fluent for the included language(s) as well as English, could undergo AGREE II training, and conduct appraisal for the guidelines. Therefore, we used the back-translate approach, from the language of publication to English and back to the language of publication, to determine the overall similarity of the translation to the original document. When using the back-translation approach, we did not note any substantial differences in the text—however, this may be due to use of the same software (Google Translate) for original and back-translation. We note that Google Translate may not be the ideal method for all of our included languages, specifically for the Thai guideline; however, we aimed to maintain a broad scope for the review and determined that it was the best and most feasible method [11, 47]. Also in order to conduct a broad review, we included CPGs with broader scope that incorporated a section dedicated to pediatric T2DM. This may have limited the review if guidelines with different primary objectives (i.e., type 1 diabetes guidelines, guidelines for adult diabetes) received a lower AGREE II score when evaluated for the quality of recommendations pertaining to pediatric patients. However, we appraised all included CPGs for overall methodology and development irrespective of the pediatric recommendations.

Clinical and research implications

This review has major implications as it directly affects clinical practice. The rise in T2DM in children and youth is a serious public health concern, and more patients will be transitioned to adult care having been diagnosed with T2DM as children. These patients will deal with the burden of living with this chronic disease and its comorbidities longer than those diagnosed with T2DM during adulthood. It is important that rigourous methods are used to develop CPGs and reported with transparency for the knowledge end users. Given that pediatric T2DM has emerged only recently, it is especially imperative that clinicians have adequate guidance for its management.

The quality of CPGs may improve with the widespread implementation of reporting guidelines. This trend in improvement has been observed within other types of studies, such as randomized controlled trials [48]. Along with quality of reporting, direct improvements can be made to these guidelines by (1) involving relevant stakeholders, including patient representatives and experts throughout the guideline development process, (2) reporting the guideline development methodology in detail, and (3) discussing barriers and facilitators for implementation of the guideline recommendations. Given that management recommendations for pediatric T2DM involve behavioral changes and self-driven interventions as well as pharmacotherapy, it is especially important to include patients’ and parents’ perspectives to improve guideline development and implementation.

Conclusions

Overall, two thirds of the pediatric T2DM guidelines were rated moderate to low quality and the remaining third ranked higher in quality. There are areas requiring significant improvement such as rigour of development, editorial independence, applicability including tools and barriers for implementation, and stakeholder involvement. Deficiencies noted in guideline development methodology and quality of reporting can be improved by following published reporting statements. Also comparisons of the AGREE II scores between different CPGs suggest that some organizations and societies may have greater access to the substantial resources and time needed for de novo development of CPGs. It may be useful for other organizations to use a guideline adaptation process to achieve high-quality guidance for clinicians in a given setting.