Background

Periodontitis is a progressive inflammatory disease and continues to be a major etiology for tooth loss [1]. According to the data from the National Health and Nutrition Examination Survey (NHANES) 2009–12, the prevalence of periodontitis is 46%, with nearly 9% of severe cases of periodontitis in the US population aged ≥ 30 years [2]. Providing comprehensive periodontal treatment in a timely manner is important to save natural teeth.

Diagnosis is the first important decision made by a practitioner for the patient and guides treatment planning. Moreover, diagnosis reflects the practitioner’s knowledge, clinical insight, and problem-solving skills and it depends on two abilities: skill in detection during the examination and knowledge of the definition and criteria applied for identification of a disease or condition [3]. Dental practitioners have noticed that there are obvious differences in the presentation of periodontal diseases among patients and have attempted to classify periodontal diseases [4]. Classification systems have been developed to aid diagnosis and treatment decisions [5], yet even with these systems in place, considerable disagreements in diagnosis and classification have been reported among dental practitioners when using the 1999 classification of periodontal disease [6, 7]. These inconsistencies could cause over- or underestimation of the severity of periodontal disease and may lead to inappropriate treatment selection for patients.

The 1999 classification of periodontal disease has been used during the past 17 years. Nevertheless, the 1999 system has several important weaknesses, including substantial overlap and a lack of clear pathobiology-based distinction between the specified categories, diagnostic indistinctness, and difficulties in implementation [8]. While general practitioners (GPs) are patients’ initial contact for seeking periodontal treatments and a primary source for referrals to periodontists, a study found that only 62% of GPs felt confident in diagnosing aggressive periodontitis [9].

The 2018 classification was published to update the 1999 classification with scientific evidence. The diseases previously recognized as “chronic” or “aggressive” are now grouped under a single category, “periodontitis” and are further characterized based on a multidimensional staging and grading system [8, 10]. Staging is designed to categorize the severity and extent of periodontitis and is determined based on the levels of clinical attachment loss (CAL) and the percentage of radiographic bone loss (RBL) around teeth. Grading is intended to indicate the rate of disease progression. The current classification recommends that clinicians should initially assign grade B and look for specific evidence to move to grade A or C. Risk factors for periodontal disease, such as diabetes and smoking, are introduced as a grade modifier [8].

To the best of our knowledge to date, no study has investigated variations in periodontal diagnosis and classification among dental practitioners with different educational backgrounds using the current classification of periodontal disease. Ability to distinguish periodontitis Stage I and II from Stage III and IV is important for dental practitioners without periodontal educational backgrounds to recognize severity and complexity of periodontitis cases so that they can treat the mild periodontitis cases themselves and refer the severe cases to specialists.

The purpose of this multicenter study was to examine variations in periodontitis classification among dental practitioners with different postgraduate educational backgrounds at the University of Maryland School of Dentistry (UMSOD) and the Loma Linda University School of Dentistry (LLUSD) using the current classification. The findings can provide feedback on the newly developed current classification system and critical implications for dental educators in training both dental practitioners with and without a periodontal background. The following research questions (RQs) were set to make such contributions.

RQ1: Are there any significant differences in periodontitis classification between the groups of dental practitioners with or without a periodontal background when using the current classification system?

RQ2: Are there any significant differences in the accuracy of periodontitis classification between the two groups of dental practitioners in comparison with the correct stage and grade that the expert panel agreed on each case?

RQ3: Does familiarity with the current classification system influence the accuracy of the classification of each case?

Methods

Ethical approval

This study was conducted under a protocol approved by the institutional review board at the University of Maryland Baltimore (HP-00085364) and at Loma Linda University (5190255).

Study design and data collection

A survey instrument in this study was developed by the principal investigator (PI), Se-Lim Oh (SO), and Yoon Jeong Kim (YK). Three periodontitis cases (Cases 1, 2, and 3) were selected by SO and YK from the patient database in the UMSOD and the LLUSD, which were considered to represent three different stages of periodontitis. The three cases were “real-life” cases without any alterations or modifications in the patients’ data. Each case presentation contained a brief medical and dental history, a full-mouth periodontal chart, intraoral clinical photographs, and intraoral complete radiographs without patient personal identifiers. An expert panel, comprised with the PI (SO) and two board-certified periodontists, reviewed the three cases in this study and drew a consensus agreement on Stage and Grade for each case.

The survey instrument in this study contained these three periodontitis cases, the guideline of the current periodontitis classification, and a questionnaire including closed and open-ended questions. The guideline for the new classification used in this study was taken from Tonetti et al. [11]. The three cases are available as the Additional file 1: Supplementary Material. Table 1 shows the questionnaire used in this study and the expected answers for each case.

Table 1 Questionnaire used in this study and expected answers for each case

This cross-sectional observational study included two cohorts: dental practitioners with periodontal backgrounds and dental practitioners with other educational backgrounds. Faculty and postgraduate (PG) dental residents in periodontics at the UMSOD and the LLUSD and all PG dental residents in other dental disciplines at the UMSOD participated in this survey. PG dental residents had graduated from dental schools and entered additional specialty training programs. The participants were asked to review each case with the provided guideline and fill out the questionnaire independently with no time restrictions. No personal identifiable information of the participant was associated with responses, and only codes for subgroups were used. Data collection was conducted from July 2019 to January 2020.

Study sample

Table 2 summarizes the participants from each dental school. A total of 64 participants were included. In the periodontal cohort (n1 = 31), faculty and PG residents from the two schools were recruited, while PG residents only from the UMSOD were recruited in the nonperiodontal cohort (n2 = 33).

Table 2 Summary of the participants (the number of participants)

The priori power analysis revealed that the current sample size allows us to detect the medium to large effect size of 0.3 to 0.4 with an adequate power of 0.8 to 0.85 when either chi-square or Fisher’s exact test is used.

Statistical analysis

After examining descriptive statistics, Fleiss’ kappa and Kendall’s coefficient of concordance (W) were first calculated to gauge the level of agreement and concordance over eight items among the overall participants as well as within each cohort. To examine the differences in the responses between the two cohorts of interest with respect to each item, Fisher’s exact test was conducted [12, 13].

To examine the differences in the accuracy of the responses between the two cohorts, the responses were dichotomously scored. The “correct” diagnosis and classification were determined by the expert panel members whose agreement reached 100%. Then, the scored responses were compared between the two different cohorts using Fisher’s exact tests with respect to each item.

To examine the relation between the level of familiarity with the current classification and the accuracy of the classification, polychoric correlations were calculated to address the nature of categorical responses that are ordinal. All analyses were conducted using R with the packages, polychor, psych, and irr [14,15,16]. A p value ≤ 0.05 was considered significant.

Results

No responders answered the extent with the stage for each case. Therefore, the extent was not included in the analysis.

Table 3 exhibits the number of participants and the distribution for levels of the acquaintance with the utilization of the current classification in each cohort. Seventy-four percent of the periodontal cohort exclusively used the current classification, while nearly 79% of the nonperiodontal cohort answered as either not aware of or not using the current classification. As such, the distributions of responses appear significantly different between the two groups (Fisher's exact test; p < 0.001).

Table 3 Distribution for levels of the acquaintance and the utilization of the 2018 classification among the participants (Fisher's exact test p < 0.001)

RQ1. Figure 1 shows the distributions of item responses on staging and grading with respect to each case among the participants. Fisher’s exact test for each item revealed that the distribution of item responses was significantly different between the periodontal and nonperiodontal cohorts with respect to only one item, the responses on grading for Case 1 (Fisher’s exact test; p = 0.01).

Fig. 1
figure 1

Mosaic plot of the staging and grading for each case among the participants (total sample = 64; participants with a periodontal background = 31; participates with a nonperiodontal background = 33). Note: The p-value for Fisher’s exact test is reported on top of each panel. The height of each box represents the proportion of each response category within a group, and the numbers in boxes represent the corresponding counts. The correct response categories are shown in gray. N/A refers to “not answered”

Across a total of six staging and grading items in three cases, the overall sample of 64 participants exhibited fair (0.21 < κ < 0.40) to moderate (0.41 < κ < 0.60) levels of agreement (\(\widehat{\kappa }\) = 0.34, z = 61.1, p < 0.001) and concordance (W = 0.61, df = 5, \({\chi }^{2}\)=195, p < 0.001). Among the periodontal cohort, the agreement and concordance levels were higher (\(\widehat{\kappa }\) = 0.41 z = 37.2, p < 0.001; W = 0.68, df = 5, \({\chi }^{2}\)=105, p < 0.001) than the corresponding coefficients calculated from the nonperiodontal cohort (\(\widehat{\kappa }\)=0.28, z = 25.6, p < 0.001; W = 0.57, df = 5, \({\chi }^{2}\)=93.6, p < 0.001). In particular, the difference in \(\widehat{\kappa }\) estimates between the two cohorts was statistically significant at the p = 0.05 level, indicating that a significantly higher level of agreement was observed within the periodontal cohort.

RQ2. The correct classification rates for each case are reported in Table 4 along with the correct identification of risk factors for periodontal disease. The scored data analysis revealed that fewer than 50% of the total participants were successful on three out of eight items: risk factor identification for Case 1, risk factor identification for Case 2, and grade for Case 3. Generally, there were no significant differences in accuracy between the two cohorts except for two items; grading in Case 1 and staging in Case 3. For these items, the performance of the periodontal cohort was significantly better than that of the nonperiodontal cohort (Fisher’s exact test; p = 0.03 for grading in Case 1 and p = 0.04 for staging in Case 3). There were no significant differences in the recognition of risk factors for each case among the two cohorts (Fisher’s exact test; p = 1.00 for Case 1, p = 0.22 for Case 2).

Table 4 Accuracy in the periodontitis classification and recognition of risk factors for periodontal disease among the participants (%)

RQ3. The polychoric correlations between familiarity with the current classification and each scored item are presented in Table 5. Two statistically significant correlations were found between familiarity with the current classification and staging in Case 3 (\(\widehat{\rho }\) = 0.52, SE = 0.16, z = 3.25) and risk factor identification in Case 2 (\(\widehat{\rho }\) = 0.32, SE = 0.18 z = 2.25). Given that the level of familiarity with the current classification was confounded by the periodontal background, we also calculated the same correlations within each group. Once the periodontal background was controlled for, some of the correlations were negative for the nonperiodontal cohort, in which none of the participants were using the current classification in their practice. This indicates who were not aware of or not using the guideline of the current classification performed better in periodontitis classification when the guideline was provided.

Table 5 Correlation between familiarity with the 2018 classification and participant performance on each item (Polychoric correlation coefficient)

Discussion

Although researchers have made efforts to develop new technologies to improve diagnostic ability [17,18,19,20], periodontal diagnosis and classification are still formulated based on clinical and radiographic data collected by individual practitioners. A practitioners’ ability to interpret and integrate the data obtained and critical thinking skills for clinical reasoning yields meaningful periodontal decisions [21]. The purpose of this study was to examine the variations in periodontitis classification among dental practitioners with different postgraduate educational backgrounds using the current classification.

We found that there was a fair level of agreement among all participants and the agreement level was higher among the periodontal cohort than the nonperiodontal cohort when the current classification was employed (Fig. 1). Although fair to moderate agreement was obtained, the accuracy was not at the satisfactory level, ranging from 31 to 83% at the most (Table 4). Even for the most straightforward case from the investigators’ point of view (Case 3), the grading accuracy was 36%. For only two items out of eight, the periodontal cohort demonstrated significantly better accuracy in periodontitis classification (grading for Case 1 and staging for Case 3).

Grading, especially for new patients, could be challenging because dental practitioners often do not have previous periodontal records, such as CAL or RBL. Calculating the amount of CAL or RBL over 5 years, which was suggested as direct evidence in the current classification [8], is difficult. Instead, dental practitioners often use the %bone loss/age index as indirect evidence. The goal of incorporating grading is to estimate the future risk of periodontitis progression and responsiveness to standard therapeutic principles to guide the intensity of therapy and monitoring [11]. Grading is also designed for estimating the potential impact of systemic health on periodontitis to promote comanagement of patient health with medical teams [11]. Having an accurate grade influences the management of the case, including the treatment goal, strategy, treatment modalities and/or sequence.

Identification of risk factors for periodontal disease is also difficult regardless of the educational background, as indicated by the low level of accuracy and the lack of a significant difference in the recognition of risk factors for the two cases (Cases 1 and 2) among the two cohorts. Risk factors, when present in an individual, increase the chance of developing the disease by modifying host responses to the etiology, bacterial plaque, in periodontal disease [22]. Although the guideline stated diabetes and smoking as a risk factor, many responders answered hypertension as a risk factor for periodontitis. This may indicate that the concept of risk factors for periodontal disease is not well understood among many participating dental practitioners in this study. Since risk factors play a role as grade modifiers in the grading system, emphasis on risk factors for periodontal disease in dental education is recommended.

Since the current classification was published in 2018, many dental practitioners with other education backgrounds were not familiar with the classification. Interestingly, when the periodontal background was controlled for, some of the correlations between familiarity and accuracy for staging and grading were even negative for the nonperiodontal cohort (Table 5). This implies that dental practitioners who were not aware of or were not using the current classification performed well in periodontitis classification when the guidelines were provided. It is noteworthy that this result is only generalizable for the population of dental practitioners who have nonperiodontal backgrounds and have never used the current classification. Among this population, the familiarity level is not a key to classify periodontitis cases.

A recent publication emphasized that Stage is a patient-based, not a tooth-based concept [23]. The authors acknowledged that there is a gray zone for the clinicians to use clinical judgement for certain patient cases. Therefore, obtaining all necessary information including patient’s medical history, radiographs, and a full mouth periodontal charting is important [23]. The critical information for clinicians to determine staging and grading for patients is CAL, etiologies for CAL, % RBL, and patient’ age, which dental practitioners should be able to interpret.

The generalizability of our study results is limited due to a few factors. The sample size was relatively small. Only PG dental residents from the UMSOD were included in the nonperiodontal cohort while the periodontal cohort was comprised from both universities. All dental education for undergraduate and postgraduate training in the US should follow the Commission on Dental Accreditation guideline. Therefore, the training for specialists in both universities is similar although they are not the exact same; PG periodontics programs in both universities use the current periodontitis classification. The PG dental residents from the UMSOD in the nonperiodontal cohort was included because the PI confirmed that they did not receive the formal education on the current periodontitis classification, while PG dental residents in some programs from the LLUSD have been implementing the current classification in their training. The number of cases and items for each case were small in the questionnaire. While the three patient cases were meant to represent different scenarios with the investigators’ intention, more cases and items related to each case are necessary to cover contents related to periodontal diagnosis and classification such as clinical and radiographic data assessments, local contributing factors for periodontal disease, and occlusal evaluation.

Conclusion

A fair level of agreement in periodontitis classification was observed among dental practitioners with different educational backgrounds when the 2018 classification was used. The periodontal group showed better agreement levels and partially better accuracy. Identification of risk factors for periodontal disease was difficult regardless of the educational background.