Background

Previous studies have evaluated concordance between dentists’ pre- and post-treatment observations in clinical practice [1], between dentists’ questionnaire responses to hypothetical scenarios and responses collected during clinical treatments occurring subsequently [2, 3], between different hypothetical scenarios [4], and between dentists’ questionnaire responses and previously-published clinical standards of care [5, 6].

For example, one study calculated agreement between estimated depths of primary caries lesions, before and after operative treatment, for 8351 lesions in actual clinical practice. Unadjusted percentage agreement between dentists’ pre- and post-operative depth assessments was 69% overall, 88% for lesions that were into the inner one-third of dentin and 52% for lesions in the outer one-half of enamel [1].

Other studies have reported concordance based on questionnaires or hypothetical scenarios alone, with no direct observation of clinical practice [4,5,6,7]. Few published reports have presented concordance between treatment choices selected by dentists in hypothetical clinical scenarios and treatment choices made by the same dentists in actual clinical practice. Heaven, et al., evaluated the association between dentists’ decision to repair or to replace existing restorations in three hypothetical scenarios with treatment provided to patients in comparable clinical circumstances [3]. Percent agreement was not estimated, but dentists who recommended repair rather than replacement in more of the hypothetical scenarios did indeed have higher rates of repair in clinical practice. In a different study, pre-treatment estimates of lesion depths at which dentists placed initial restorations during a clinical study were compared with lesion depths at which they stated that they would place restorations in hypothetical scenarios [2]. The primary interest was in estimating the percentage of restorations for which the actual caries depths were less than the threshold depths indicated in the hypothetical scenarios.

The current analysis was conducted as a part of a National Dental Practice-Based Research Network study, the Management of Dentin Hypersensitivity (MDH) Study. The objective of the MDH study was to evaluate the methods used by dentists to diagnose and treat dentin hypersensitivity (DH) [8, 9]. The National Dental Practice-Based Research Network (the Network) is a consortium of dental practices and dental organizations focused on improving the scientific basis for clinical decision-making and moving evidence into everyday clinical practice. The structure and function of the Network have been published previously [10], and detailed information about the network is available online [11].

The objective of the current analysis was to evaluate concordance between stated preferences in treatment recommendations for DH, based on practitioners’ responses to a pre-study questionnaire, with observed recommendations made by the same practitioners during subsequent actual clinical treatment of their patients who were enrolled in the MDH study, which served as the parent study for this analysis. The purpose of evaluating concordance was to aid in assessing the value of practitioners’ responses to questions regarding their typical practice methods, as these might be more easily obtained than observations based on actual patient visits.

Methods

The MDH study

Any general dentist or specialist who was enrolled in the Network at the full participation level was eligible to enroll in the study. Study practitioners completed all necessary human participant and conflict-of-interest training as required by the network.

Detailed procedures and inclusion and exclusion criteria for the MDH study have been previously reported [8, 9]. Briefly, the study was reviewed and approved by the institutional review board of the home institution of the lead investigator (D.K.K.), as well as by those for each of the 6 administrative regions within the network. After enrolling in the study, practitioners completed an online questionnaire before enrolling patients. The results of the pre-study questionnaire have been previously published [8].

The practitioners recruited dental patients from their respective practices who reported having sensitive teeth and subsequently received a diagnosis of DH. Each patient provided informed consent to participate in the study. Patient inclusion criteria included that the patient was willing to be available for the duration of the study; was 19 years old or older; reported having a sensitive tooth or teeth with a diagnosis of DH (third molars were excluded), and that the patient granted permission for follow-up [8, 9]. The exclusion criteria for patients were having a medical condition that could interfere with reliable pain reporting, having a chronic pain condition, having odontogenic pain, or reported having taken analgesics more than 3 times in the past week.

At the baseline visit, the practitioners completed the patient’s oral examination, confirmed the diagnosis of DH, completed a dental history form, and recommended, prescribed, or applied treatment for DH. Patients completed pain assessment forms at the baseline visit and at 1-, 4-, and 8-weeks post-visit. Sample size considerations were based on precision of estimation of percentages, adjusted for the effect of clustered sampling due to enrollment of multiple patients per dental practice. On the basis of this analysis, the target sample size of the study was set at approximately 180 practitioners and 1900 patients.

The agreement study

The current study is a cross-sectional analysis of agreement between: (1) practitioners’ characterization of their typical treatments for DH as specified on the MDH study pre-study questionnaire; and (2) summary rates of treatment selections made during the baseline examination visit, calculated from each practitioner’s sample of patients enrolled in the MDH study.

Practitioner and practice data: the enrollment questionnaire

Practitioners completed a Network enrollment questionnaire that collected information about themselves and their practices. The questionnaire is publicly available online [12]. Selected practitioner and practice characteristics were used to explore potential associations with agreement between predicted (typical) and observed treatment recommendations.

Treatment recommendations, predicted: the practitioner questionnaire

The initial stage of practitioner participation in the MDH study comprised completing an online questionnaire regarding methods used to diagnose and manage or treat DH, as well as the practitioner’s judgment regarding factors that predispose a patient to DH. The network’s applicable Institutional Review Boards approved the study. All participants provided informed consent after receiving a full explanation of the procedures.

Development and testing of the questionnaire have been previously reported [8]. Briefly, the questionnaire was pilot tested using six dental practitioners who evaluated the length, acceptability and internet browser compatibility. Following revision based on this initial review, test-retest reliability of the questionnaire was evaluated for 24 additional practitioners [8].

Respondents were asked to select all procedures they routinely use to manage DH from the following list: (1) Fluoride products (includes gels, varnishes pastes and rinses), (2) Desensitizing OTC potassium nitrate toothpastes, (3) Glutaraldehyde/HEMA products, (4) Bonding agents, (5) Sealants, (6) Restorative treatments, (7) Lasers, (8) Oxalates, (9) No treatment, (10) Advice, and (11) Other. These are termed “treatment modalities” on the questionnaire. “No treatment” principally included dentists’ advice and counseling regarding modifying toothbrushing rigor and type of toothbrushes or toothpaste type, reduction in the ingestion of acidic foods and drinks or discontinuing the use of tooth-whitening products.

Following the selection of individual treatments, the practitioners were asked to rank their first, second and third choices of the items they had selected. Results of the questionnaire have been previously reported [8]; the questionnaire is publicly available online [13].

Treatment recommendations, observed: the patient examination form

On patient examination forms, practitioners selected “treatment recommended and/or prescribed” from a provided list and were instructed to check all applicable items. The list items were the same as those on the Questionnaire, except that there were separate items for five fluoride products. For analysis, these were collapsed into a single “fluoride products” category in order to correspond to the single questionnaire item for these products.

For each dentist, frequencies were tabulated for each treatment recommendation. Percentages of the dentists’ total recommendations were calculated for each recommendation. These percentages were ranked for each dentist to identify the most common and the top three recommendations. If two treatment recommendations were tied for most frequently used, then both of these were coded as the first choice for that dentist. These would be evaluated separately as agreement or disagreement with the questionnaire responses.

The patient examination form is publicly available online [14]. The practitioner questionnaire, baseline patient examination form and STROBE checklist for the Agreement Study are included as Additional files 1, 2 and 3.

Statistical analysis

Data analysis was conducted using SAS® Release 9.4 software.

Frequencies of each product/treatment recommendation were tabulated for the patients who were examined by each practitioner, and these were ranked to identify the practitioner’s first, second and third most frequently recommended treatment modalities. Dichotomous variables were coded to identify whether each item was the first-choice treatment recommendation and whether each item occurred among the top three most-used recommendations for each dentist.

Agreement was calculated between (1) dentists’ typical product/treatment recommendations (based on the pre-study questionnaire) with (2) the observed rankings based on these same dentists’ actual treatment recommendation (as recorded during their patient examinations). Agreement was evaluated separately for each listed product/recommendation, for the product/recommendation indicated as the dentist’s first-choice and for products/recommendations indicated as within the dentist’s top three recommendations. Unadjusted percentage of agreement was used to characterize concordance between survey (typical) and (actual) clinical usage. Cohen’s kappa statistic was used to evaluate agreement beyond that expected by chance. Point estimates and 95% confidence intervals were calculated for each measure.

Secondary analyses were conducted to evaluate practitioner and practice characteristics as potential predictors of agreement on the top three treatment recommendations. The outcome variables for these analyses were variables representing agreement between survey responses and clinical use for each treatment recommendation occurring among the three most common choices. The practitioner/practice characteristics that were evaluated as potential predictors were practitioner gender, race (categorized as white versus other due to small observed numbers of non-white respondents), age group (27–39, 40–49, 50–59, 60–73), full-time versus part-time, practice location (Inner city of urban area; Urban, not inner city; Suburban; Rural), and number of practice locations. Associations between practitioner/practice characteristics and agreement were evaluated using the chi-square statistic, separately for each treatment recommendation. No adjustment for multiple analyses was used.

Results

Practices/practitioners and patients

A total of 185 dentists from all six network regions completed the practitioner questionnaire. Of these, 171 completed patient clinical examination forms for a total of 1862 patients and were included in the current study. Characteristics of the practitioners and patients have been previously reported [8, 9]. Patient ages ranged from 19 to 87 (median = 45).

The median number of patients enrolled per practitioner was 12, with enrollment ranging from 1 to 16 patients. Six practitioners (3.5%) enrolled a single patient each, and 21 practitioners (12.3%) enrolled three or fewer. Sixty six practitioners (38.6%) enrolled 16 patients each.

Regional distribution of the practitioners was relatively uniform, with 16.4–17.0% of the total from each of the Midwest, Southwest, South Central and South Atlantic regions. 14.6% were from the Western region and 18.1% were from the Northeast region.

Agreement of survey responses with clinical recommendations

Percent agreement and Cohen’s kappa statistic, representing agreement beyond that expected by chance, are presented in Table 1 for first-choice treatment recommendations. Desensitizing OTC potassium nitrate toothpaste was the product that was most often selected by the dentists as their first choice recommendation, with 84 dentists (49.1%) choosing this item. Of these, 49 dentists (58.3%) also indicated this recommendation most often in the clinical examinations. Overall agreement on this recommendation was 63.7% (95% CI: 56.1, 70.9), with kappa = 0.27 (95% CI: 0.13, 0.42).

Table 1 Agreement on first-choice treatment recommendation for management of dentin hypersensitivity

Fluoride products were selected on the questionnaire by 62 dentists (36.3%) as their first choice. Among these, 33 (53.2%) indicated fluoride products most often in the clinical examinations. Overall agreement for the recommendation of fluoride products was 63.2% (95% CI: 55.5, 70.4) and kappa = 0.22 (95% CI: 0.07, 0.37).

The other recommendation options were selected as their first choice by 0 to 6 dentists (0 to 3.5%), which yielded relatively high percent agreement with observed rates for non-selection of these options. The lower 95% confidence limit for kappa includes zero for all but one recommendation, indicating no agreement beyond what would be expected by chance alone. The single exception was for use of lasers, which showed kappa = 1.0 (95% CI: 1.0, 1.0), based on only one dentist selecting this as the first-choice recommendation on the questionnaire. For this single dentist, laser treatment was also the most-used recommendation in the clinical data. There was complete agreement between the questionnaire and clinical observation for all other dentists regarding lasers not being their first choice of treatment. No dentists selected sealants as the first-choice product/recommendation, although for a single dentist this was the most commonly-indicated recommendation in the patient examinations.

Estimates of agreement with clinical recommendations for dentists’ indication of products/recommendations as being first, second or third choice options are shown in Table 2. Overall agreement was somewhat higher for indication of products/recommendations among the top three options. Fluoride products were selected on the questionnaire by 156 dentists (91.2%). Overall agreement with clinical recommendation of this product was 72.5% (95% CI: 65.2, 79.1), but kappa was only 0.08 (− 0.06, 0.22), indicating that agreement did not exceed that expected by chance.

Table 2 Agreement on treatment recommendation listed in top three choices for management of dentin hypersensitivity

Desensitizing OTC potassium nitrate toothpaste was selected by 150 dentists (87.7%), and overall agreement with clinical recommendations was 73.1% (95% CI: 65.8, 79.6), kappa = 0.26 (95% CI: 0.12, 0.41).

Glutaraldehyde/HEMA products, bonding agents, restorative treatments, and advice were indicated as top-three choices for at least 15% of dentists, and showed overall agreement ranging from 60.8% (95% CI: 53.1, 68.2; kappa = 0.0) for advice to 81.3% (95% CI: 74.6, 86.8; kappa = 0.29) for bonding agents. Fewer than 10% of dentists selected any of the other options as among their first-, second- or third choices.

Practitioner/practice characteristics and agreement

Only two practitioner characteristics showed statistically significant associations with agreement. Agreement on the recommendation of OTC potassium nitrate toothpaste differed by number of practice locations, with 76.2% agreement for practices having a single location, and 54.6% for practices with multiple locations (p = 0.0322). Agreement on the selection of advice among the top three recommendations differed by type of practice location. Inner city, urban (not inner city) suburban and rural locations showed 53.3, 38.3, 72.1 and 69.6% agreement, respectively (p = 0.0013). Results for all practitioner/practice characteristics are presented in Table 3 Appendix online [15].

Discussion

The network provides an opportune setting in which to evaluate concordance between dental practitioners’ reported treatment preferences and methods and treatment choices as observed in an actual clinical setting. Analysis of concordance is necessary in order to establish the validity of questionnaire data to describe typical clinical practice. Data from the MDH study provides a context for the evaluation of this for a specific clinical scenario, that of DH, and was chosen as the basis for the current study because the MDH study collected both questionnaire and clinical data and so provided the opportunity to evaluate concordance. All of the patients who were enrolled in the MDH study had been diagnosed with DH, so neither incidence nor prevalence of DH is estimable from that study. These measures are not necessary to the evaluation of agreement between the two modes of collection of information.

Fluoride products and desensitizing OTC potassium nitrate toothpaste were the items selected by the most dentists as first-choice and as first-, second- or third-choice recommendations for management of DH. However, agreement between pre-study hypothetical questionnaire responses with observed rankings of items recommended during actual clinical examinations was modest for these items. Relatively high unadjusted agreement, often above 90%, was observed for some items selected by only a small number of dentists. This seems to be largely the result of dentists who did not favor the use of these items on the questionnaire also not tending to use them in clinical examinations.

Guidelines for characterization of agreement based on ranges of values of Cohen’s kappa statistic are arbitrary, but are often presented. A commonly-used set of such ranges [16], suggests that kappa greater than 0.75 indicates excellent agreement, 0.40 to 0.75 as fair to good, and below 0.40 as poor. Based on this, agreement on all items as ranked choices evaluated in this study was poor, with the exception of that for lasers. Only one dentist indicated lasers as first-choice, and two dentists indicated lasers as in the top three choices, and agreement between the survey responses and clinical recommendations was almost complete, at 99.4 and 100.0%, respectively. The survey did not address the availability of lasers in the practices.

The practitioners’ responses to the pre-study questionnaire are presumably based on their impressions of their typical practice. Characteristics of the patients who were enrolled in the clinical study might have differed from this expectation, and this could have affected agreement. In particular, practitioners who have multiple practice locations might consider different patient groups when completing the questionnaire, but clinical data were collected at only a single practice, so patient characteristics might differ from those of the aggregate.

It is not surprising that indication of a product/treatment as being in the top three would show higher agreement than that for first-choice. The top-three category is a larger target than the first-choice alone, and there are three chances for inclusion of a particular treatment recommendation. It is also possible that patients might have previously received a dentist’s first-choice recommendation without success, so that the dentist recommends a second- or third-choice approach for this patient.

A high percentage agreement for products/recommendations that were indicated by only a few dentists primarily reflects good agreement for non-use of those items. Dentists who did not choose these items on the pre-study questionnaire also did not tend to recommend them in the patient examinations.

Formal comparisons among agreement estimates in different studies may not be possible because of differences in methodology across studies. Previous network studies have reported concordance between dentists’ questionnaire responses and actual treatment practices regarding the choice of whether to repair or to replace dental restorations [3], between self-reported treatment planning thresholds and clinical choices [4], between stated treatment thresholds and clinical pre-treatment depth estimates [2] and between pre-operative and post-operative estimates of caries lesion depths [1]. Nonetheless, comparisons with other network studies may be of interest. Agreement for the two most-frequent recommendations in the current study were 63.7% for OTC potassium nitrate toothpaste and 63.2% for fluoride products.

Higher agreement levels might be expected for pre- and post-treatment clinical observations of the same clinical entity, such as the same caries lesion. For example, in one network study that quantified the agreement between pre- and post-intervention estimates of caries lesion depth, agreement of 88% was observed for lesions extending to the inner one-third of dentin and 54% for lesions limited to the outer one-half of enamel [1]. However, excellent agreement has also been found between stated treatment thresholds and pre-treatment estimates of caries lesion depth for lesions that were on the proximal surface (98% agreement), but agreement for occlusal lesions was only 51% in the same study [2].

Conclusions

Agreement percentages between hypothetical, typical clinical scenarios (based on questionnaire data) and actual practice (based on clinical examination data) were generally high, but low values of Cohen’s kappa indicated little agreement beyond levels that would be expected by chance. Based on this, survey data and hypothetical scenarios could supplement observations based on actual examinations in studies of DH treatment recommendations, but these should not take the place of data based on clinical observations. This analysis adds to the larger work of the network which has observed a wide range of levels of agreement between hypothetical and actual care, depending upon the specific diagnosis or treatment under consideration.