Introduction

Sjögren's syndrome (SS) is the second most common rheumatic autoimmune disease, with an incidence rate of approximately 0.05–0.4% [1]. According to the American–European Group Consensus criteria, the 2012 American College of Rheumatology (ACR) criteria, and the 2016 SS classification criteria proposed by the American College of Rheumatology and the European League Against Rheumatism (ACR-EULAR) [2,3,4], labial salivary gland biopsy (LSGB) is a specific method to diagnose SS and has high clinical significance [5]. The sensitivity and specificity of LSGB in primary Sjögren’s syndrome (pSS) are both around 0.8 [6].

The diagnosis of pSS should be made on the basis of clinical manifestations, glandular dysfunction, laboratory tests, and LSGB. The common complications after traditional LSGB are local pain and incision dehiscence, and 6% of patients will experience persistent lip numbness after surgery. In contrast, the combined prevalence of permanent or potentially permanent neurological adverse events using the minimally invasive technique (incision of 23 mm) is less than 1/8 that using linear incisions (≥ 5 mm) [7]. The chalazion forceps has a wide clinical application and can be used to treat ophthalmic diseases [8, 9]. It can also assist in oral biopsy, especially for lip lesions [10]. Studies have shown that this device can be used in patients with pSS to simplify minor salivary gland biopsy [11, 12]. In a prospective study of 23 suspected pSS patients, LSGB was performed using scattered granuloma forceps. The results showed that this technique helped to achieve superior yield, ensured adequate glandular sampling for histopathological analysis, and reduced complications associated with traditional techniques [13].

The main objective of this study was to perform LSGB in patients with suspected SS and to assess the value of LSGB conducted with a chalazion forceps compared to traditional LSGB. This study describes the clinical features and glandular dysfunction in patients who underwent biopsy and the effects of modified surgery on LSGB pathological results, incisions, specimens, pain scores, and wound healing. It analyses the factors influencing the diagnosis of pSS and compares the two surgical methods on the match rate between the LSGB pathological results and the final clinical diagnosis.

Methods

Patients

This was a prospective randomized controlled study that included all suspected SS patients over 18 years old who visited the outpatient clinic and were admitted to wards of the Department of Rheumatology of Hospital Affiliated with Zunyi Medical University. Between May 2021 and December 2023, a total of 217 patients treated with LSGB were included in the study. Patients were randomly grouped using a simple randomisation method based on computer-generated random numbers prepared by a statistician not involved in conducting the trial, singularly in the conventional lip gland biopsy group and doubly in the modified lip gland biopsy group, with 92 cases in the conventional lip gland biopsy group and 125 cases in the modified lip gland biopsy group.

LSGB technique and histological parameters

Option 1: Procedure for traditional LSGB

Ninety-two patients with suspected SS were randomly selected for LSGB. This operation was performed by one experienced stomatologist using a linear incision. The procedure was as follows: Routine disinfection was performed, followed by local anaesthesia of the surgical area. The patient’s lower lip was immobilized to expose the surgical site, and a surgical blade was used to create a long fusiform incision from the mucosa to the muscle layer of the lower lip. The gland was bluntly dissected, placed in fixative solution, and examined. The surgeon aimed to harvest at least four minor salivary glands. If the minor salivary glands were too small (<2 mm), six glands needed to be harvested, with a minimum glandular surface area of 8 mm2 [14, 15]. The incision was sutured, the patient was given discharge instructions after the surgery, and the stitches were removed after the patient returned to hospital 1 week later.

Option 2: Procedure for modified LSGB using chalazion forceps

Another 125 patients with suspected SS were randomly selected for LSGB, which was performed by one experienced rheumatologist using a linear incision. The procedure was as follows: Using chalazion forceps (Xinhua Surgical Instrument Co., Ltd., lot number: 249142), the surgeon first turned spiral button upwards and loosened it to select the site for specimen collection. Then, spiral button was turned to slowly compress fixation pressure plate and fixation tray plate to clamp and fix the lower lip. The surgeon gently everted fixation pressure plate to fully expose the surgical site within the scope of ring pressure plate, the excess saliva was wiped away with gauze, and the site of salivary exudation was observed for local anaesthesia and biopsy (Fig. 1). Other treatments were the same as described above.

Fig. 1
figure 1

Schematic of the procedure using chalazion forceps-assisted labial salivary gland biopsy (Red arrow: salivary glands)

Histopathological analysis was performed by two experienced pathologists. The grading criteria for pathological manifestations were as follows: the standard for positivity was based on the Chisholm rating, i.e., the degree of lymphocytic infiltration was ≥ 1 focus/4 mm2 (a focus was defined as an aggregation of at least 50 lymphocytes in the labial gland interstitium within 4 mm2 of tissues) or the focus score (FS) was ≥ 1. The diagnosis of SS can be considered when the pathology is positive [6].

Dryness symptoms and glandular dysfunction

At the time of biopsy, the patients were asked whether they had symptoms of dry mouth or dry eyes. According to the inclusion criteria of the 2016 ACR-EULAR: patients with at least one of the symptoms of dry eyes or dry mouth, i.e., those who satisfied at least one of the five descriptions, were considered positive. The Schirmer I test (SIT) and tear film break-up time (BUT) were used to test for dry eyes, and salivary gland emission computed tomography (ECT) was used to evaluation of salivary gland dysfunction. The definition of glandular dysfunction was as follows: SIT <5 mm within 5 min using standardized sterile test paper (Ophtalmos, Sao Paulo, Brazil), and/or tear film rupture occurring within 10 seconds using 0.125% fluorescein solution, and/or salivary glands not sensitive to acidic substance stimulation during dynamic imaging, no radioactivity or only a small amount of radioactivity in the oral cavity, and no decrease in the time–radioactivity curve and no increase in the oral curve after acidic substance stimulation. The above evaluations were performed by a physician specializing in radiology [16].

Clinical parameters

Information on patient demographics, autoantibodies, and clinical manifestations was obtained from medical reports. The autoantibodies evaluated were anti-nuclear antibodies (ANA) (note that ANA ≥1:320 was defined as positive), which were measured using indirect immunofluorescence. Rheumatoid factor (RF) (RF>20 IU/ml was defined as positive) was measured by the turbidimetric method, and anti-Ro/Sjögren syndrome-antigen A (anti-Ro/SSA), anti-La/Sjögren syndrome-antigen B (anti-La/SSB), anti-Ro52 autoantibodies (anti-RO52), anti-major centromere autoantigen (anti-CENP)-B and anti-centromere antibodies (anti-ACA) were detected using an ANA Test Kit. The detections were performed strictly according to the instruction manual, and the results were evaluated using the EUROLineScan software. These antibodies were evaluated using only the European immunoassay. The intensity of each reaction with positive and negative controls was indicated as follows: “-”, “+”, “++” and “+++” (if the assays all showed a signal of “+”, “++” or “+++”, autoantibodies were considered to be present).

The surgical duration of traditional LSGB was the time from the start of local anaesthesia to the end of suturing (in minutes). The duration of modified LSGB using the chalazion forceps ranged from the start of chalazion forceps use before local anaesthesia to the end of suturing (in minutes). Only one piece of gauze was used for haemostasis during the operation. Postoperative bleeding was estimated based on the final weight of the gauze (g) minus the starting weight of the gauze. Incision pain at 24 h after surgery was evaluated by an oral interview (such as a telephone interview) using the 0–10 numerical rating scale (NRS). There were four score strata: no pain: 0; mild pain: 1–3; moderate pain: 4–6; and severe pain: 7–10 [17]. The status of incision healing 7 days after surgery was divided into grades A, B, and C according to the degree of incision healing after suture removal. Grade A healing referred to initial healing with an excellent outcome and no adverse reactions. Grade B healing referred to poor healing; inflammatory reactions, such as swelling, induration, haematoma, and effusion, occurred at the healing site, but there was no suppuration. Grade C healing referred to incision suppuration requiring incision and drainage.

The surface of the glands was evaluated using an Olympus BX50 microscope. The diameter d of the actual field of view (FOV) was 22 (number of FOVs)/10 (magnification of the objective lens) = 2.2 mm, and the area s of the actual FOV was πr2=π1/4d2 ≈ 4 (mm2) [18]. According to this formula, if 4× (magnification of the objective lens) was used, the diameter d of the actual FOV=5.5 mm, and the area s of the actual FOV ≈ 24 (mm2).

According to the 2016 ACR-EULAR classification criteria for pSS, any patient meeting the following conditions can be diagnosed with SS: satisfying one or more of the inclusion criteria, not satisfying any exclusion criterion, total score on the five items ≥ 4 points, and an expert opinion of pSS positivity [4]. If the patient met the diagnosis of SS and had underlying disease (such as any connective tissue disease), the patient was classified as having secondary Sjögren syndrome (sSS); otherwise, the patient was classified as having pSS [2].

Statistical analysis and ethical issues

SPSS 27.0 software was applied for statistical analysis. The t test was used for comparison between groups of normally distributed measures, the Mann–Whitney rank sum test was used for comparison between groups of non-normally distributed measures, and the chi-square test and Fisher's exact probability method were used for comparison between groups of qualitative data for statistical analysis. P < 0.05 was considered to indicate a statistically significant difference. This study was approved by the Ethics Committee for Clinical Application of Medical Technologies of Zunyi Medical University (approval number: 20210135).

Results

Clinical characterisation of patients with two surgical procedures

The traditional and modified LSGB groups had a similar age distribution, sex, dry mouth and/or dry eye rate, anti-ANA titre, anti-Ro/SSA, anti-La/SSB, anti-RO52, anti-CENP B and/or anti-ACA, systemic involvement (joint pain, nerve, glandular enlargement), combined thyroid disease, RF, immunoglobulin G (IgG), complement C3, erythrocyte sedimentation rate (ESR), C-reactive protein (CRP), white blood cell count (WBC), platelet count (PLT), alanine aminotransferase (ALT) and/or aspartate aminotransferase (AST), FS ≥ 1, SIT and/or BUT, salivary gland ECT (p > 0.05). The incision length, operative time, amount of blood loss, 24 h pain score, and Grade A incision healing at 7 days after surgery in the modified group were better than those in the traditional group (p < 0.01). There was one case of gland collection failure in each of the two groups. The diameter of the glands collected (≥ 2 mm) and the surface area of the glands (≥ 8 mm2) were not significantly different (p > 0.05), but the traditional group had significantly more glands (≥ 3) (p < 0.01). The two groups had similar proportions of individuals diagnosed with pSS (p > 0.05). The complete data are shown in Table 1.

Table 1 Clinical characterisation of patients with two surgical procedures

Clinical characterisation of patients with and without pSS

For the clinical indicators with statistically significant differences shown in Table 1, the influence of sSS needed be excluded. Further analysis of the clinical parameters of the groups with and without pSS revealed that these groups differed in the rate of dry mouth, the rate of dry eyes, the rate of anti-ANA titre ≥ 320, anti-Ro/SSA, anti-La/SSB, anti-RO52, RF, the WBC count, the proportion of patients with FS ≥ 1, incision length, and the number of glands (≥ 3) (p < 0.05). There was no significant difference in surgical methods, interstitial lung disease (ILD), Serum creatinine (Scr), 24 h urine protein quantification between the pSS group and the non-pSS group (p > 0.05). The complete data are shown in Table 2.

Table 2 Clinical characterisation of patients with and without pSS

3.3 Analysis of factors influencing primary Sjögren’s syndrome

Univariable and multivariable logistic regression analyses revealed that sex, age, and surgical method were not independent predictive factors for the diagnosis of pSS (p > 0.05). Univariable analysis showed that the number of glands (≥ 3) (odds ratio (OR) = 2.431, 95% confidence interval (CI) 1.347-4.386, p < 0.01), incision length (OR 3.983, 95% CI 1.569-10.112, p < 0.01), and anti-ANA titre ≥ 1:320 (OR 8.75, 95% CI 3.4232-22.31, p < 0.01) were independent predictors for the diagnosis of pSS. Other variables that were determined to be significantly different above were still associated with the diagnosis of pSS (p < 0.05). Multivariable logistic regression analysis revealed that only dry eye (OR 11.233, 95% CI 1.579-79.914, p < 0.05) and anti-ANA titre ≥ 1:320 (OR 19.783, 95% CI 1.326-295.084, p < 0.05) and FS ≥ 1 (OR 1986.159, 95% CI 97.154-40603.688, p < 0.01)) were independent predictors for the diagnosis of pSS. The complete analysis is shown in Table 3.

Table 3 Analysis of factors influencing diagnostic results in 190 patients

Accuracy analysis of LSGB pathological results in different surgical methods

To analyse the accuracy of the diagnosis of pSS based on the LSGB pathological results from different surgical methods, after excluding 27/217 patients with sSS, the other 190 patients were studied. Among these 190 patients, two had failed LSGB(1 case of pSS by expert opinion in the traditional surgery group and 1 case without SS in the modified group), and the diagnosis of SS in 85 cases was ruled out by expert assessment. In the traditional surgical group, 44 patients had positive biopsy results (FS ≥ 1) (according to expert opinion, all 44 of them had pSS); 38 patients had negative biopsy results (FS < 1) (according to expert opinion, six of them had pSS and 32 did not have SS). The sensitivity of traditional LSGB pathological results to the diagnosis of pSS was 88.00%, and the specificity was 100%. The positive predictive value (PPV) was 100%, and the negative predictive value (NPV) was 84.21%. The accuracy of the diagnosis was 92.68%. In the modified surgical group, 46 patients had positive biopsy results (FS ≥ 1) (according to expert opinion, 43 of them had pSS, and three patients did not have SS); 60 patients had negative biopsy results (FS < 1) (according to expert opinion, 10 of them had pSS, and 50 did not have SS). The sensitivity of modified LSGB pathological results to the diagnosis of pSS was 81.13%, and the specificity was 94.34%. The PPV was 93.48%, and the NPV was 83.33%. The accuracy of the diagnosis was 87.74%.

3.5 Evaluation of the consistency between LSGB pathological positive results and expert opinions and diagnostic criteria

According to the pSS classification standard in the 2016 ACR-EULAR, complete data for patients with sSS and patients with pSS (final diagnosis) are shown in Table 4. In the traditional group, the consistencies of positive biopsy results (FS ≥ 1) with both expert opinion and ACR-EULAR diagnostic criteria were high and significant (p < 0.05), but the consistency with expert opinion (kappa = 0.851) was higher than that with ACR-EULAR criteria (kappa = 0.591). In the modified group, the consistencies of positive biopsy results (FS ≥ 1) with both expert opinion and ACR-EULAR diagnostic criteria were high and significant (p < 0.05), but the consistency with expert opinion (kappa = 0.805) was higher than that with ACR-EULAR criteria (kappa = 0.728).

Table 4 Diagnosed with pSS according to the 2016 ACR-EULAR

Discussion

The diagnosis of SS is challenging because of its complicated clinical manifestations and nonspecific signs. LSGB plays an important role in the diagnosis of SS, especially in patients who are negative for anti-SSA and anti-SSB antibodies. The current traditional LSGB surgical methods have the following deficiencies: (1) there is no fixed device, and an assistant is needed to fix the lower lip; due to the slippery mucosa of the lip, the fixation by hand can easily become unstable; (2) intraoperative bleeding may occur due to the abundant blood supply of the lip, which can affect the surgical field of vision; (3) the incision is large and prone to complications such as infection, suture dehiscence, granulomas, keloids, and lip numbness; (4) in some patients, the lip gland atrophies, which is not conducive to finding the cause of surgical failure. To address the above technical issues, in this study we proposed an improved LSGB technique using a chalazion clamp, which has the following advantages: (1) the fixed pressure plate and fixed tray plate can be inserted and fixed directly from the inner and outer sides of the patient's lower lip, with simple and quick operation and stable fixation [10]; (2) Ring pressure plate is conducive to hemostasis, exposing the gland, and maintaining a clear surgical field; (3) the elliptical tray is conducive to lifting the gland upward and only needs to open the mucosal superficial layer to expose the existing gland; (4) the front part of the fixed pressure plate is equipped with an adjustable spiral button, which can adjust the appropriate pressure and tightness according to the thickness of the patient's lip, with strong controllability. (5) The chalazion forceps have three sizes, and the elliptical hollow metal ring pressure plate at the front end of the device has three sizes (12 mm × 21 mm, 10 mm × 17 mm, 11 mm × 11 mm), which is conveniently chosen based on the size of the incision. In view of the dispersion of the lesions, it is recommended to obtain at least four LSGs; three respondents have advocated the use of fewer glands (two to three), and two respondents recommend using a greater number of glands (five to seven) [15].

This study showed that compared with the traditional surgery group, the modified surgery group had a smaller incision, less bleeding, shorter operative time, milder postoperative pain, and better incision healing at 7 days after surgery. The results also revealed that the LSGB pathological results in the different surgical method groups were similar; there was no significant difference in the diagnostic rate of pSS between the two groups. There were significant differences in some clinical parameters between the modified and traditional groups, such as ILD, Scr, and urine protein quantification, and consequently, these indicators should be excluded to avoid biasing the final diagnosis. This study excluded sSS patients from the follow-up analysis. Among them were 19 patients with rheumatoid arthritis (RA), five patients with systemic lupus erythematosus (SLE), one patient with systemic sclerosis (SSC), one patient with antineutrophil cytoplasm antibody (ANCA)-associated vasculitis, and one patient with Behçet's disease. Any of these diseases may affect some clinical parameters. The prevalence of sSS is 5–30% in RA patients and 14–17.8% in SLE patients [19, 20]. Our results showed that there was no significant difference in ILD, Scr, or 24 h urine protein quantification between the pSS group and the non-pSS group. Moreover, the incision length and the number of glands were significantly increased in the pSS group compared with the non-PSS group (p < 0.05), and the univariate analysis showed that the incision length and the number of glands were independent risk factors affecting the diagnosis of pSS; therefore, an incision length of approximately 1 cm and a number of glands ≥ 3 were significantly correlated with the diagnosis of pSS. However, according to multivariable analysis, only FS ≥ 1, dry eyes and an anti-ANA titre ≥ 1:320 were significantly associated with the diagnosis of pSS [21].

The accuracy of LSGB pathological results from different surgical methods for the diagnosis of pSS showed that the sensitivity of LSGB from the two surgical methods was 81.13–88.00%, and the specificity was 94.34–100%. The accuracy of the diagnosis was 87.74–92.68%, which was consistent with earlier results [22]. The LSGB results of the two surgical methods were in good agreement with the expert opinions and the standard SS classification.

None of our enrolled patients experienced major complications, which is consistent with the findings of previous studies, indicating that the modified surgical method is a safe technique [23]. In the traditional surgery group, when the stitches were removed 7 days after surgery, 17.4% (16/92) patients had partial incision dehiscence, local mucosal congestion, swelling, and pain; however, mucosal healing can take up to 3 weeks. In the LSGB technique, the use of either loupes or a small operating microscope to assist in the biopsy has not been reported. Tsesis et al. suggested that the use of a small operating microscope in dentistry can achieve higher accuracy [24]. Whether the use of either loupes or a small operating microscope in the modified LSGB procedure can significantly improve the success rate of biopsies should be studied.

Conclusion

In summary, the small-incision technique of using auxiliary chalazion forceps collects more superficial glands with a small incision, which may explain the low prevalence of incision complications. It is preferable to take ≥ 3 glands to increase the positivity rate while ensuring that the body surface area and diameter of the glands are large enough. The sample of this study was small; therefore, case–control studies with large samples would reduce any bias in the findings.