Background

Spirometry flow-volume curve displays airflow versus volume during maximum forced inspiration and expiration, it can be influenced by various factors [1]. According to international standards, types of ventilatory defects can be inferred from the configuration of the curve, obstructive abnormalities are thought to be a concave shape on the curve, the curve of a restrictive ventilatory defect shows a convex pattern [2]. Furthermore, Miller and Hyatt [3] found that flattening on the inspiratory and/or expiratory phase of the curve could suggest upper airway obstruction. Flow oscillations referred to as a “saw-tooth” sign seen on curves are thought to be an indicator of obstructive sleep apnea [4]. The flow-volume curve display is diversified in patients with different diseases, as the disease progresses, so does the curve configuration.

Li et al. [5] first found a small plateau (SP) in the early phase of expiratory flow (mostly located at the phase from PEF to FEF50%). They identified the sign in 228 of 808 patients (28.2%) who completed bronchoprovocation test (BPT), about 196 (86%) of patients had negative-BPT indicating that patients with the sign maybe have a negative trend prevalence of airway hyperresponsiveness (AHR). Moreover, they found that patients with the sign had a prevalence of symptoms due to upper airway abnormalities, including severe cough and hoarseness when complete the methacholine challenge maneuvers, which indicated that the sign may be related to abnormal structure or function of the upper airway. Xie et al. [6] similarly found that patients were more often negative-BPT (95.2%) in a total of 124 subjects with SP sign who performed BPT, they further confirmed that patients with SP sign may less likely to have AHR. BPT is used to evaluate AHR which is most commonly associated with asthma [7]. However, due to a variety of reasons, BPT cannot be available in primary care settings. In addition, the test is time-consuming and usually requires specialists. At present, asthma has a rapid rise in prevalence in low-income and middle-income countries [8]. In high-income countries, it has high rates of exacerbation and re-admissions [9], which has been proved associated with lung function [10]. Some previous studies have focused on finding simple methods to evaluate AHR and diagnose asthma, like questionnaires and baseline spirometry parameters [11, 12]. The identification and application of the SP sign may have crucial value in predicting the absence of AHR to avoid unnecessary testing.

With the availability of portable spirometers in primary care, forced spirometry (FS) has become easily accessible to all levels of healthcare. In clinical practice, however, most non-professionals lack the ability or experience to identify the configuration of the flow-volume curve, which may lead to the need for a simple and effective approach for visual inspection of the curve. The application of artificial intelligence in medicine is rising quickly [13]. Deep learning is a branch of artificial intelligence that could provide a simple and useful tool to automate spirogram recognition [14]. Though it is possible that some simpler computer vision (CV) techniques could also solve the problem, deep learning-based CV methods are able to outperform those traditional CV and signal processing methods in most scenarios [15,16,17,18,19]. The reason behind this is that the deep learning model often encompasses a large parameter space and does not require hand-crafted features but instead automatically learns appropriate features. These characteristics give the deep learning model a stronger capacity to model the task and better generalization abilities compared to traditional methods. Since deep learning has produced promising results in tackling the display of the curve, we hypothesize that the application of deep learning is ideal for the automatic recognition of the SP sign.

In trying to analyze the potential use of the SP sign as a negative marker for AHR in patients with suspected asthma, we retrospectively reviewed the characteristics of the SP sign in BPT. Since FEV1 < 60% predicted is a contraindication for BPT, we also reviewed bronchodilator tests (BDTs) to involve patients with severe ventilatory defects. Furthermore, for patients who required laryngoscopy according to specialists’ decision, FS was performed after they have completed the laryngoscopy tests to assess the laryngoscopy-diagnosis upper airway abnormalities of patients with the sign. At last, we leveraged deep learning algorithms to automatically tackle the recognition problem of the SP sign.

Methods

This study was conducted at the First Affiliated Hospital of Guangzhou Medical University, using BPTs and BDTs data from October 2017 to October 2020 and FS data during the period 8 March 2021 to 26 March 2021. The study was approved by the Ethics Committee of the First Affiliated Hospital of Guangzhou Medical University (approval NO.: 2020124).

Pulmonary function tests (PFTs)

BPTs and BDTs were performed on MasterScreen-Pneumo (Jaeger, Germany) and QuarkPFT (COSMED, Italy). FS tests were performed by portable spirometer with U-BREATH PF680 (E-linkcare, China). Quality control of pre-and post- PFTs followed by guidelines, baseline spirometry should achieve three acceptable FEV1 and FVC measurements [1, 20]. Post-BPT and post-BDT acceptable FEV1 and FVC measurements were also needed. A positive-BPT required a 20% fall from the post-diluent baseline FEV1 after the last dose of methacholine [21]. A positive-BDT was identified when the percent change from baseline and absolute changes in FEV1 and/or FVC values ≥ 12% and ≥ 200 ml [2].

Study design

Four junior pulmonologists determined the SP sign based on a repeatable small plateau on the flow-volume curves. They independently labelled different portions of total curves (never the same) and all have experience in operating and interpreting spirometry for more than 2 years. If they have any doubts, then a senior pulmonologist with experience of more than 18 years would make the final decision. Representative examples of the SP sign were shown in Fig. 1, each BPT or BDT report has at least three flow-volume curves. Point A is the start point of the sign, point B is the endpoint. The data points of A and B were extracted using Engauge Digitizer 11.1 software (M. Mitchell, Engauge Digitizer). Volume (Vol) A/FVC and Flow A/PEF were original (x, y) data of point A; Vol B/FVC and Flow B/PEF were original (x, y) data of point B. (Vol A − Vol B) /FVC and (Flow A − Flow B) /PEF represent the width and height of the sign, respectively. Pre-PFT questionnaires and electronic health records were reviewed by the same authors to identify symptoms and PFT indications. To further evaluate the features of the sign, according to the width or height of the sign, subjects were stratified into four classes, respectively:

  • Class 1: (Vol A − Vol B) × 100/FVC ratio or (Flow A − Flow B) × 100/PEF ratio ≤ 10%

  • Class 2: (Vol A − Vol B) × 100/FVC ratio or (Flow A − Flow B) × 100/PEF ratio > 10% to ≤ 20%

  • Class 3: (Vol A − Vol B) × 100/FVC ratio or (Flow A − Flow B) × 100/PEF ratio > 20% to ≤ 30%

  • Class 4: (Vol A − Vol B) × 100/FVC ratio or (Flow A − Flow B) × 100/PEF ratio > 30%

Fig. 1
figure 1

Representative examples of the SP sign in BPTs and BDTs. a SP sign in positive BPT. Curve 1 = pre-BPT, SP sign (+); Curve 5 = post-BPT, SP (−). b SP sign in negative BPT. Curve 1 = pre-BPT, SP sign (+); Curve 6 = post-BPT, SP sign (+). c SP sign in positive BDT; Curve 1 = pre-BDT, SP sign (−); Curve 4 = post-BDT, SP (+). d SP sign in negative BDT; Curve 1 = pre-BDT, SP sign (+); Curve 4 = post-BDT, SP sign (+). Point A = the start point of the SP sign; Point B = the end point of the SP sign. SP = small plateau; BPTs = bronchoprovocation tests; BDTs = bronchodilator tests

The study randomly selected 50 subjects with the sign from each class to compare the PFT and clinical features. All the electronic health records were reviewed within 6 months of PFTs.

For patients who underwent laryngoscopy according to specialists’ decision in the department of otolaryngology, FS tests were performed after laryngoscopy was completed. Patients who had contraindications for PFTs were excluded according to the Chinese Thoracic Society guidelines [22].

Development of the model

3,453 (1953 SP sign positive and 1500 negative samples) and 374 samples (95 positive and 279 negative samples) were used for training and testing the model, respectively. 294 positive samples could not be used due to the model failure to extract the image. Inputs to the model were images extracted from collected PFT reports in pdf. format, to be specific, flow-volume curves were saved as vector graphics and could be extracted by developing Python scripts. All extracted curves were further processed to have a pixel size of 600 ∗ 1200 by using the OpenCV-Python package. A couple of examples of pdf and curve images can be found in Fig. 2. Each PFT report must have at least three flow-volume curves, only when repeatable (at least three curves) detecting of the SP sign, then the model would output the presence of the sign. In this work, the recognition of the SP sign was formulated as a detection task. To this end, we proposed SP Network (SP-Net), a Faster R-CNN [18] alike object detection model.

Fig. 2
figure 2

A couple of examples of pdf and input images. Inputs to the model were images extracted from collected PFT reports in pdf format. a Flow-volume curves of a woman (aged 64 years, height 162.8 cm) with pdf on the left, after pre-processing of curves by the model were used as input images on the right; b Flow-volume curves of a man (aged 25 years, height 165.5 cm). PFT = pulmonary function test

As shown in Fig. 3, given an input image, the classic ResNet50 [17] convolutional neural network was utilized to extract the feature map. Then a region proposal network would generate object bounds and objectness scores, i.e., the probability of a specific sign occurs and its position, according to the feature map in a sliding window fashion. Next, a region of interest pooling layer would extract a feature vector from the feature map for each of the proposals. Each feature vector was fed into a series of fully connected layers that finally branch into a classifier and a regressor which output: (1) Whether an SP sign was detected; (2) Bounding box positions.

Fig. 3
figure 3

Architecture of SP-Net. When given an input image, the classic ResNet convolutional neural network was utilized to extract the feature map. Then a region proposal network would generate object bounds and objectness scores. Next, a RoI pooling layer would extract a feature vector from the feature map for each of the proposals. Each feature vector was fed into a series of fully connected layers that finally branch into a classifier and a regressor which output: (1) Whether an SP sign was detected; (2) Bounding box positions. RoI = region of interest; SP = small plateau; SP-Net = SP-network

The model was firstly trained for 20,000 iterations with a learning rate of 1e-3 and then trained for another 10,000 iterations with a smaller learning rate of 1e-4. The batch size was set to 1 through all iterations. A stochastic gradient descent optimizer was utilized to train the model. Our training objective function was a multi-task loss following Fast R-CNN [23], which consisted of a classification loss and a regression loss for calculating the difference between model predictions and ground truth values.

Statistical analysis

Count data were analyzed by using the χ2 or the Fischer exact test, as appropriate. Median comparisons were performed by using the Kruskal–Wallis test, or Mann–Whitney U test when there were two groups. Mean comparisons were performed by using the One-Way ANOVA or independent-samples T-test, as appropriate.

The deep learning model was implemented using TensorFlow 1.14.0 and trained on 4 Tesla K80 GPUs. Other statistical analyses were performed with SPSS version 26.

Results

Of the 13,661 BPTs reviewed, 2,123 (15.5%) patients with SP sign were labelled. Of these, 1,782 (83.9%) were negative-BPT; 341 (16.1%) were positive-BPT (Fig. 4). Of a total 8,168 BDTs reviewed, there were 219 (2.7%) labeled as the SP sign. Of these, 194 (88.6%) were negative-BDT, 25 (11.4%) were positive- BDT (Fig. 5).

Fig. 4
figure 4

Prevalence of SP sign of all patients in BPTs. BPT (−) SP sign (+) was defined as patients with SP sign and had negative-BPTs; BPT (−) SP sign (−) was defined as patients without SP sign and had negative-BPTs; BPT (+) SP sign (+) was defined as patients with SP sign and had positive-BPTs; BPT (+) SP sign (−) was defined as patients without SP sign and had positive-BPTs. Classes were defined as follows: class 1 = (Vol A − Vol B) × 100/FVC ratio ≤ 10% in BPT (−); class 2 = (Vol A − Vol B) × 100/FVC ratio > 10% to ≤ 20% in BPT (−); class 3 = (Vol A − Vol B) × 100/FVC ratio > 20% to ≤ 30% in BPT (−); class 4 = (Vol A − Vol B) × 100/FVC ratio > 30% in BPT (−). BPTs = bronchoprovocation tests; Vol = volume; SP = small plateau

Fig. 5
figure 5

Prevalence of SP sign of all patients in BDTs. BDT (−) SP sign (+) was defined as patients with SP sign and had negative-BDTs; BDT (−) SP sign (−) was defined as patients without SP sign and had negative-BDTs; BDT (+) SP sign (+) was defined as patients with SP sign and had positive-BDTs; BDT (+) SP sign (−) was defined as patients without SP sign and had positive-BDTs. BDTs = bronchodilator tests; SP = small plateau

Characteristic PFT parameters

SP sign was calculated mostly located on the 90–70% PEF and 18–40% FVC expiratory phase. Compared with positive-BPT, subjects with a negative-BPT had a more obvious SP sign which appeared to be wider and higher in the pre-challenge spirometry (both P < 0.0001) (Table 1). Most of the width of the sign of subjects with a positive-BPT narrowed or even disappeared in the post-BPT. On the contrary, subjects with a positive-BDT mostly without the sign in the prebronchodilator spirometry, the sign appeared in the post-bronchodilator maneuvers. (Table 1).

Table 1 Characteristic parameters

Of the negative-BPT group, patients with SP sign were more often female and had a prevalence of obesity (BMI ≥ 30 kg/m2) (both, P < 0.0001) (Table 2). In pre-BPT and pre-BDT groups, compared patients without SP sign, patients with SP sign had higher median FVC, FEV1, FEF50%, FEF75%, and MMEF % predicted (all P < 0.0001); patients with the sign also had a higher median FEV1/FVC ratio (P = 0.009, P < 0.0001, P < 0.0001 and P < 0.0001, respectively) (Tables 2 and 3).

Table 2 PFT values of patients with pre-BPTs
Table 3 PFT values of patients with pre-BDTs

Pre-PFT questionnaires

Patients with SP sign had higher rates of chronic cough in BPT groups (both P < 0.0001) (Table 4). In addition, patients with the sign were more likely to have evidence of wheezing in positive and/or negative BPT and BDT groups (P = 0.033, P < 0.0001, P = 0.011 and P = 0.014, respectively) (Tables 4 and 5).

Table 4 Pre-PFT questionnaires of BPTs
Table 5 Pre-PFT questionnaires of BDTs

In negative-BPT groups, patients with the sign were more likely to have PFT indications of cough, bronchiectasis, rhinitis (sinusitis), gastroesophageal reflux and pneumonia (all P < 0.0001) (Table 4). In positive-BDT groups, PFT indications of asthma, rhinitis (sinusitis), and pneumonia had higher rates, but COPD had a lower rate within the sign (P = 0.002, P < 0.0001, P = 0.003, and P = 0.006, respectively); patients with the sign had higher rates of PFT indications of asthma, rhinitis (sinusitis), gastroesophageal reflux and pneumonia in the negative-BDT group (all P < 0.0001), on the contrary, a lower rate of PFT indication of COPD (P < 0.0001) (Table 5).

Baseline characteristics of the SP sign classes

In the width-stratified classes, of 1,782 subjects, there were 245 (13.7%) with class 1, 652 (36.6%) with class 2, 507 (28.5%) with class 3, and 378 (21.2%) subjects with class 4 (Fig. 4). Compared with class 1, subjects with class 4 were more often never smoker (P = 0.005) and with higher median FEV1% predicted and FEV1/FVC ratio (P = 0.007 and P = 0.003, respectively). Patients with class 1 had a higher rate of PFT indication of COPD compared to class 4 (P < 0.0001) (Table 6). In the height-stratified classes, there were 439 (24.6%) with class 1, 759 (42.6%) with class 2, 474 (26.6%) with class 3, and 110 (6.2%) subjects with class 4. Compared with class 2, class 4 were with higher median FEV1% predicted (P = 0.009). However, patients with class 1 had a higher FEV1/FVC ratio than other classes (P < 0.0001) (Table 7).

Table 6 Baseline characteristics of the SP width-stratified classes in negative pre-BPTs
Table 7 Baseline characteristics of the SP height-stratified classes in negative pre-BPTs

Laryngoscopy findings

A total of 48 patients who underwent the laryngoscopy completed the FS tests. Of these, 32 (66.7%) patients without SP sign and 16 (33.3%) patients with the sign. Patients with SP sign were less likely to have laryngoscopy evidence of chronic pharyngitis or normal upper airway compared with those without the sign (P = 0.038) (Table 8). They showed a higher percentage of upper airway abnormalities. Figure 6 shows examples of these abnormalities. Further analysis of the degree of upper airway stenosis [24] due to these findings, we found that the degree of stenosis of patients with SP sign were all within 0∼25%, however, patients without SP sign showed different degrees of stenosis, which distributed in ∼25%, 50%, 75%, and 90% (Table 9).

Table 8 Laryngoscopy findings associated with the SP sign
Fig. 6
figure 6

Examples of the laryngoscopy findings. a Epiglottic cyst; b vocal cord nodule; c tonsil mass, d hypertrophy of tonsils/adenoids; e supraglottic laryngeal cancer. The location indicated by the black arrow is the lesion

Table 9 Precision data summary

Automated recognition

An example of the output of our model is shown in Fig. 7. After running the inference, our model would draw a red bounding box to represent the detected position of the SP sign and also generated corresponding annotations. Figure 8 visualizes two case studies consisting of laryngoscopy findings and corresponding model outputs. In both normal (A) and vocal cord polyp (B) cases, our SP-Net successfully detected the SP sign.

Fig. 7
figure 7

Model output. The SP-Net drew a red bounding box to represent the detected position of the SP sign and also generated corresponding annotations. SP = small plateau; SP-Net = SP-network

Fig. 8
figure 8

Examples of the laryngoscopy findings and corresponding model outputs of the SP sign. a Normal; b vocal cord polyp. SP = small plateau

The performance of our proposed deep learning model was evaluated on the test set using five metrics. The evaluation showed that our model achieved 95.2% accuracy, 93.7% sensitivity, 95.7% specificity, 88.1% positive predictive value, and 97.8% negative predictive value (Table 10 and Fig. 9).

Table 10 Performance evaluation (N = 374 records)
Fig. 9
figure 9

Confusion matrix. 0: subjects without SP sign, 1: subjects with SP sign. SP = small plateau

Discussion

Visual inspection of the flow-volume curve is a simple and valuable approach to diagnose and evaluate diseases, such as obstructive sleep apnea, upper airway obstruction, and unilateral main-stem bronchial obstruction [25,26,27,28]. The SP sign is a common but previously not widely recognized configuration of the curve that adds value to classify airway responsiveness. It has been defined as the presence of “a small plateau of the early phase of expiratory flow on the flow-volume curve” [5].

The current study found that patients with SP sign were mostly negative-BPTs. Since a positive-BPT usually means patient with AHR, when spirometry showing an SP sign, that might indicate the patient is less likely to have AHR. This result was in agreement with the previous studies [5, 6]. The prevalence of the SP sign in BDTs was only 2.7%, of them, the rate of negative-BDTs was 88.6%, this outcome indicated patients with SP sign had lower airway reversibility. Additionally, we noted the disappearing or narrowing of the width of the sign in the post-BPT maneuvers of patients with a positive-BPT, while the SP sign presented in the post-BDT maneuvers of patients with a positive-BDT. As indicated above, these findings supported the SP sign less on the flow-volume curves with severe ventilatory defects, or the sign could be masked because of the methacholine challenge-induced decline of PFT values. The median FVC, FEV1, FEF50%, FEF75%, and MMEF % predicted were all higher in patients with SP sign, whereas Li et al. [5] found no significant difference values between patients with or without SP sign using BPT data. In theory, our findings were also confirmed in BDT data. Moreover, a recent study has evaluated baseline spirometry variables as markers for AHR, they found that all baseline spirometry parameters were significantly lower in the positive AHR group. In their study, FEF50% was proved to be a negative predictor for AHR [12]. In line with our study, SP sign also thought to be a negative marker, patients with SP sign presented higher FEF50% % predicted.

Further analysis of pre-BPTs questionnaires, we demonstrated that patients with SP sign were more likely to have PFT indications, including cough, rhinitis (sinusitis), and gastroesophageal reflux. On the contrary, the indications due to asthma and COPD were not significantly different compared to those without the sign. In theory, these diseases have been more often with symptoms like cough, sputum, and abnormal sensation of the throat, which were significantly associated with upper airway abnormalities.

The higher median FEV1% predicted and FEV1/FVC ratio in SP sign width-stratified class 4 compared to 1, demonstrated the sign presented more often in subjects with a normal-to-mild ventilatory defect. In addition, class 4 had lower rates of chronic cough and PFT indication of COPD, but had higher rates of PFT indications of bronchiectasis, chest pain, pleural effusion, and sleep disorder. These findings indicated that with the increase of the width of the sign, types of diseases were more complex and the symptoms were more diverse.

In 48 patients with laryngoscopy findings, we observed patients with SP sign had a lower prevalence of chronic pharyngitis or normal finding. They were more likely to have evidence of upper airway stenosis due to epiglottis cyst, hypertrophy of tonsils, vocal cord nodule, and throat tumor, etc. However, the degree of stenosis in patients with SP sign was mostly 0∼25%. In line with this tendency, Li et al. [5] observed the endoscopy findings of nine subjects with a negative-BPT and SP sign, they found that these subjects were excluded from severe central airway stenosis. These findings suggested that the SP sign may be an indicator to identify upper airway disorders.

Because the SP sign presented a good indicator to classify patients who were less likely to have AHR and the degree of severity of spirometric abnormality, we developed an SP-Net to automatically recognize it with an accuracy of 95.2%. The confusion matrix has shown that the SP-Net was sensitive (93.7%) when assessed on the test set (N = 374 cases). Six positive cases were wrongly detected negative because some curves have a concave shape at the same location of the SP sign. These shapes also appeared repeatedly in the different curves of one person. However, we did not know their physiological mechanism, fortunately, they were not common. A previous study proposed a neural network to detect upper airway obstruction caused by goiter using the flow-volume curve [29]. Artificial intelligence also helped with the interpretation and quality control of PFTs, together with the diagnosis of chronic respiratory diseases [14, 30, 31]. To the best of our knowledge, we were the first to use deep learning models for automatic detection of SP sign. Our object detection model not only could classify the image type, but could simultaneously locate the position of the SP sign. This additional feature could serve as interpretable guidance for the doctors to analyze and understand based on what visual cues the model classified a sample into the category of SP sign. It  was very promising that our proposed deep learning algorithms could serve as a simple tool to aid SP sign recognition in primary care. In the future, the model can be integrated into the software of lung function equipment, more tasks of recognition of specific configuration of flow-volume curves, such as a saw-tooth sign, upper airway obstruction, and unilateral main bronchial stenosis will be developed. It will help alert the non-specialists to the potential presence of some disorders which sometimes be misdiagnosed.

A major limitation of this study was acquired data retrospectively, which could not observe symptoms of patients during BPTs. Although the SP sign was mostly present in patients with a negative-BPT, they were more likely to have evidence of symptoms due to upper airway abnormalities. The symptoms during BPTs will need to be observed in future work. Another drawback of our study was a lack of results of AHR in patients with laryngoscopy findings. They were only assigned to complete a baseline spirometry test, but not a BPT, since performing a BPT takes more time and may bring a bad experience to the subject. In addition, the level of inter-rater variability of authors when labelled the spirometry files could not be calculated. Because no one has re-labelled files that others’ have already labelled. Finally, we only developed an approach to recognition SP sign using SP-Net, future studies should incorporate clinical manifestations, PFT indices, and laryngoscopy findings to assist clinicians in the classification of healthy people, patients with asthma and COPD, as well as the prediction of airway responsiveness.

Conclusions

We analyzed the prevalence, clinical, and lung function characteristics of the SP sign. The sign mostly presented in patients with normal-to-mild ventilatory defect and a negative-BPT. Patients with SP sign were less likely to have AHR and severe ventilatory defects. We propose the SP sign as an indicator to perform the classification of patients with asthma who are mostly with AHR and patients with COPD who are mostly with obvious airway obstruction. The application of the sign is ideal for primary care that cannot carry out BPTs. Therefore, we developed SP-Net to automate the recognition of it. Additional studies will need to further define the sign, including its airway dynamics and physiology mechanisms.