FormalPara Key Summary Points

Asthma exacerbation events remain difficult to avert and approaches for personalized medicine are lacking.

Each patient’s asthma journey and disease severity are influenced by a range of factors including medical history, biomarker phenotype, pulmonary function, level of healthcare system support, compliance to prescribed therapy, comorbidities, personal habits, and environmental conditions.

Machine learning (ML) uses mathematical and statistical methods to detect patterns across large datasets including electronic health records (EHR).

ML has the potential to augment clinical decision-making and provide appropriate treatment to improve both asthma prognosis and the overall quality of life.

In this review, we summarize recent studies that have demonstrated the ability to predict asthma exacerbations using different algorithms and propose a few next steps for this field.

Introduction

Asthma exacerbations are difficult to predict on the basis of available data of patients at risk. Most medical practitioners usually rely on the patient’s medical history, pulmonary function test (PFT) results, levels of asthma control, and treatments; however, it is unclear as to what proportion of clinicians follow asthma guidelines for the assessment of risk and prevention of exacerbations or use biomarkers [1, 2]. With such multiple variables, it is unsurprising that up to 10 million asthma exacerbations are reported annually in the USA, constituting the majority of expenditures devoted to asthma care [3].

Additionally, management of asthma is a complex process. Clinically, many patients can have poor asthma control and/or poor PFT results; however, they may not seek (or find) medical help and/or comply with treatments until they develop severe bronchoconstriction. Other patients with mild or even asymptomatic asthma can deteriorate suddenly and end up in the emergency department (ED), become hospitalized, or even suffer from fatal asthma attacks [4]. Biomarkers can help pinpoint which patients are at risk of exacerbations [5,6,7]; however, the endotypes may not be stable because of spontaneous, environmentally triggered, or treatment-induced variability [7,8,9]. Furthermore, healthcare systems can vary by region, and lack of access is a major challenge that can lead to undertreatment and deterioration of health [10,11,12]. Patient habits and/or lack of compliance to medications can also influence the development of exacerbations [13,14,15]. Lastly, environmental air quality is a critical determinant of airway inflammation, bronchospasm, and asthma deterioration [16, 17]. All of these factors can be assessed in the clinic via questionnaires, physical examinations, PFTs, and/or laboratory evaluations. However, once these initial evaluations are completed, there is a need for accurate prediction of the probability of future exacerbations for each individual patient. As clinical factors evolve, subsequent predictive modeling may be beneficial for effective disease management in said patients.

This article is based on previously conducted studies and does not contain any new studies with human participants or animals performed by any of the authors.

There are at least two studies that have used multivariable logistic regression to pinpoint features which may predict exacerbations. Blakey et al. extracted longitudinal medical records of 118,981 patients with actively treated asthma from the Optimum Patient Care Research Database, seeking to predict the number of exacerbations in a 2-year period. After selecting predictors with significant univariable association, the authors trained multiple multivariate logistic regression for recurrent asthma attacks prediction. The strongest features associated with future attacks included the following: history of exacerbations, lung function, smoking status, blood eosinophilia, age, sex and history of rhinitis, nasal polyps, eczema, gastroesophageal reflux disease, and obesity. The final validated models utilized 19 and 16 risk factors for two or more (area under the receiver operating characteristic curve (AUC) 0.785) and four or more (AUC 0.867) attacks over 2 years, respectively [18].

In addition, Noble et al. trained a multivariable logistic regression to predict asthma events (accident and emergency attendance, hospitalization, or death) on a dataset of 61,861 people with asthma extracted from England and Scotland using the Clinical Practice Research Datalink. The fitted model was validated on 174,240 patients with asthma from Wales and had AUC of 0.71. Older and underweight patients who smoked and had history of blood eosinophilia and past asthmatic attacks had the highest probability of experiencing asthma-related crisis events [19]. Since clinicians are not as well suited to the implementation of complex algorithms as machines, assessment of the aforementioned features may not be sufficient to alert patients to their risk of asthma exacerbation. Machines may be unable to implement complex algorithms because data may be inconsistently captured or maintained in an unstructured format [20]. Indeed, additional factors beyond past medical history, current symptoms, PFTs, and even biomarkers may be required for predicting future exacerbations. The patient’s access to medical care, genetic makeup and expression, habits, and environmental exposures may also impact their asthma prognosis. Thus, there is a need to integrate the many characteristics of the “asthmatic patient” using algorithms trained and tested in large, independent datasets to arrive at an accurate prognosis of the disease and potentially augment our ability to predict exacerbations for each patient individually.

Machine learning (ML) and deep learning (DL) are fields within artificial intelligence (AI) that are increasingly being used in medicine [21, 22]. However, one of the challenges in ML is the “no free lunch” theorem, which states that there is no single algorithm that will fit all datasets or solve all questions [23]. There have been efforts to—at a minimum—correlate symptoms and airway function with asthma exacerbations using ML [24]. Although the analysis of lung images was one of the first areas of respiratory medicine to demonstrate increased diagnostic accuracy with the use of neural networks [25], the prediction of acute medical events has become more recently a major unmet need, particularly during the COVID-19 pandemic [26]. This need to establish an accurate prognosis of serious conditions, such as asthma exacerbations, still persists today. In this review, we summarize recent studies that have demonstrated the ability to predict asthma exacerbations using different algorithms and propose a few next steps for this field.

Complexity of Exacerbations

More than 10 million asthma exacerbations occurred in the USA in 2020 [3]. There a need not only to accurately predict who is at risk of exacerbations [27] but also to avoid overdiagnosis and overtreatment of asthma [28]. Asthma exacerbations are defined by symptoms of increased severity that require treatment with systemic corticosteroids for at least three consecutive days [29]. Short bursts of systemic corticosteroids are effective to treat acute asthma, but they tend to be overprescribed and can cause serious side effects [28]. Furthermore, patients with poorly controlled asthma can progress to develop exacerbations due to lack of access to medical care and proper therapies [30], or lack of compliance to prescribed treatments [14]. Indeed, providing proper medications has been shown to reduce the incidence of exacerbations in some geographies [31,32,33]. However, when applied broadly, such therapeutic approaches incorrectly assume that every person with asthma is at risk of exacerbations, resulting in overtreatment, which in turn can lead to the buildup of unnecessary costs and an increased incidence of treatment-emergent adverse events. Incidentally, among well-treated patients with asthma, exacerbations are ultimately triggered by environmental insults (isolated or in combination) such as respiratory infections, allergens, and air pollution [34, 35].

Asthma exacerbations are complex and multidimensional phenomena, and ML can analyze and predict such medical events if proper inputs (including intrinsic and external factors such as patient features and environmental insults, respectively) are provided to train (or develop), validate, and test said algorithms. One of the key challenges when predicting asthma exacerbations is that the analysis of small sample sizes and few determinants of asthma severity may lead to less accurate predictions and limited conclusions [36, 37]. For example, there are patients with daily symptoms and/or low forced expiratory volume in 1 s (FEV1) who do not exacerbate; conversely, there are patients who seem clinically stable but may go on to develop an acute asthma attack when several dimensions of the asthma milieu occur in concert [38]. Several studies have analyzed levels of asthma biomarkers, such as increased eosinophils in peripheral blood/sputum and increased fractional exhaled nitric oxide (FeNO) [5, 6], yet given the multidimensional aspects of asthma [39], ML approaches will likely be required in the future to augment the accuracy of predictions.

Another potential consequence of acute asthma is disease relapse in some patients who are treated in the ED and then sent back to the ambulatory setting. Although the reported percentage varies among studies, median 2-week relapse rate is considered to be between 4% and 17%, with nearly two-thirds of the patients experiencing a relapse within the first few weeks of discharge [40, 41]. Factors associated with relapse are the same as in any exacerbation [40], with the additional consideration that some patients may be discharged without full recovery of airflow obstruction and underlying inflammation [42], and/or may suffer from psychosocial challenges making recovery outside of the hospital difficult [43, 44]. Even in the best-case scenario, considering only the patient’s past medical history, asthma symptoms, PFT results, biomarkers, and treatments, the number of exacerbations remains constant. Thus, there continues to be a need to increase our knowledge of asthma and/or consider including more features of the “asthmatic” circumstances to predict exacerbations with greater accuracy using AI (Fig. 1).

Fig. 1
figure 1

The constellation of factors influencing the asthmatic patient’s journey that may be analyzed using artificial intelligence to augment clinical decision-making. Patients with asthma present a unique collection of factors that influence their disease severity and outcome. The cornerstones of traditional patient management consist of assessments of pulmonary function (i.e., spirometry measuring lung capacity and FEV1), biomarker levels (e.g., blood eosinophils, IgE, FeNO), and patient-reported symptoms (e.g., cough, wheezing, shortness of breath, sleep). However, a myriad of factors together influence the individual patient’s risk of asthma exacerbation and disease progression, including both intrinsic factors such as genomics, comorbidities, and allergic status, and extrinsic factors such as occupational and environmental exposures, access to the healthcare system and medications, and geography. Because asthma exacerbations are complex and multidimensional phenomena, the use of artificial intelligence may better predict such events if proper inputs (including intrinsic and extrinsic factors) are provided to train, validate, and test next-generation treatment algorithms. IgE immunoglobulin E, FeNO fractional exhaled nitric oxide, FEV1 forced expiratory volume in 1 s

Machine Learning and Deep Learning

ML is a subset of AI that allows a machine to automatically learn from data without explicit programming. ML consists of algorithms (a series of steps) that must be trained (developed), validated, and tested before being deployed for general use [45]. There exist many algorithms that can be used in ML (Table 1), which are mainly classified into three categories: (i) supervised learning, (ii) unsupervised learning, and (iii) reinforcement learning.

Table 1 Commonly used machine learning algorithms (with Python and R codes)

In supervised learning, data are labeled, and the ML model learns how to best map feature vectors (input) to labels (output), allowing for the prediction of outcomes for unforeseen data [45]. This process is called training and gives the model the ability to produce accurate predictions when it is exposed to new and unlabeled datasets. Supervised learning is categorized into two algorithms: regression and classification. In regression algorithms, the output is a continuous variable (i.e., FEV1), while in classification algorithms, the output is a category (i.e., hospitalization). Examples of such algorithms include linear regression and logistic regression (Fig. 2a). In unsupervised learning, the algorithm acts without guidance as data points are not labeled or classified. The algorithm discovers patterns in the data that allow for the classification of similar datapoints together into clusters. Unsupervised learning algorithms are categorized into clustering and associations. Examples of such algorithms are K-means clustering and hierarchical clustering (Fig. 2b). In reinforcement learning, the algorithm (also called “agent”) learns by trial and error, i.e., through interaction with and feedback from the environment in a manner resembling the learning process in animals and humans. Reinforcement learning is used to train the agent to perform more complicated tasks and receive a reward, or not (Fig. 2c). The latter has not been used in medicine yet, as far as we know.

Fig. 2
figure 2

Supervised, unsupervised, reinforcement, and deep learning algorithms. a Linear regression and logistic regression are among many supervised learning algorithms and are trained on labeled data. The label can be either a continuous outcome (linear regression) or a category (logistic regression). b K-means clustering and hierarchical clustering are examples of unsupervised learning algorithms and use unlabeled data to discover hidden patterns and data clusters. The output of the supervised learning can be the clusters’ information, a dendrogram, or other visuals that highlight notable patterns within the data. c Reinforcement learning algorithms train an agent to find the optimal succession of actions that optimize a given task. The agent acts within an environment, according to defined rules and a reward system, and through trial and error, the agent explores the whole solution space and eventually identifies the optimal solution. d Deep learning models are composed of multiple layers of artificial neural networks and can be trained to predict continuous and categorical variables, using both structured (i.e., tables) and unstructured data (i.e., images, time series)

Deep learning is a type of ML that involves training artificial neural networks (ANN) containing several hidden layers on a large dataset. These ANNs learn to predict the outcome variable, which can be either continuous or categorical, on the basis of the input. In asthma, DL models have been trained on patient data for demographics, medical history, asthma phenotype, environmental factors, biomarkers, as well as PFT results. The trained model can then be used to predict the likelihood of asthma exacerbations in the near future on the basis of the patient’s data in the baseline period. This approach holds the potential for application in personalized medicine in asthma (Fig. 2d).

Studies Using Machine Learning to Predict Asthma Exacerbations

According to the “no free lunch theorem” there is no single algorithm that can be applied to all datasets homogeneously, making algorithm selection a key decision when applying ML [57, 58]. Thus, the use of ML to analyze features of asthma creates the possibility to uncover interactions between all potential features that can produce clinical outcomes and that may have not been considered until now, thereby leading to better asthma care without increasing the burden on patients and healthcare practitioners.

Original research efforts in ML and asthma were done in small clinical datasets and/or using only genomics data. Results were compared with predictions made by clinicians [59], or those made in genome-wide association studies (GWAS) [60], resulting in ML models with lower accuracy in predicting asthma exacerbations than those based on physicians’ opinions. In 2017, however, Finkelstein and Jeong interrogated data from 7001 self-reported records during a 7-day home telemonitoring window and found that ML algorithms, such as naïve Bayesian classifier, adaptive Bayesian network, and support vector machines (SVM), respectively, were able to predict asthma exacerbations on day 8 with a sensitivity of 0.80, 1.00, and 0.84; a specificity of 0.77, 1.00, and 0.80; and an accuracy of 0.77, 1.00, and 0.80 [61]. Predictive modeling included asthma symptoms, self-reported medication consumption, asthma trigger exposure, and lung function assessed by peak expiratory flow (PEF). Although using these parameters demonstrated promising results, future predictive models for asthma exacerbation should include other potentially relevant predictive factors such as genomic, clinical, sociodemographic, behavioral, and environmental.

Additional studies have also used ML to predict hospitalizations in children and adults with acute asthma [62, 63]. One study in children with acute asthma seen in the ED compared four ML approaches using features from 29,392 patients (mean age ± SD, 7 ± 4.2 years) [62]. Data used to predict hospitalizations included clinical data available at the time of triage, weather reports, neighborhood characteristics, and viruses circulating in the community. Four models were trained and validated, including decision trees, least absolute shrinkage and selection operator (LASSO) logistic regression, random forests, and gradient boosting machines (GBMs), and the area under the receiver operating characteristic curve (AUC) was calculated for each model. AUCs for each model included decision tree, 0.72 (95% confidence interval [CI] 0.66–0.77); logistic regression, 0.83 (95% CI 0.82–0.83); random forests, 0.82 (95% CI 0.81–0.83); and GBM, 0.84 (95% CI 0.83–0.85). Patient vital signs and acuity, age, and weight, followed by socioeconomic status and weather-related features, were the most important features predicting hospitalizations, with GBM identified as the best predictor of hospitalization in the study.

In another study, Goto et al. tested four ML algorithms (LASSO logistic regression, random forest, GBM, and deep neural network [DNN]) on data from 3206 adults with asthma and chronic obstructive pulmonary disease (COPD) exacerbations who visited the ED [63]. In both patient populations (asthma and COPD), GBM was the best-performing approach and achieved the highest discriminative ability (C-statistics 0.80 vs 0.68), reclassification improvement (net reclassification improvement [NRI] 53%; P = 0.002), and sensitivity (0.79 vs 0.53) over the reference model. For the prediction of hospitalizations, random forest provided the highest discriminative ability (C-statistics 0.83 vs 0.64), reclassification improvement (NRI 92%; P < 0.001), and sensitivity (0.75 vs 0.33).

Zein et al. analyzed data extracted from electronic health records (EHRs) of 60,302 patients with asthma and tested three different models (logistic regression, random forests, and GBM) to predict three different outcomes occurring in 19,772 patients with acute asthma [64]. Of the three asthma outcomes, 32.8% required oral glucocorticoid bursts, 2.9% required ED visits, and 1.5% required hospitalizations. The three outcomes were predicted best by light gradient boosting machine, with an AUC of 0.88 (95% CI 0.86–0.89). Risk factors for all three outcomes included age and use of long-acting β-agonists, high-dose inhaled glucocorticoids, or chronic oral glucocorticoid therapies. In a subgroup analysis of 9448 patients with spirometry data, adding results from PFTs did not improve the models’ predictive performances.

More recently, a predictive model of asthma exacerbations used data downloaded from ProAir Digihaler® together with clinical and demographic information [65]. The generated model predicted an impending exacerbation over the following 5 days with high diagnostic accuracy (AUC 0.83). The most significant factors contributing to the model were features based on the mean number of inhaler puffs during the 4 days before prediction. Similarly, Zhang et al. analyzed 728,535 records of daily peak expiratory flow and asthma symptoms in 2010 patients [24]. The authors used logistic regression, decision tree, naïve Bayes, and perceptron algorithms to assess the primary outcome of exacerbation detection on the same day, or up to 3 days in the future. The best model used logistic regression, which showed an AUC of 0.85 with a sensitivity of 90% and specificity of 83% for severe asthma exacerbations.

Tong et al. published results from an accurate model using data from patients with asthma treated at Intermountain Healthcare between 2011 and 2018 [66]. The authors inputted 234 candidate features leading to the analysis of 82,888 data instances to predict asthma exacerbations in the following 12 months. The XGBoost algorithm was used because it deals with missing values and calculates the importance of each feature according to its contribution to the model. The final model, using XGBoost and 71 features in descending order of their importance, yielded an AUC of 0.90 with an accuracy of 91%, sensitivity of 70%, and specificity of 91%. The top 10 features associated with exacerbations were all related to loss of asthma control and African American race. To calculate the probability of an asthma exacerbation occurring in children, Overgaard et al. developed a model that yielded an AUC of 0.80 using logistic regression, whereas random forest showed an AUC of 0.79 and perceptron showed an AUC of 0.78 [67]. Their purpose was to further conduct a clinical trial to assess safety, clinical outcomes, care quality, care cost, and satisfaction of end users (e.g., clinicians, patients, and caregivers) comparing ML algorithms versus standard care for clinical decision-making.

ML has also been used to assess the influence of environmental exposures on asthma exacerbations using conditional random forest, conditional tree, and generalized linear models. Conditional random forest and conditional tree identified five features (prednisone use, race, particulate matter exposure, obesity, and gender) to be of significant importance in the association with exacerbations. The same features (except for gender) were replicated using a generalized linear model [68]. In addition, Haque et al. used a DNN regression (DNNR) model to predict asthma exacerbations on the basis of Asthma Control Test (ACT) scores, weather triggers (temperature, humidity, pressure, and windspeed) and demographic data, with their approach achieving an accuracy of 94% [69].

Another aspect of acute asthma is the number of readmissions that can occur after the initial treatment of acute asthma in the ED. In a study of 81 patients with asthma, Halner et al. developed multivariate random forest models to identify predictors of treatment failure defined as the need for additional systemic corticosteroids and/or antibiotics, hospital readmission, or death within 30 days of the initial ED visit [70]. Forty-three patients failed treatment, which was predicted by an 11-variable random forest model (including medication, history of exacerbations, symptoms, and quality of life) with an AUC of 0.81. In this regard, de Hond et al. recently used home monitoring of peak flow rates and symptoms in 266 patients to adjust their asthma medications and avert exacerbations [71]. These authors compared ML algorithms (XGBoost vs one-class SVM) with a clinical rule and logistic regression. The AUCs were 0.85 for XGBoost and 0.88 for logistic regression. Clinical application, however, remains a challenge owing to the low incidence of events, which may be overly sensitive and create false alarms [72]. Additionally, because asthma is the result of the patient interacting with the environment, both patient activity and environmental data should be included in any ML approach to augment clinical decision-making.

The ML algorithms used in the aforementioned studies found that the most common clinical features associated with asthma exacerbations were loss of asthma control and poor air quality or weather changes; intake of medications and use of PFTs were less clearly linked to asthma exacerbations with their relevance depending on the patient populations and settings. Table 2 contains a concise summary of the studies described above.

Table 2 Summary of studies employing ML to predict asthma outcomes

Pros and Cons of Using Machine Learning to Predict Asthma Exacerbations

There are clear advantages to using ML approaches to predict asthma exacerbations with greater accuracy than it is being done today. Such advantages include the more accurate prediction of asthma exacerbation and prognosis through the analyses of large databases (as mentioned in the studies discussed above). ML approaches are faster and less expensive than clinical trials as they reduce the need for manual data entry and analyses, and detect patterns that may be more difficult for human experts to detect [73, 74]. The ability to automatically integrate the different dimensions that are part of the asthma syndrome and its timeline of deterioration leading to exacerbations can only be achieved using ML owing to its lesser scope for human error. In this regard, new features or data patterns that can predict asthma exacerbations could be detected (as has been done in other areas of medical care). For instance, studies have demonstrated that ML can prevent adverse events such as healthcare-associated infections, adverse drug events, venous thromboembolism, surgical complications, pressure ulcers, falls, decompensation, and diagnostic errors [75]. Another advantage of using ML is the ability to implement therapeutic measures by predicting the future risk of deterioration at the point-of-care [76]. Finally, ML may also be applied retrospectively in specific patient subgroups to understand/identify likely clinical outcomes depending on treatment selected, sometimes described as real-world evidence (RWE). There exist some ethical challenges that need to be taken into consideration when planning to use AI in healthcare. First, informed consent must explain when (and if) AI has been used to provide a therapeutic recommendation or to inform prognosis and risk of future events. Explaining to the patient the type of AI used and how it has augmented the accuracy of the prediction is needed, while “black box” algorithms (when the developer cannot share details of the algorithm) should be avoided [77]. Black box approaches are those in which the inner workings of the system are not easily explained, but they are often used in ML algorithms that are trained on large amounts of data. These algorithms can be very accurate at making predictions but can also be difficult to interpret, meaning that it is not always clear why they make the predictions they do. Therefore, there are several reasons why black box approaches should be avoided in healthcare. First, they can lead to a lack of trust between patients and healthcare providers. Second, black box approaches can make it difficult to identify and address bias in healthcare algorithms. Third, black box approaches can make it difficult to improve healthcare algorithms over time. If it is not clear how an algorithm works, it can be difficult to identify the factors that contribute to its accuracy and to make changes that can improve its performance. This can lead to algorithms that are less accurate and less effective over time.

Additionally, the data currently available for use in AI are often unstructured, simply not in any documented form, and/or disputed. High-quality, large, annotated databases may prove to be quite fruitful in minimizing patient harm in the future (caused by potentially incorrect predictions) [78]. Until this happens, patients must be informed of potential pitfalls when implementing AI in healthcare decisions. Because all AI input is developed by humans, it is subject to human bias. This can occur when algorithms are trained in limited demographic groups leading to a lack of generalizability and unintended social bias. If the data are biased, the algorithm will perpetuate these biases, leading to incorrect predictions [79]. Finally, with regards to data privacy, EHRs contain confidential patient information; searching such data using algorithms may constitute a breach of patient privacy.

Conclusion

Despite the use of inhaled corticosteroids, bronchodilators, leukotriene modifiers, and biologics for T2-high and T2-low asthma, some patients with asthma continue to suffer from acute exacerbations. Predicting asthma exacerbations with better accuracy using disease management programs and RWE can significantly reduce asthma-related morbidity and healthcare costs.

Each patient’s asthma journey and disease severity are influenced by a constellation of factors, including their medical history, biomarker phenotype, pulmonary function, level of healthcare system support, compliance to prescribed therapy, comorbidities, personal habits, and environmental conditions. Such aspects of the patient’s experience can be analyzed using ML to augment clinical decision-making and provide appropriate treatment to improve both asthma prognosis and the overall quality of life.

It must be noted that there are two metrics used to evaluate performance of an ML model in binary classification tasks: precision and recall. These measure how accurate and how complete the model’s positive predictions are, respectively. In general, it is desirable to have both high precision and high recall. However, in a clinical diagnosis system, one may prioritize recall over precision because it is more important to identify all patients at risk of exacerbations, even if some of the patients with less severe asthma may present with comparatively lower risk of exacerbations.

Additionally, while ML works well in very large and/or well-labeled datasets, it may be difficult to apply ML directly in smaller and/or less defined datasets. In such circumstances, an initial step of natural language processing (NLP) could help provide a curated dataset as input to the ML model. Clinical NLP can be applied to clinician notes to extract clinical features such as clinic visits, hospital treatments, and PFTs among other features in the patient timeline that can serve as ML inputs. Furthermore, the interpretability of ML algorithms by patients and healthcare professionals can be improved by using simpler algorithms that can explain their predictions in plain language, or by visualizing the algorithm’s decision-making process that can then be audited by humans.