Failure prediction using personalized models and an application to heart failure prediction

Roy, Asim; Bruce, Charles; Schulte, Phillip; Olson, Lyle; Pola, Manasa

doi:10.1186/s41044-020-00044-2

Failure prediction using personalized models and an application to heart failure prediction

Research
Open access
Published: 08 July 2020

Volume 5, article number 3, (2020)
Cite this article

Download PDF

You have full access to this open access article

Big Data Analytics

Failure prediction using personalized models and an application to heart failure prediction

Download PDF

Asim Roy¹,
Charles Bruce²,
Phillip Schulte²,
Lyle Olson² &
…
Manasa Pola¹

4406 Accesses
1 Citation
Explore all metrics

Abstract

Background

To reduce disruptions of processes and the cost of maintenance, predicting the onset of failure (or a similar event) of a physical system (or components of a physical system) has become important. Prediction of onset of failure would allow appropriate corrective actions at the right time. In this paper, we present a method to predict the “onset” of failure (the start of a degradation process or similar types of events) of a physical system that minimizes data collection and personalizes it for the physical system. The method applies to situations where one monitors the operating characteristics of the physical system at regular time intervals by means of attached sensors and other measurement instruments. It creates a model of the physical system, during normal operations, using the time-series data produced by the sensors and measurement instruments. However, it does not create or use any time-series models. It simply examines the distribution of time-series data across different time periods. It uses this model of normal operations in subsequent time periods to monitor the physical system for deviations from normality.

Results

We illustrate this method with an application to predict the “onset” of subsequent decompensated heart failures for patients already treated for a heart failure at a hospital. As part of an NIH study, these heart failure patients received two ECG patches, an accelerometer and a bio-impedance measurement device for regular monitoring for a period after their release from the hospital.

Conclusions

When dealing with non-homogenous, disparate physical systems, personalized models can be better predictors of a phenomenon compared to generalized models based on data collected from an assortment of such physical systems. In medicine such models can be a powerful addition to the set of medical diagnostic tools. And such personalized models can be built rather quickly without waiting for extensive data collection.

Home-Based Multi-parameter Analysis for Early Risk Detection and Management of a Chronic Disease

From Holistic Health to Holistic Reliability—Toward an Integration of Classical Reliability with Modern Big-Data Based Health Monitoring

Machine Learning for Failure Analysis: A Mathematical Modelling Perspective

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Background

In general, the first step to create a model of any phenomenon is data collection. For example, to predict fraud, an organization would collect data on different fraud and non-fraud cases. And to predict breast cancer from biopsies, one would collect data on different cases where the tumors were either benign or malignant. In general, the basic idea in machine learning is to build models from a diverse set of cases so that the models can “generalize” and accurately account for the variety of cases that exemplify the phenomenon. However, the attempt to generalize from a very diverse set of cases can sometimes be problematic and can result in models that may not be very accurate in their predictions. Such diversity of cases arise often in the medical field because human bodies become very dissimilar physical systems over time. For example, one would find patients with similar medical histories exhibiting different medical conditions making it difficult to identify common medical profiles for certain medical conditions. (Creating such common profiles is the task of generalization in machine learning.) For instance, heart failure patients often have very different medical histories and, thus, makes it difficult to build highly accurate profiles for them. Hence, generalization and accuracy of prediction suffer with these kinds of phenomena.

It is often possible to redefine the problem, especially when system degradation prediction is of interest, by simply focusing on the data produced by an individual physical system. In this scenario, one simply would build models of an individual physical system that, in effect, would define its normal operating characteristics. Such models would no longer need to compare an individual system’s behavior with other similar systems. They would therefore not require extensive data collection for the purpose of “generalization.” In addition, they could be built quickly, almost instantaneously in some cases, and be able to capture the idiosyncrasies of a particular system.

Here is an example of such a situation. In one particular NIH study of decompensated heart failure (DHF) [40], further discussed later in this paper, DHF patients, after their first heart failure treatment, received a package of devices for remote patient monitoring (RPM) on their discharge from the hospital. The package included two ECG patches, an accelerometer and a bio-impedance measurement device. The NIH study collected data from individual patients with the RPM devices in order to predict the “onset” of next decompensated heart failure for such patients. However, given the diversity of the patient population in this study, any population-based predictive model, based on RPM and other medical data of the patients, would not be very accurate. In this paper, we redefine the prediction problem in such situations and show that a personalized model for each individual patient, based simply on the RPM data, would be much more accurate in its prediction of the “onset” of DHF.

In this paper, we propose a method for personalized modeling of a physical system for failure prediction (or, to be precise, predict the start of the degradation process of a system) based on time-series data produced by sensors and other measurement instruments. We then show the application of this method to predict the “onset” of subsequent decompensated heart failure of three patients from the NIH study. Heart failures are generally a slow degradation process and are similar to slow failure processes of many other physical systems. Thus, the method can be applied to failure prediction of machinery and production processes with similar characteristics. Although the proposed method uses time-series data produced by sensors and other instruments, it does not actually construct or use any time-series models. It simply examines the distribution of time-series data across specified time cycles to make predictions about the “onset” of failure. Prediction of the “onset” of slow degradation processes is also not strictly anomaly detection but is more about trend analysis. The advantage of personalized modeling is that it does not require large amounts of data collection about other similar systems. And for situations where it is difficult to generalize from diverse population characteristics, personalized models can be far more accurate.

Literature review - remaining useful life (RUL) approach to failure prediction

Prediction of remaining useful life (RUL) of a physical system, before it is likely to require repair or replacement, is widely used for predictive maintenance. RUL methods, also called prognostic methods, are broadly classified as two types: (1) data-driven methods and (2) model-driven methods [28]. Data-driven approaches use statistical and machine learning methods on historical failure data of similar systems to build prognostic models. Traditional data-driven approaches include autoregressive (AR) and threshold autoregressive models [3, 42], projection pursuit models [12] and multivariate adaptive regression splines models [13]. More recent data-driven approaches include a variety of neural network models [2, 19, 20, 27, 37, 41]. Model-based approaches use models that describe the physics of the system [1, 7, 30].

Liu et al. [28] proposed a data-model fusion framework for system state prognostics. Nystad et al. [34] investigated the problem of estimating the remaining useful life using stochastic lifetime models and considered randomly distributed failure thresholds. Gola and Nystad [16] combined a condition monitoring system that provides reliable calculations of the actual erosion state of a choke valve during its operation with a lifetime RUL model. In effect, it adjusts a lifetime RUL model using real-time system monitoring data. Lei et al. [26] proposed a model-based method for predicting RUL that has two modules: (1) an indicator constructor which fuses mutual information from multiple features and properly correlates to the degradation processes of machinery, and (2) a RUL predictor that uses a particle filtering-based algorithm.

Casoetto et al. [3] constructs a model of normal behavior of a system by fusing multiple sensor signals during its normal operation. Using this model, they then monitor the behavior of the system for degradation, expressed by drift of signals away from the normal. They fit an Autoregressive (AR) model to time series signals of multiple sensors and extract the Power Spectral Density (PSD) peaks of individual sensor readings from the roots of the corresponding AR model characteristic equation. These features, extracted during normal operations, are saved as the model of normal operating behavior. Their overall idea is very similar to ours in the sense that they build a personalized model of a process or system and do not use any historical data. They predict failure (degradation) by observing drift in the behavior of the system. However, our signal fusion method is completely different from their method.

Qiu et al. [37] present a prognostic method that includes a wavelet filter-based method for signal de-noising, weak signature enhancement for fault identification and two Self Organizing Maps (SOMs) for performance assessment and degradation detection to detect defects at an early stage in rolling element bearings. The approach, in principle, also constructs a personalized model by training the SOMs with normal operating data of a system. We also use SOMs in our method as discussed later. However, the way they use SOMs to track system degradation is completely different from ours. Plus, we do not use any wavelet-based method for signal denoising.

Literature review – predictive models in personalized medicine

There are many ongoing efforts to use predictive models in personalized medicine that rely more on individual patient’s medical history and data. We cite a few here. Nevins et al. [32] provide a model building framework to combine multiple types of data, both genomic and clinical, of individual patients to better predict breast cancer treatment outcomes. Esteban et al. [11] model the clinical evolution of individual patients, which usually is composed of thousands of events such as ordered tests, lab results and diagnoses, to predict the future sequence of events for clinical decisions. Their specific work related to patients with kidney failure who either obtained an organ transplant or were still waiting for one. They construct neural network models in combination with embeddings in a clinical context. Che et al. [6] propose a model to personalize prediction of Parkinson’s disease progression. It learns patient similarity from longitudinal and multi-modal patient records with a Recurrent Neural Network (RNN) architecture. Ng et al. [33] propose a method to build personalized predictive models to predict the onset of diabetes in patients. They show that personalized models (they use logistic regression models), built using data of similar patients, outperform global models that use data for all patients. Jiang et al. [21] propose a new method to calibrate a predictive model for individual patients; it uses a similar group of patients for calibration. Lee et al. [24] used a cosine-similarity based patient similarity metric to identify similar ICU patients from an ICU database and to dynamically build 30-day mortality prediction models for individual patients. Their model outperformed the general, one-size-fits-all model for such predictions. Lee [25] uses the same idea of finding similar patients but uses a random forest model instead of cosine similarity, to build a customized model for an individual patient. Many of these methods, particularly the ones that use a similar group of patients, use notions from the collaborative filtering method used in personalized recommendation systems in e-commerce. In summary, to build predictive models for personalized medicine, these are the kinds of approaches being used.

Literature review - predicting heart failure

Ross et al. [39] performed a systematic review of studies evaluating patient characteristics associated with hospital readmission for heart failure (HF). Rahimi et al. [38] reviewed the literature for risk prediction models for patients with heart failure and identified the most consistently reported independent predictors of risk across models. Kansagara et al. [22] assessed the performance of the prevailing set of models that predict hospital readmission. Mortazavi et al. [31] used data from Telemonitoring to Improve Heart Failure Outcomes trial [5] to compare the effectiveness of various machine learning methods to predict 30- and 180-day all-cause readmissions and readmissions because of heart failure. Choi et al. [8] used a recurrent neural network (RNN) model to predict the initial diagnosis of heart failure. The RNN model exploited temporal relations among events in electronic health records (EHRs) and used 3884 HF records of primary care patients. They also compared it with other models such as logistic regression, neural network, support vector machine and K-nearest neighbor classifiers. Wang et al. [44] used structured and unstructured data from electronic health records to predict the onset of heart failure and varied the prediction window from 60 to 720 days before heart failure diagnosis. They used a total of 1684 heart failure records of primary care patients. Dai et al. [9] used five machine learning models to predict heart-related hospitalizations. They used EHR data of patients with heart disease from a large urban hospital in Boston.

The rest of the paper is organized as follows. The “Method” section provides an overview of the nature of the proposed personalized model and the data collected and used to predict the “onset” of heart failure using such a model. The “Results” section has the detailed steps of the proposed method. The “Discussion” section shows the application of the method to predicting the “onset” of decompensated heart failure for three patients in the NIH study. The “Conclusions” section provides some concluding remarks.

Method

The proposed method is based on time-series data

The proposed method is about personalizing a model for a physical system to predict the “onset” of its degradation process. Such a model is solely based on data recorded at certain time intervals by a monitoring system for the physical system. It does not use or depend on any prior knowledge about such physical systems. For plants and machinery, such a monitoring system typically would consist of different types of sensors attached to them, such as the ones to measure vibration, pressure and temperature. For the heart failure case study discussed in this paper, the remote patient monitoring (RPM) system consisted of two ECG recording patches, an accelerometer to monitor the patient’s activity and a bio-impedance measuring device.

The sensors of a monitoring system can generate data at different frequencies. For example, ECG patches generate data every few milliseconds while blood pressure and weight might be recorded only a few times a day. The data generated by a sensor is essentially time-series data. When multiple sensors generate data at different frequencies, the frequencies need to be aligned for modeling purposes. There are different ways to align slow and high frequency time-series data. For example, temperature or pressure, if they are measured too frequently, can be averaged over a time interval to produce a lower frequency time-series. In the same way, if weight is measured infrequently, the same weight value can be used at subsequent time points until a new weight is recorded.

Sensor data is usually collected by external devices which then can extract additional information from them. The extracted features, in turn, define additional time-series. In general, multiple sensors and downstream devices collectively produce streaming time-series data. Thus, we can define a physical system by the characteristics of such a collection of different time-series. One way to predict the “onset” of degradation (the “onset” of failure) is to model each time-series from the data generated during normal operations of the physical system, then monitor the physical system using these time-series models and look for deviations from the normal operating mode [3]. However, we do not create any time-series models in our approach. Instead, we look for changes in the distribution of time-series data over time and then isolate one or more time-series that potentially is causing the degradation.

On the decompensated heart failure study

Since we will illustrate each step of our method with a few heart failure cases, we present some background information on the NIH supported decompensated heart failure study at Mayo Clinic that provided the data for these cases. The NIH study [40] used the BodyGuardian remote monitoring system from Preventice [36]. The BodyGuardian remote health management system is an FDA 510 approved device used for remote monitoring of cardiac patients. It has a front-end that includes an adhesive snap-strip body sensor (BodyGuardian) with built-in electrodes that measure ECG signals and bio-impedance. It also has a 3-way accelerometer. Overall, the system measures heart rate, ECG, respiration rate (RR) and activity. It also communicates with off-body sensors such as a BP cuff and scale to incorporate BP and weight data. In addition, it solicits symptoms from the user thus acting as an event recorder and recording simultaneous physiologic data. It wirelessly transmits all data to a central data analysis hub.

From ECG signals, bio-impedance measurements and accelerometer data, BodyGuardian derives 56 features. It classifies activity level in the range 0 to 100, which is then binned into 10 ranges. From the activity data, it derives three basic body positions: lying, leaning and standing. We excluded activity level and body position data from our model. From ECG data, it extracts a number of features including: PVC (premature ventricular complex), SVC (supraventricular complex), NSR (normal sinus rhythm), Unclassified Rhythm, SinTachy (sinus tachycardia), SinBrady (sinus bradycardia), IVCD (interventricular conduction delay), Mobitz 1 and 2, AV Block (atrioventricular block), PJC (premature junctional complex), PAC (premature atrial contractions), SVTA (supraventricular tachyarrythmia), AFib (atrial fibrillation - slow, normal, rapid), IVR (idioventricular rhythm), VT (ventricular tachycardia), VF (ventricular fibrillation), minimum heart rate and maximum heart rate. The data also includes blood pressure, respiration rate and weight.

We decided to have the data averaged every 5 min for modeling purposes, although the data is available on a finer time scale. In effect, we are observing the patient every 5 min. When recorded continuously during a day, one gets 288 observations. We decided to create a model using BodyGuardian data for a single day and, then, use that model to track changes in the patient’s physiological profile on subsequent days. Since the physiological measurements vary during the course of a day, our approach is to model the distribution of the physiological data during the day.

Since, in this study, Mayo Clinic provides a patient with the BodyGuardian device only after a heart failure treatment, in general, we create a model for a patient after a full day of recording following discharge from the hospital. One can construct models using data over several days following discharge from the hospital, but there is a risk in the sense that there could be onset of decompensation very soon after discharge. The model is meant to reflect the physiological state of the patient before the onset of a subsequent decompensation.

We do not use any clinical data of patients in our models. Nor do we use data of other patients to build each individualized patient model. This concept of creating a personalized model based predominantly on data generated by wearable biosensors is new and may have wide applicability in many situations. Table 1 (from [35]) shows some typical biosensors in use today and the biosignals generated by them. There are many factors driving the growth in usage of such wearable devices including: an aging population worldwide, the need to reduce hospital and emergency visits, and the need to monitor and manage chronic diseases remotely.

Table 1 Different types of biosensors and the biosignals generated by them. From Pantelopoulos & Bourbakis [35] and with permission of IEEE

Failure prediction using personalized models and an application to heart failure prediction

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Home-Based Multi-parameter Analysis for Early Risk Detection and Management of a Chronic Disease

From Holistic Health to Holistic Reliability—Toward an Integration of Classical Reliability with Modern Big-Data Based Health Monitoring

Machine Learning for Failure Analysis: A Mathematical Modelling Perspective

Explore related subjects

Background

Literature review - remaining useful life (RUL) approach to failure prediction

Literature review – predictive models in personalized medicine

Literature review - predicting heart failure

Method

The proposed method is based on time-series data

On the decompensated heart failure study

Results

Discussion

Application of the method to predict the “onset” of decompensated heart failure

Step 4 of the method - clustering

Step 5 – feature ranking

Step 6 – individual patient monitoring using the clustering model and the ranked features

Patient A – readmitted to the hospital 17 days after hospital discharge

Patient B – readmitted to the hospital 17 days after hospital discharge

Patient C – not readmitted to the hospital during the monitoring period

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation