A continuous time Bayesian network model for cardiogenic heart failure

Gatti, E.; Luciani, D.; Stella, F.

doi:10.1007/s10696-011-9131-2

A continuous time Bayesian network model for cardiogenic heart failure

Published: 08 December 2011

Volume 24, pages 496–515, (2012)
Cite this article

Download PDF

Flexible Services and Manufacturing Journal Aims and scope Submit manuscript

A continuous time Bayesian network model for cardiogenic heart failure

Download PDF

E. Gatti¹,
D. Luciani² &
F. Stella¹

1665 Accesses
21 Citations
Explore all metrics

Abstract

Continuous time Bayesian networks are used to diagnose cardiogenic heart failure and to anticipate its likely evolution. The proposed model overcomes the strong modeling and computational limitations of dynamic Bayesian networks. It consists of both unobservable physiological variables, and clinically and instrumentally observable events which might support diagnosis like myocardial infarction and the future occurrence of shock. Three case studies related to cardiogenic heart failure are presented. The model predicts the occurrence of complicating diseases and the persistence of heart failure according to variations of the evidence gathered from the patient. Predictions are shown to be consistent with current pathophysiological medical understanding of clinical pictures.

Risk Assessment for Primary Coronary Heart Disease Event Using Dynamic Bayesian Networks

Hybrid Time Bayesian Networks

Causal Independence Models for Continuous Time Bayesian Networks

1 Introduction

Recent technological developments in the field of Information and Communication Technology have offered an extremely important opportunity to operational health care management [1]. Because of this, decision support systems (DSSs) are becoming increasingly attractive for physicians, as they can offer great benefits without necessarily daring to replace human judgement [2–4]. The contribution of DSSs in health care has been far-reaching and still evolving as evidenced by the large number of references that appear in PUBMED, a widely used health care search engine. Increasingly, health care costs make it imperative for hospitals and physicians to make optimal decisions to improve the quality and the efficiency of health care delivery. Recent advances in DSSs have provided a prominent and growing role of DSSs in improving clinical as well as administrative decision making [5].

Bayesian networks (BNs) [6, 7] have become a popular representation in Artificial Intelligence for encoding uncertain knowledge [8, 9]. As inferential engines on even a large set of outcomes of interest, BNs often represent the core of flexible DSSs, like influence diagrams or, more generally, decision graphs [7]. Their task concerns the selection of decision options which are optimal in the light of both the knowledge on the modelled domain and the observations collected in specific cases, in which the user is offered a normative interpretation of an undetermined state [10]. Even if not equipped with decision analysis operators, BNs may nevertheless offer to the decision maker crucial information about the impact of observations on a set of variables which influence the decision.

In medicine, causal explanations of patient manifestations and future outcomes can be regarded as the main variables of interest, since they enable, respectively, diagnostic and prognostic reasoning. Medical literature offers several examples of models used for reasoning under uncertainty in different medical fields [11]. QMR-DT [12] was proposed for internal medicine, Pathfinder and Intellipath [13] for pathology, Qualicon [14], Localize [15], Myosys [16], Myolog [17], Electrodiagnostic Assistant, Neurop [18], Kandid [19] and Munin [20] for neuromuscular disorders.

However, with many medical problems the time duration of events concerning patient conditions cannot be dismissed like in the above static models. Partially observable Markov decision processes (POMDPs) can in principle be exploited to formalise the temporal planning of clinical management. However, their practical application is hampered by their coarse representational granularity and complex formulation. Graphical representations were advocated in order to improve both the computational tractability and the representation of POMDPs [21]. Since then, the use of temporal graphical models has appeared in the field of pediatric cardiology [22], abdominal pain [23], insulin administration [24] and ventilator-associated pneumonia [25]. Most of these applications are based on dynamic Bayesian networks (DBNs) [26], which represent the standard extension of BNs when dealing with dynamical systems.

DBNs discretize the time to model a dynamical system with several time slices. Each time slice is associated with a BN fragment which models the transition from the state at time t to the state at time t + 1. DBNs describe the state of the dynamical system at discrete time points, but do not model time explicitly. This makes it very difficult to query a DBN for a distribution over the time at which a particular event takes place. Furthermore, in the case where the system consists of processes which evolve at different time granularities and/or the obtained observations are irregularly spaced in time, the inference process may become computationally intractable.

In all cited dynamic models, different strategies were exploited to deal with the computational burden imposed by the temporal dimension, such as narrowing the temporal windows, including past observations [25], preliminary detection of critical time of change [23] and focusing on the most relevant variables as the process evolves [22]. Each strategy seems appropriate for a specific task and domain to represent, whereas no general solution emerged as appropriate for all domains.

In this paper continuous time Bayesian networks (CTBNs) [27] are used to diagnose acute cardiogenic heart failure while overcoming the main limitations of DBNs. In spite of the medical advances, cardiogenic heart failure remains one of the most common, costly, disabling and deadly medical conditions encountered by a wide range of physicians and surgeons in both primary and secondary health care. Indeed, from 1 to 2% of the adult population suffers from heart failure, but the numbers are increasing due to the aging of the population, as the disorder mainly affects people over 65 years old [28].

The proposed CTBN includes both unobservable variables and clinical manifestations which are directly accessible through medical investigation. Inference on unobservable variables such as myocardial infarction and cardiac pump impairment is the focus of diagnostic judgement as well as prognostic task related to the occurrence of shock and heart failure persistence. Three scenarios serve the purpose to show how the developed model can be used for both diagnosis and prediction of complicating disorders. The described scenarios include point evidence, usually also available with DBNs, and interval evidence, which is one of the main modelling advantages of CTBNs over DBNs. The CTBN model represents the cardiovascular system at a level of detail which appears appropriate to explain its main causes, specifically, an underlying chronic weakness of the cardiac muscle and a large myocardial infarction.

The rest of the paper is organized as follows. Section 2 gives the basics of CTBNs. In Sect. 3 the acute cardiogenic heart failure model is presented, and how it can be exploited for reasoning under uncertainty over time is described. Three evidence scenarios show the capability of the proposed model to assist the clinician in both diagnostic and prognostic tasks. Section 4 discusses the proposed approach to cardiogenic heart failure, while Sect. 5 draws conclusions and proposes further research directions.

2 Continuous time Bayesian networks

CTBNs explicitly represent temporal dynamics and allow us to recover the probability distribution over time when specific events occur. CTBNs are based on homogeneous Markov processes, while they exploit BNs to provide an intuitive language to describe complex dynamical systems.

CTBNs have been used to model the presence of people at their computers together with the specific application they are using (e.g., email, word processing, web browsing, etc…) [29]. They have been successfully used for modeling and analyzing the reliability of dynamical systems [30], for network intrusion detection [31] and for modeling social networks [32].

2.1 Homogeneous and conditional Markov processes

CTBNs are based on finite state continuous time homogeneous Markov processes, i.e. stochastic processes in which the transition intensities do not depend on time. Let X be a random variable whose state can take k discrete values Val(X) = {x ₁, …, x _k }. X changes its state continuously over time t. A homogeneous Markov process X(t) is described with its intensity matrix:

$$ \user2{Q}_{X}=\left[ \begin{array}{ccccc} -q_{1} & q_{12} & \ldots & \ldots & q_{1k} \\ q_{21} & -q_{2} & \ldots & \ldots & q_{2k} \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ q_{k1} & q_{k2} & \ldots & \ldots & -q_{k} \end{array}\right] . $$

The matrix Q _X allows us to describe the transient behaviour of the random variable X. If at time t = 0 the random variable is in state x _i, then it stays there for an amount of time which is a random variable exponentially distributed with parameter q _i. Therefore, the probability density function together with the distribution function for X(t) to remain in state x _i are as follows:

$$ \begin{aligned} f\left( t\right) &=q_{i}\exp \left( -q_{i} t\right) \\ F\left( t\right) &=1-\exp \left( -q_{i} t\right) \end{aligned} $$

where t ≥ 0. It is worthwhile to mention that the expected time of transitioning from state x _i is $\frac{1}{q_{i}}, $ while when transitioning from state x _i the random variable X shifts to state x _j with probability $\frac{q_{ij}}{q_{i}}. $

However, the size of the intensity matrix Q _X, i.e. the state space of the Markov process, grows exponentially with the number of variables and with their cardinality. This makes the above representation infeasible for all but the smallest spaces, i.e. models including a very small number of variables. Therefore, to compose Markov processes in a larger CTBN model, the concept of conditional Markov process must be introduced.

A conditional Markov process is a particular kind of inhomogeneous Markov process, in the sense that, for any given random variable, the intensities are a function of the current values of a particular set of other variables, which also evolve as Markov processes. Therefore, intensities vary over time but not as a direct function of it. To clarify how the conditional Markov process is described, let X be a random variable whose domain is Val(X) = {x ₁, …, x _k } and assume that it evolves as a Markov process X(t). Furthermore, assume that the dynamics of X(t) are conditionally dependent from a set V of random variables evolving over time. Then the dynamics of X(t) can be fully described by means of a conditional intensity matrix (CIM), which can be written as follows:

$$ \user2{Q}_{X|V}=\left[ \begin{array}{cccccc} -q_{1}\left( V\right) & q_{12}\left( V\right) & \ldots & \ldots & q_{1k}\left( V\right) \\ q_{21}\left( V\right) & -q_{2}\left( V\right) & \ldots & \ldots & q_{2k}\left( V\right) \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ \ldots & \ldots & \ldots & \ldots & \ldots \\ q_{k1}\left( V\right) & q_{k2}\left( V\right) & \ldots & \ldots & -q_{k}\left( V\right)\\ \end{array}\right] . $$

A CIM is a set of intensity matrices, one intensity matrix for each instance of values v to the set of variables V. Using the BN’s terminology, the variables belonging to the set V are called the parents of the random variable X. This set is usually denoted pa(X), while in the case where the parent set pa(X) is empty, the CIM is simply a standard intensity matrix.

2.2 The continuous time Bayesian network model

Conditional intensity matrices (CIMs) allows us to model local dependencies between random variables, which is a fundamental aspect of both BNs, DBNs and CTBNs. Given a set of CIMs they can be put together to obtain a single structured model which fully describes the aspects of the evolution of a multivariate probability distribution. A CTBN consists of two main components: (1) an initial probability distribution and (2) the dynamics which rule the evolution over time of the joint probability distribution associated with the CBTN.

Definition 1

[27] (Continuous Time Bayesian Network). Let X be a set of local variables X ₁, …, X _n. Each X _i has a finite domain of values Val(X _i). A CTBN ℵ over X consists of two components: The first is an initial distribution P ⁰_X , specified as a Bayesian network $\mathcal{B}$ over X. The second is a continuous transition model, specified as

a directed (possibly cyclic) graph G whose nodes are X ₁, …, X _n; pa(X _i) denotes the parents of X _i in G.
a conditional intensity matrix, ${\textbf{\textit{Q}}}_{X_{i}|pa(X_{i})} $, for each variable $X_i \in {\textbf{\textit{X}}}. $

CTBNs allow, differently from BNs and DBNs, cycles in the graph G. Therefore, arcs directed from node X to node Y and directed from node Y to node X imply that the dynamic of the random variable X depends on Y as well as the dynamic of the random variable Y depends on X. This dependency is analogous to a DBN model where we have an arc directed from X(t) to Y(t + 1) and an arc directed from Y(t) to X(t + 1).

2.3 Queries and inference

In [27] it has been shown that a CTBN ℵ is a factored representation of a homogeneous Markov process described by the joint intensity matrix defined as

$$ {\textbf{\textit{Q}}}_{\aleph }=\prod\limits_{X\in{\textbf{\textit{X}}}}{\textbf{\textit{Q}}}_{X|pa\left(X\right) }. $$

(1)

Therefore, the CTBN ℵ can be used to answer any query which can be answered by using an explicit representation of a Markov process. Indeed, given the set of CIMs ${\textbf{\textit{Q}}}_{X|pa\left( X\right)}, $ X ∈ X associated with the nodes of the CTBN model ℵ, it is always possible to form the joint intensity matrix Q _ℵ to answer queries just as we do for any homogeneous Markov process. Given the joint intensity matrix Q _ℵ and the initial distribution $P_{\user2{\aleph}}^{0}, $ many questions can be answered about the homogeneous Markov process ℵ(t).

The distribution over the value of ℵ(t) is given by

$$ P_{\aleph }\left( t\right) =P_{\aleph }^{0}\exp \left( {\textbf{\textit{Q}}}_{\aleph }t\right) $$

(2)

while the joint distribution over any two time points can be computed as follows:

$$ P_{\aleph }\left( s,t\right) =P_{\aleph }\left( s\right) \exp \left( {\textbf{\textit{Q}}}_{\aleph }\left( t-s\right) \right) ,\quad t\geq s. $$

(3)

Inference in CTBNs can be performed by exact and approximate algorithms. Full amalgamation [27] is an exact algorithm that involves generating an exponentially-large matrix representing the transition model over the entire state space (1). Exact inference in CTBNs is NP-hard, and thus different approximate algorithms have been proposed. Nodelman et al. [33] introduced the Expectation Propagation (EP) algorithm which allows both point and interval evidence. It exploits message passing in a cluster graph, where the clusters contain distributions over trajectories of the variables through a duration. Saria et al. [34] presented a new EP-based algorithm which uses a flexible cluster graph architecture that fully exploits the natural time-granularity at which different sub-processes evolve. It also dynamically chooses the appropriate level of granularity to use in each cluster at each point in time. Alternatives are offered by sampling based inference algorithms. The importance sampling algorithm [35] computes the expectation of any function of a trajectory, conditioned on any evidence set constraining the values of subsets of the variables over subsets of the timeline. El-Hay et al. [36] developed a Gibbs sampling procedure for CTBNs which iteratively samples a trajectory for one of the components given the remaining ones. This approach naturally exploits the structure of the CTBN to optimize the computational cost of each step. This procedure is the first that can provide asymptotically unbiased approximations in such processes.

In this paper the inference task has been performed by using a proprietary software environment, designed and developed at the MAD laboratory. The CTBNs framework, developed under the MATLAB environment, offers the following functionalities:

• Load and compile; allows to load a CTBN model, to check its consistency and to allocate the required data structures for its management.

• Query; gets both point and interval evidence and includes them in a previously loaded and compiled CTBN model.

• Inference; offers the following algorithms; full amalgamation, EP and Gibbs sampling.

• Reporting; reports on all the statistics, including posterior probabilities, expected times to transition and expected number of transitions.

The correctness of approximate algorithms has been extensively tested exploiting full amalgamation and the CTBN-LRE environment [37].

3 Acute cardiogenic heart failure

3.1 The continuous time Bayesian network model

Heart failure is a disorder in which the heart pumps blood inadequately. Because the heart pumps oxygenated blood into the arterial vessels while taking unoxygenated blood from the veins, the consequences of heart failure are twofold. On one side, it leads to a reduced blood flow with a lower delivered oxygen into the peripheral tissues, which in turn induces a reduced exercise capacity level and fatigue, or even an irreversible condition known as shock, in which cells become unable to meet their metabolic functional needs. On the other side, heart failure induces congestion of blood both in the veins and lungs, leading to shortness of breath and the enlargement of organs. The first major advance for understanding the functional role of the heart, is due to William Harvey in 1628, who provided the first scientific demonstration of the circulation theory in his Exercitatio Anatomica de Motu Cordis et Sanguinis in Animalibus [38]. Since then, many authors have added fundamental contributions to explain the effects of various impairments of the cardiocirculatory system on human health. Figure 1 shows how these findings were given a graphical representation in terms of causal graphs, i.e. the qualitative component of the CTBN.

The meaning of nodes in Fig. 1, the information concerning their accessibility to medical investigation, the associated unit measure as well as the meaning of their states are listed in Tables 1 and 2.

Table 1 Meaning of the nodes of the acute myocardial infarction CTBN

Full size table

Table 2 Accessibility, unit measures and state meaning for the nodes of the acute myocardial infarction CTBN

Full size table

The consistency of the qualitative component of the CTBN model, i.e. the set of directed arcs, is ensured by the current medical knowledge as described in an authoritative textbook in the field of cardiology [39]. The model includes both variables accessible to medical investigation and variables whose role was studied only within an experimental setting. Some of the contemplated observations (Fig. 1) are always accessible in the medical practice, like heart frequency (HF), mean arterial blood pressure (BP) or the occurrence of pedal edema (LPE). Others can be investigated only with the application of simple diagnostic procedures (Table 2). The strength of the heart in pumping blood into the vascular system is represented by the node Pump ([39], p. 412). Together with HF, the cardiac pump influences both the left and the right cardiac output (LCO and RCO) ([39], p. 413), as well as the left and right cardiac input (LCI and RCI) ([39], pp. 394–399). However, the amount of blood coming out from the ventricles is constrained by the availability of blood arriving in the left cardiac chamber. Thus, LCO depends on the BP within the tract between the pulmonary capillaries and the left ventricle (PCtoLVcirc) ([39], pp. 405–407), likewise RCO depends on the BP within the tract between the capillaries and the right ventricle (CBRVcirc) ([39], p. 408). The amount of blood entering the left and right heart do influences the pressure within two circulatory tracts, respectively, the vessels between the PctoLVcirc and the vessels between the CBRVcirc. Two nodes represent the amount of fluid exchanged with the external environment. The first is labelled blood volume (BV), being affected by the balance between the water intake (WI) and the urinary output (UO). As such, it is supposed to influence the pressure within the systemic venous tract (CBRVcirc) ([39], pp. 561–562). The second is labelled UO ([39], p. 574), which in turn depends on the BP occurring in the systemic arterial tract (LVtoCBcirc). Furthermore, some physiological mechanisms by which the organism restores the corrupted blood flow were contemplated. The model already accounts for a decrease in UO when the arterial BP is dropped to restore the normal pressure within the systemic arteries (LvtoCBcirc) ([39], p. 478). In addition, the neurovegetative control (SS) over both heart beat frequency and the systemic arterial resistance (VR) was also represented ([39], pp. 414–416). The node SS is sensitive to arterial BP (BP is regarded as a manifestation of LVtoCBcirc) and it has, in turn, an impact on both arterial vascular resistance (VR) and HF ([39], pp. 417–418). Arterial VR might increase the systemic arterial pressure (LvtoCBcirc) ([39], p. 478). A node representing the persistence of symphatic neurovegetative activation (SS-pers) was included to account for the impairment of such a control mechanism when it lasts for too long (see the down-regulation described in [39], p. 440). As such, SS-pers is influenced by the SS node, while it influences the VR node. Some variables show the status of the cardiovascular system to a medical observer, specifically, whether some of its tract is stagnant. Pulmonary congestion (PulmCong) might be the result of stagnation in the pulmonary venous tract (RvtoPCcirc), whereby peripheral congestion (PeriphCong) might be the result of stagnation in the systemic venous tract (CBRVcirc) (see Right-Sided vs. Left-Sided Heart Failure in [39], p. 473). These two phenomena manifest themselves respectively with shortness of breath (Dispn) ([39], pp. 475–477) and pleural effusion (PleuEff) on one side (see hydrothorax in [39], p. 480), and pedal edema (LPE) on the other side (see edema in [39], p. 480). The first scenario can be revealed directly by a low partial pressure of arterial oxygen (O2pa) (see forward failure [39], p. 472), whereas a severe reduction of blood perfusion gives rise to a fatal complicating condition known as shock (Shock) ([39], pp. 561–563). Heart failure is said to be cardiogenic when the cardiac muscle (Pump) is the organ from which the circulatory failure was triggered. In turn, acute myocardial infarction (AMI) might be the cause of cardiac impairment, although in most instances it is not. As any other infarction, AMI is due to lack of arterial perfusion of the organ tissues. In case of AMI, obstruction of coronary arteries (CorObstr) occurs, whose blood supply comes from the main arterial system (LvtoCBcirc) (see ischemic heart disease [39], p. 435). One well known manifestation of coronary obstruction (CorObstr) is an intense chest pain ([39], pp. 1226–1228), called angina pectoris (Angor) ([39], p. 1235). Only when obstruction is both severe and lasting is there infarction, it manifests with the increase of cardiac enzymes (CardEnz) in the blood stream ([39], pp. 1239–1240) and, in functional terms, the impairment of the cardiac pump (Pump) ([39], p. 1230). In turn, intense pain stimulates the neurovegetative system with an increment of sympathetic activity and, therefore, of BP ([39], p. 1237) and HF ([39], p. 1238).

The quantitative component of the CTBN model, i.e. the CIM parameters, were elicited on the basis of the medical expertise of one of the authors (DL). Since each CIM includes a large number of parameters, whose interpretation is also far from being trivial, the attention was diverted on the parameters of the conditional probability tables (CPTs) that within a time interval of 10 s represent the impact of the parents on each node as their correspondent CIMs would do in continuous time. To further reduce the number of quantities to elicit, this task was accomplished in two steps. The first concerned the elicitation for each node of a conditional probability distribution based on a small number of parameters. The second addressed the quantification of the parameters. The time-interval of 10 s was deemed short enough to capture interesting dynamics, whereby the periodical changes of some physiological variables like cardiac alternation of the systole and diastole phases could be neglected. The distribution probabilities over all the parents’ combinations for each node were parameterized in terms of well-known functions (Noisy-Or-Gate and multivariate Gaussian), according to the type of variable (discrete, binary, continuous) and considering whether an interaction among the parents was known to occur. Whenever the assumptions underlying a parametric distribution were found to be not consistent with medical knowledge on a specific node-parents relationship, the CPT was generated by a mixture of parametric distributions, each defined by a specialized set of parameters conditioned by combinations of the parents. The whole procedure from the Noisy-Or-Gate to the corresponding CIM for the node Pump is depicted in Fig. 2.

3.2 Inference

To validate the model, we enter it with a set of patient observations whose explanations and consequences do generally appear straightforward to the medical profession. The current analysis encompasses the impact of clinical manifestations, i.e. BP, Dispn, HF, Angor and LPE, whereas associated laboratory or imaging observations, like CardEnz, O2pa, PulmCong and PleuEff, were only predicted along with other relevant outcomes, like the potential occurrence of shock (Shock). Since manifestations are derived from patient monitoring, they are referred to a time interval. For the purpose of our analysis, all cases are assumed to be normally hydrated cases, so WI was always kept to the normal state (mid).

In light of the above consideration, the following three scenarios should provide some evidence on the ability of the model to explain the simulated observation and to predict their potential consequences.

3.2.1 Scenario 1

The patient shows low BP (BP = low), an increased HF (HF = high), no chest pain (Angor = absent), pedal edema (LPE = present) together with shortness of breath (Dispn = present). All these manifestations last for 5 h. Therefore, the CTBN model is queried with the following interval evidence;

$$ [BP= low, HF= high, Angor= absent, LPE= present, Dispn= present, WI= mid], $$

for the time interval from 0 to 5 h, and with the interval evidence [WI = mid], for the time interval from 5 to 6 h.

The occurrence of pedal edema (LPE = present) and shortness of breath (Disp = present) would make the doctor keen on the diagnosis of congestive heart failure, involving both the right and the left heart side. The absence of angor (Angor = absent) would make a diagnosis of AMI very unlikely. The doctor is aware that such a condition, if left untreated, could lead to shock. Conditionally on the above interval evidence, the posterior probability of shock (Shock = present) attains the highest peak at the end of the observations, reaching a posterior probability value equal to 0.59. A node belonging to the body internal state of the CTBN model for the AMI (Fig. 1), i.e. the node Pump shows a reduced pump strength. Indeed, the posterior probability value associated with Pump = reduced is equal to 0.97 one hour after the initial observations (Fig. 3). Instead, the posterior probability value associated with the AMI (AMI = present) remains low (<0.01) during the whole period of interest (Fig. 4). This means that the patient is affected by primary congestive heart failure, whereas the adjective primary refers to a disease that is not the secondary result of another disease. The increased probability value of the low UO (UO = low) (0.22 at the end of the observations) and the likely absence of cardiac enzymes (CardEnz = present) (<0.015) reinforces the above diagnosis.

3.2.2 Scenario 2

The patient shows normal BP (BP = mid), increased HF (HF = high) and substernal chest pain (Angor = present). The patient does not show pedal edema (LPE = absent), nor shortness of breath (Dispn = absent). These manifestations are supposed to last for 45 min. Therefore, the CTBN model is queried with the following interval evidence:

$$ [BP=mid, HF=high, Angor= present, LPE=absent, Dispn=absent, WI=mid], $$

for the time interval from 0 to 45 min ([0, 0.75)), and with the following interval evidence [WI = mid], for the time interval from 45 min to 6 h ([0.75, 6)).

Angor persisting (Angor = present) for more than half an hour, is a classical marker of AMI. Therefore, the pump strength is expected to be unaffected, since there are no signs of heart failure (LPE = absent and Disp = absent). The model predicts, even after 1 h, a very low probability value (0.0052) of shock being present (Shock = present) (Fig. 5). The probability of AMI (AMI = present) becomes as high as 0.45 after half an hour of persisting angor, but it decreases after the end of chest pain (from 0.49 to <0.02 after 15 min) (Fig. 5). Cardiac enzymes follow a similar evolution. The probability of a reduced pump strength (Pump = reduced) remains low (<0.10) (Fig. 5) during the time interval, likewise the probability of its associated manifestations, e.g. UO.

3.2.3 Scenario 3

The patient shows normal BP (BP = mid), with increased HF (HF = high) and chest pain (Angor = present). Like in scenario 2, the patient does not show shortness of breath (Dispn = absent) nor pedal edema (LPE = absent). However, the manifestations last only for 15 min. After this interval, the angina disappears (Angor = absent) for the next 15 min. Therefore, the CTBN model is queried with the following interval evidence;

$$ [BP=mid, HF= high, Angor= present, LPE=absent, Dispn=absent, WI=mid], $$

for the time interval from 0 to 15 min ([0, 0.25)), with the following interval evidence

$$ [BP=mid, HF=high, Angor= absent, LPE=absent, Dispn=absent, WI=mid], $$

for the time interval from 15 to 30 min ([0.25, 0.50)), and with the interval evidence [WI = mid], for the time interval from 30 min to 6 h ([0.50, 6)).

Physicians regard the occurrence of angor as a threatening condition because of its association with coronary obstruction, the cause of myocardial infarction. However, from a clinical point of view, when the chest pain does not persist for at least half an hour, the occurrence of AMI is unlikely. According to the model, heart failure and shock are unlikely events. In the following hour, the posterior probability of shock (Shock = present) remains low (<0.005), likewise the probability of any other abnormal state (Fig. 6).

4 Discussion

The temporal dimension is an essential feature of medical reasoning and decision making. The diagnosis may take advantage from knowing the persistence of observations, and therapy may be optimized in light of the likely future evolution of the medical disorder by anticipating complicating diseases. The epidemiological relevance of heart failure and the usefulness of accurate predictions in the correct management of such an evolving disorder is confirmed by other contributions addressed to the formal representation of the disorder. Although methodologically different, they are all attempts to provide the problem with a quantitative analysis to be exploited in the medical practice. For instance, the Seattle Heart Failure Model is based on a survival model [40] and is probably the first computer-based application to translate medications and devices that a heart failure patient receives in predicted years of survival. However, this model does not represent the process by which the outcomes are affected and, like most multivariate statistical analysis, it is focused on the evolution of chronic heart failure, not on episodes of its reacutization [41].

Bayesian reasoning and inference procedures have gained popularity in the fusion of information obtained from different sources. Perhaps their greatest potential in the clinical setting is to provide a pathophysiological interpretation of events that might be variably accessible to observations. An influence diagram has been proposed to predict heart contractility dysfunctions reflected in the condition of systolic heart failure [42]. Although the model is already structured as a decision support system, it is based on a static BN representation; this way it skips the complexity of inference along the temporal dimension. In spite of the prevalence of proportional hazard models as prognostic models in medicine, DBNs have been also proposed to take advantage of the causal and temporal nature of medical domain knowledge as elicited from domain experts [43].

To the best of the authors’ knowledge, there has been only one attempt to model the evolution of heart failure by means of DBNs [8]. The network is based on a time granularity of minutes, rather than seconds like in our application. While this interval can offer a summarised picture on how the disorder evolves, it is also likely to affect the consistency of the dynamic to represent. On the other hand, BNs do not provide direct mechanisms for representing temporal dependencies, so any DBN representation, resulting from the assemblage of several BNs for each time of interest, tends to become rapidly intractable when applied to large but realistic domains [44]. The CTBNs framework overcomes most of the difficulties presented above, making it possible to elaborate inference on medical problems where the temporal information about a set of manifestations is available from clinical reports or monitoring instrumentation. As such, it might represent a significant improvement over DBNs.

The validity of the qualitative component of the proposed model was addressed by showing the consistency of the graphical structure of the CTBN with a medical textbook of cardiology [39]. Medical expertise was exploited to define the quantitative component, the elicitation of which took advantage of the preliminary reduction in the parameters underlying the CIMs. Further research is needed to provide a quantitative assessment of the predicted probabilities, a task which has been proven to be challenging for any probabilistic expert systems, given that data on large domains are generally lacking [45]. Notwithstanding, the clinical scenario offers several clues on the validity of quantitative predictions in the light of what medical doctors would expect given the selected patient manifestations. Of note, those predictions were achieved by means of ordinary hardware resources.

The comparison of Scenario 2 and Scenario 3 allows us to appreciate the impact of evidence known to be relevant for the occurrence of heart failure, although the reason of failure could be different. The first case study shows the typical consequences of a congestive heart failure, whereby the second patient shows symptoms of one potential cause of heart failure, i.e. AMI. In Scenario 1, the model correctly detects a primitive pump deficit as the cause of heart failure, anticipating shock as a likely future complication. Instead, there are no reasons to hypothesize a pump deficit as a secondary consequence of AMI. In Scenario 2, because the probability of heart failure is low and there are no symptoms of heart failure, the model correctly shows an uncomplicated AMI as the most likely diagnosis. Scenario 2 and Scenario 3 show the same set of manifestations, but their comparison allows us to appreciate the impact of duration of pathological events. Physicians are aware that substernal chest pain is a symptom of coronary obstruction, whose impact on the myocardial tissue depends on the persistence of obstruction. Since an interval of 30 min is generally regarded as the trade off over which the occurrence of infarction becomes more likely than a simple angina episode, the model correctly discriminates the underlying diagnostic explanations of the two cases.

Finally, at the current stage of their development, CTBNs do not encompass an explicit decision analysis. Optimal options in temporal domains are particularly complex to compute. Even if the problem encompasses the selection of a single decision, the latter can nevertheless be affected by the future candidate decisions [7]. Like in [25], we rest on the inferential ability to compute the uncertainties on the main clinical variables, leaving to the doctor the choice of making the most appropriate decision in light of the quantitative updating of both diagnostic and prognostic judgements.

5 Conclusions

In this paper the authors have described the first clinical application of developments in the research area of continuous time graphical models. This approach allows a direct representation of time and offers a valid computational machinery for medical inference.

The predictions emerging from the three scenarios have confirmed the heuristic power of the proposed framework and have allowed a quantitative evaluation of the expected time before each variable changes its state. The proposed model has then the potential to be used for diagnostic purposes, as well as to develop a strategic plan to reduce the risk associated with each patient treatment.

Additional improvements are needed to turn the CTBN on cardiogenic heart failure into a practical medical tool. Quantitative parameters might be further tuned to achieve posterior probabilities that better fit with expectations derived from pathophysiological knowledge. This could be achieved by learning the CIMs directly from clinical data.

The usefulness of the CTBN could be further increased with the embedding of the CTBN model into a DSS which assists the clinician to choose and to apply the correct therapy. However, a decision analysis would preliminarily call for the computability of posterior probabilities of models at least as complex as the one presented. Thus, we anticipate the usefulness of CTBNs in clinical domains where, like in the case of heart failure, there is growing interest in quantitative predictions.

References

European Commission (2008) Ict-bio 2008: conference report. Computer modelling and simulation for improving human health
Lipscombe B (1989) Expert systems and computer-controlled decision making in medicine. AI & Society 3(3):184–197
Article Google Scholar
Sim I, Gorman P, Greenes RA, Haynes RB, Kaplan B, Lehmann H, Tang PC (2001) Clinical decision support systems for the practice of evidence-based medicine. J Am Med Inf Assoc 5(6):527–534
Article Google Scholar
Garg AX, Adhikari NKJ, McDonald H, Rosas-Arellano MP, Devereaux PJ, Beyene J, Sam J, Haynes RB (2005) Effects of computerized clinical decision support systems on practitioner performance and patient outcomes. A systematic review. J Am Med Assoc 293(10):1223–1238
Article Google Scholar
Burstein F, Burstein C, Holsapple W (2008) Handbook on decision support systems 2: variations. Springer, Heidelberg
Google Scholar
Pearl J (1988) Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann Publishers Inc., San Francisco
Google Scholar
Jensen FV (2001) Bayesian networks and decision graphs. Springer, New York
MATH Google Scholar
Hulst J (2006) Modeling physiological processes with dynamic Bayesian networks. PhD thesis, Delft University of Technology
Rose C, Cherif Smaili C, Charpillet F (2005) A dynamic Bayesian network for handling uncertainty in a decision support system adapted to the monitoring of patients treated by hemodialysis. In: Proceedings of the 17th IEEE international conference on tools with artificial intelligence, pp 594–598
Howard R (1990) From influence to relevance to knowledge. In: Oliver RM, Smith JQ (eds). Influence diagrams, belief nets, and decision analysis. John Wiley and Sons, pp 3–23
Xiang Y, Pant B, Eisen A, Beddoes MP, Poole D (1993) Multiply sectioned Bayesian networks for neuromuscular diagnosis. Artif Intell Med 5(4):293–314
Article Google Scholar
Jaakkola T, Jordan MI (1999) Variational probabilistic inference and the qmr-dt network. J Artif Intell Res 10(1):291–322
MATH Google Scholar
Heckerman DE, Horvitz EJ, Nathwani BN (1992) Toward normative expert systems: part I. The pathfinder project. Methods Inf Med 31(2):90–105
Google Scholar
Xiang Y, Eisen A, MacNeil M, Beddoes MP (1992) Quality control in nerve conduction studies with coupled knowledge-based system approach. Muscle Nerve 15(2):180–187
Article Google Scholar
First MB, Weimer BJ, McLinden S, Miller RA (1982) Localize: computer-assisted localization of peripheral nervous system lesions. Comput Biomed Res 15(6):525–543
Article Google Scholar
Vila A, Ziebelin D, Reymond F (1985) Experimental emg expert system as an aid in diagnosis. Electroencephalogr Clin Neurophysiol 3(61):S240
Google Scholar
Gallardo R, Gallardo M, Nodarse A, Luis S, Estrada R, Garcia L, Padron O (1987) Artificial intelligence in the electromyographic diagnosis of cervical roots and brachial plexus lesions. Electroencephalogr Clin Neurophysiol 66:37
Google Scholar
Rialle V, Vila A, Besnard Y (1991) Heterogeneus knowledge representation using a finite automaton and first order logic: a case study in electromyography. Artif Intell Med 3(2):65–74
Article Google Scholar
Fuglsang-Frederiksen A, Ronager J, Vingtof S (1989) Pc-kandid: an expert system for electromyography. Artif Intell Med 3(1):117–124
Article Google Scholar
Olesen KG, Kjaerulff Y, Jensen F, Jensen FV, Falck B, Andreassen S, Andersen SK (1989) A munin network for the median nerve—a case study on loops. Appl Artif Intell 3(2–3):385–403
Article Google Scholar
Boutilier C, Dean T, Hanks S (1999) Decision-theoretic planning: structural assumptions and computational leverage. J Artif Intell Res 11:1–94
MathSciNet MATH Google Scholar
Peek N (1999) Explicit temporal models for decision-theoretic planning of clinical management. Artif Intell Med 15(2):135–154
Article Google Scholar
Provan GM (1993) Tradeoffs in constructing and evaluating temporal influence diagrams. In: Uncertainty in Artificial Intelligence, Proceedings of the ninth conference. Morgan Kaufmann Publisher, San Mateo, CA, pp 40–47
Hovorka R, Benn J, Olesen KG, Carson ER, Andreassen S (1991) A model-based approach to insulin adjustment. In: Proceedings of the 3rd conference on artificial intelligence in medicine, pp 239–248
Charitos T, Van der Gaag LC, Visscher S, Schurink K, Lucas PJF (2009) A dynamic Bayesian network for diagnosing ventilator-associated pneumonia in ICU patients. Expert Syst Appl 36(2):1249–1258
Article Google Scholar
Dean T, Kanazawa K (1990) A model for reasoning about persistence and causation. Comput Intell 5(3):142–150
Google Scholar
Nodelman U, Shelton C, Koller D (2002) Continuous time Bayesian networks. In: Proceedings of the eighteenth conference on uncertainty in artificial intelligence, pp 378–387
McMurray JJV, Pfeffer MA (2005) Heart failure. Lancet 356(9474):1877–1889
Article Google Scholar
Nodelman U, Horvitz E (2003) Continuous time Bayesian networks for inferring users’ presence and activities with extensions for modeling and evaluation. Technical Report MSR-TR-2003-97, Microsoft Research, One Microsoft Way, Richmond, WA 98052, USA
Boudali H, Dugan J (2006) A continuous-time Bayesian network reliability modeling and analysis framework. IEEE Trans Reliab 55(1):86–97
Article Google Scholar
Xu J, Shelton CR (2008) Continuous time Bayesian networks for host level network intrusion detection. In: ECML/PKDD No. 2, 613–627
Fan Y, Shelton CR (2009) Learning continuous-time social network dynamics. In: Proceedings of the twenty-fifth international conference on uncertainty in artificial intelligence
Nodelman U, Koller D, Shelton CR (2005) Expectation propagation for continuous time Bayesian networks. In: Proceedings of the twenty-first international conference on uncertainty in artificial intelligence, pp 431–440
Suchi S, Nodelman U, Koller D (2005) Reasoning at the right time granularity. In: Proceedings of the twenty-first conference on uncertainty in artificial intelligence, Edinburgh, Scotland, UK, pp 421–430
Fan Y, Shelton CR (2008) Sampling for approximate inference in continuous time Bayesian networks. In: Tenth international symposium on artificial intelligence and mathematics
El-Hay T, Friedman N, Kupferman R (2008) Gibbs sampling in factorized continuous-time markov processes. In: Proceedings of the eighteenth conference on uncertainty in artificial intelligence, pp 378–387
Shelton CR, Fan Y, Lam W, Xu JLJ (2010) Continuous time Bayesian network reasoning and learning engine. J Mach Learn Res 11:1137–1140
MATH Google Scholar
Harvey W (1995) In: Keynes G (ed) The anatomical exercises: De Motu Cordis and De Circulatione Saguinis in english translation. Dover Publications, Mineola, New York
Braunwald E (1988) Heart disease: a textbook of cardiovascular medicine edited by Eugene Braunwald, 3rd edn. Saunders, Philadelphia
Levy W, Mozaffarian D, Linker D, Sutradhar S, Anker S, Cropp A, Anand I, Maggioni A, Burton P, Sullivan M, Pitt B, Poole-Wilson P, Mann D, Packer M (2006) The Seattle heart failure model. Circulation 113:1424–1433
Google Scholar
Brophy JM, Dagenais GR, McSherry F, Williford W, Yusuf S (2004) A multivariate model for predicting mortality in patients with heart failure and systolic dysfunction. Am J Med 116(5):300–304
Article Google Scholar
Fernandez J, Martinez-Selles M, Arredondo M (2004) Bayesian networks and influence diagrams as valid decision support tools in systolic heart failure management. Comput Cardiol 31:181–184
Article Google Scholar
van Gerven M, Taal BG, Lucas PJF (2008) Dynamic Bayesian networks as prognostic models for clinical patient management. J Biomed Inf 41(4):515–529
Article Google Scholar
Kjaerullf U (1994) Dhugin: a computational system for dynamic time-sliced Bayesian networks. Int J Forecast 11:89–111
Article Google Scholar
Hoepelman IM, Bonten MJ, Schurink CA, Lucas PJ (2005) Computer-assisted decision support for the diagnosis and treatment of infectious diseases in intensive care units. Lancet Infect Dis 5(5):305–312
Article Google Scholar

Download references

Acknowledgments

The authors are grateful to the anonymous referees whose precious comments contributed to improve the quality and the clarity of the paper.

Author information

Authors and Affiliations

DISCo, Università degli Studi di Milano-Bicocca, Viale Sarca 336, 20124, Milano, Italy
E. Gatti & F. Stella
Laboratorio di Epidemiologia Clinica, Istituto di Ricerche Farmacologiche Mario Negri, Via La Masa 19, 20156, Milano, Italy
D. Luciani

Authors

E. Gatti
View author publications
You can also search for this author in PubMed Google Scholar
D. Luciani
View author publications
You can also search for this author in PubMed Google Scholar
F. Stella
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to F. Stella.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gatti, E., Luciani, D. & Stella, F. A continuous time Bayesian network model for cardiogenic heart failure. Flex Serv Manuf J 24, 496–515 (2012). https://doi.org/10.1007/s10696-011-9131-2

Download citation

Published: 08 December 2011
Issue Date: December 2012
DOI: https://doi.org/10.1007/s10696-011-9131-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A continuous time Bayesian network model for cardiogenic heart failure

Abstract

Similar content being viewed by others