1 Introduction

The aim of Cardiotocography (CTG) is to screen for signs of fetal distress, allowing obstetricians to react in a timely fashion and prevent potential adverse outcomes for the fetus. CTG recordings, also referred to as Cardiotocograms, consist of two signals: the fetal heart rate (FHR), measured in beats per minute (bpm), and the uterine contractions (UCs), measured either in mmHg or in arbitrary units. CTG remains the only technique used worldwide that can provide continuous information about the state of the fetus during delivery [1, 2]. One of the reasons for the adoption of CTG is the sense of “security” that it offers to clinicians by providing online, real-time monitoring of the fetus. On the other hand, even though the continuous information of the Cardiotocogram is an improvement over the previously used intermittent auscultation (IA), its evaluation is hindered by the large variance of the responses of individual fetuses to stress situations.

In clinical practice the evaluation of Cardiotocograms relies primarily on visual inspection and assessment following guidelines, which usually stem from those issued by the International Federation of Gynecology and Obstetrics (FIGO guidelines [3]). Nevertheless, despite the existence of specific guidelines, Cardiotocogram interpretation suffers from high inter- and intra-observer variability among clinicians [4, 5]. Moreover, CTG is cited as one of the reasons behind the increase in the number of Caesarean sections [6]. On the other hand, since there are currently no significantly new approaches to fetal monitoring during delivery on the horizon, CTG is here to stay regardless of its flaws. To mitigate the variability in the evaluation of Cardiotocograms, two major remedies have been proposed [7]: i) extensive training of the clinical staff or ii) use of computerized systems for decision support. In the latter case, the evaluation problem is usually cast as a classification task, where the classes for CTG evaluation are primarily based on umbilical artery cord blood analysis. The most common approach is to use the pH value with an a priori defined threshold to distinguish between acidotic and non-acidotic fetuses.

It should be noted that the idea of developing computerized systems for Cardiotocogram evaluation is quite old, even preceding the release of the general FIGO guidelines [8]. Since these early works, many more approaches have been proposed, ranging from systems that emulate the FIGO guidelines [9] to systems that rely on the extraction of features using advanced signal processing techniques coupled with advanced classification algorithms. Many of these features are based on the well-established adult Heart Rate Variability (HRV) research [10]; others come from the statistical analysis of the FHR [11] or from the nonlinear domain [12, 13]. Time-scale descriptors [14], features based on Empirical Mode Decomposition (EMD) [15], and even artificially generated features have also been proposed [16] and tested. Additionally, modeling of the FHR and its behavior in relation to contractions has also been investigated [17]. For the classification part, methods such as Support Vector Machines (SVMs) [13, 14, 17,18,19], Artificial Neural Networks (ANNs) [11, 20,21,22], Generative Models (GMs) [23], Fuzzy Systems [24, 25], and Hidden Markov Models (HMMs) [26] have been tested, among other computational systems.

Regarding the studies that use pH values to define the respective classes, in [23] different GMs are trained using features calculated over consecutive windows which are then turned into symbolic sequences. The combination of a Naïve Bayes GM with a first-order Markov chain GM outperforms conventional discriminative approaches using SVMs and crisp rules when the threshold of normality is set to pH equal to 7.15, achieving a sensitivity of 60.9% and a specificity of 81.7%. The use of symbolic representation compensates for the high computational cost related to GMs. In [19] a combination of a Genetic Algorithm (GA) for feature selection with three different classifiers is used for the discrimination of normal and pathological cases. Cases with pH < 7.05 are considered pathological while cases with 7.27 < pH < 7.33 are considered normal. The GA with an SVM classifier with Radial Basis Function (RBF) kernels, regularized using the Bayesian information criterion (BIC), outperforms all other combinations, achieving a sensitivity of 83.02% and a specificity of 66.03%. An ensemble/committee of ANNs is trained in [21] using normal and pathological cases that are defined slightly differently between the training and testing data sets (training: pathological pH < 7.1, normal 7.27 < pH < 7.33; testing: pathological pH < 7.1, normal 7.22 < pH < 7.27). Principal Component Analysis (PCA) is applied to reduce the dimension from 47 to just six and increase computational efficiency. A sensitivity of 60.3% and a specificity of 67.5% are reached.

It must be noted that the aforementioned works use different databases, which in turn differ in many parameters: size (40–500 records), fraction of pathological cases, time until delivery, and the part of the FHR signal used for analysis (e.g. with or without the second stage of labor). More importantly, these works use different pH thresholds, while some of them use more than one threshold to define the class boundaries. In the present study a single threshold is used. This threshold is set to 7.05, which is the setup used in the majority of technical papers on CTG classification [1, 27,28,29,30], because it provides a sufficient compromise between the number of pathological cases that are considered and the number of complications related to the health status of the fetus [31,32,33].

More specifically, this study presents the results of FHR classification on the unique open access intrapartum CTU-UHB CTG database [33]. Both the involved set of features, which covers different domains, and the classification algorithm, which is based on a computationally efficient variant of SVMs, can be considered among the current state-of-the-art methods. A two-stage feature selection procedure is used, which significantly reduces the number of involved features, further increasing the computational efficiency of the approach. To the best of our knowledge this is the most extensive testing performed on this data set, and as such the achieved results can be used as a benchmark for the evaluation of other features and/or other classification algorithms tested on this database. For comparison purposes, and in order to check whether the use of the aforementioned advanced data processing methods offers an advantage over simpler approaches, the proposed method is also tested against a simpler classification scheme.

The rest of the paper is structured as follows: Section 2 briefly describes the methods employed in this work, from signal preprocessing and feature extraction to classification. Section 3 presents the results along with a discussion of the effect of the number of selected features, and Section 4 concludes the paper with some directions for possible future research.

2 Materials and methods

In this work the newly released CTU-UHB CTG database [33] is used. The proposed method classifies all the recordings using primarily the FHR signal and consists of the following steps: FHR preprocessing; Feature extraction; Feature ranking / selection and Classification. Matlab 2012b (The Mathworks, Inc.) is used to analyze the data. In the rest of this section all the involved steps are presented along with a description of the CTU-UHB CTG dataset.

2.1 Data set

The open access CTU-UHB CTG database [33] consists of 552 records, a subset of 9164 intrapartum CTG recordings acquired between the years 2009 and 2012 at the obstetrics ward of the University Hospital in Brno, Czech Republic. All women signed informed consent and the study for the data collection was approved by the Institutional Review Board of University Hospital Brno. All CTG recordings and clinical data were anonymized. For this study the last 30 min of the first stage of labor are selected.

Clinical parameters were used to achieve as consistent a database as possible: only fetuses with more than 37 completed weeks of gestation and singleton pregnancies were included; all fetuses with known intrauterine growth restriction, fetal infection, or congenital malformations were excluded; and only recordings that ended less than 20 min (median 5 min) before delivery were selected for the database. Expressed as mean (min, max), the gap between the end of the actual CTG signal and the time of birth was 2.70 min (0, 29), the length of the first stage of labor was 225 min (45, 648), and the length of the second stage of labor was 11.87 min (0, 30).

From the 552 recordings, 44 have a pH value lower than or equal to 7.05, which is the borderline selected for defining the two classes in this study. These 44 cases therefore constitute the abnormal class, while the remaining 508 constitute the normal class, cf. Figure 1 for sample records. More details about the database and its construction can be found in [33].
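As a minimal illustration of how the class labels are formed, the thresholding can be written as below; the encoding into +1/−1 (used later by the classifier) and the function name are assumptions of this sketch, not something specified in the text.

```python
import numpy as np

def make_labels(ph, threshold=7.05):
    """Label recordings by umbilical-artery pH: +1 abnormal, -1 normal."""
    return np.where(np.asarray(ph) <= threshold, 1, -1)

# On CTU-UHB this yields 44 abnormal (pH <= 7.05) and 508 normal cases.
```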

Fig. 1 Typical FHR records for normal and abnormal cases. As can be seen, the FHR is a very irregular signal, which requires a high degree of expertise to be interpreted correctly

2.2 FHR preprocessing

The FHR signals were obtained either directly using a Doppler ultrasound (US) probe placed on the mother’s abdomen, or from the direct electrocardiogram (DECG) measured internally by an electrode attached to the fetal scalp. Due to the acquisition method, the FHR can be “contaminated” by spiky artifacts (impulsive noise) or contain periods where the FHR is zeroed. Spiky artifacts in the FHR as well as missing values can affect the values of the extracted features. Therefore a simple artifact rejection scheme is employed: first, extreme (non-physiological) values (>200 bpm and <50 bpm) are removed as in [9]; second, Hermite spline interpolation is applied to fill the gaps of missing values [34]. We note that long gaps (>15 s) are not included in the subsequent feature extraction process. Despite its simplicity, this kind of artifact removal scheme is an established preprocessing step before further analysis can take place [35], even though more elaborate techniques have been proposed over the past years [36, 37].
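A minimal Python sketch of this preprocessing step is given below (the original analysis was carried out in MATLAB). The 4 Hz sampling rate and the use of SciPy’s PCHIP interpolant as a stand-in for the Hermite spline are illustrative assumptions.

```python
import numpy as np
from scipy.interpolate import PchipInterpolator

def clean_fhr(fhr, fs=4.0, lo=50.0, hi=200.0, max_gap_s=15.0):
    """Remove non-physiological FHR samples and interpolate short gaps only."""
    x = np.asarray(fhr, dtype=float).copy()
    x[(x <= 0) | (x < lo) | (x > hi)] = np.nan        # spikes and zeroed segments

    valid = ~np.isnan(x)
    if valid.sum() < 2:
        return x
    idx = np.arange(x.size)
    spline = PchipInterpolator(idx[valid], x[valid])  # piecewise cubic Hermite interpolant

    out = x.copy()
    missing = np.isnan(x)
    starts = np.flatnonzero(missing & ~np.r_[False, missing[:-1]])
    ends = np.flatnonzero(missing & ~np.r_[missing[1:], False])
    max_gap = int(max_gap_s * fs)
    for s, e in zip(starts, ends):
        if (e - s + 1) <= max_gap:                    # fill gaps shorter than 15 s
            out[s:e + 1] = spline(idx[s:e + 1])
    return out                                        # longer gaps remain NaN (excluded later)
```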

On average, 13.85% of the total duration of the 30 min segment consists of noisy data (artifacts, including extreme values and missing data in gaps shorter than 15 s), with a minimum value of 0% and a maximum of 49%. Therefore, on average, the data set can be considered relatively clean, since in general noisy/missing data can amount to about 20%–40% of the total data length.

2.3 Feature extraction

Feature extraction is probably the most important step in any classification problem, since an informative set of features makes the subsequent classification stage a much easier process. In this work a mixture of features is utilized: in total 54 features are used, derived from 21 basic features by varying some of their internal parameters, and summarized in Table 1.

Table 1 Features used in the presented work

Different feature domains represent different points of view of the CTG, ranging from FIGO-based features that try to emulate the information extractable by eye, to time domain features that are readily understandable to clinicians yet impossible to assess by the naked eye, to more complex feature domains that quantify the signal using frequency and nonlinear analysis tools. These latter approaches are well established for the analysis of adult HRV and are expected to perform well also in the case of the FHR. All features are quite common in FHR analysis studies, have already been described in other publications [13, 18, 49], and cover the following areas:

  • FIGO-based features: baseline, number of accelerations/decelerations, and long term variability. In this work the FIGO-based features, which describe the macroscopic properties of the FHR, are extracted using the algorithms proposed in [50].

  • Time domain features: quantifying Short Term Variability (STV) and Long Term Irregularity (LTI).

  • Frequency domain: energy in different frequency bands. These features are believed to capture the balance between the two branches of the autonomic nervous system. A non-parametric Welch periodogram was used for the power spectral density (PSD) estimation (parameters: Gaussian-like window of 1024 samples and 80% overlap). The energy in the frequency bands was computed using the band definitions of [10, 41] (a sketch of this computation is given after this list).

  • Nonlinear domain: fractal dimensions, Detrended Fluctuation Analysis (DFA), entropy measures, Lempel-Ziv complexity, and the Poincaré plot (embedding of RR_n intervals vs. RR_{n+1}, dimension m = 2, time delay τ = 1). All these features try to quantify the complexity of the signal under investigation.
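To make two of these computations concrete, the sketch below estimates the Welch-based band energies and the Poincaré SD1/SD2 descriptors in Python. The band edges, the Gaussian window shape, and the 4 Hz sampling rate are illustrative assumptions rather than the exact settings of the cited references.

```python
import numpy as np
from scipy.signal import welch, get_window

# Approximate FHR band limits in Hz (illustrative; see the cited references for exact edges)
BANDS = {"VLF": (0.0, 0.03), "LF": (0.03, 0.15), "MF": (0.15, 0.5), "HF": (0.5, 1.0)}

def band_energies(fhr, fs=4.0, nwin=1024, overlap=0.8):
    """Welch PSD with a Gaussian-like window, then energy per frequency band."""
    win = get_window(("gaussian", nwin / 6), nwin)
    f, pxx = welch(fhr - np.mean(fhr), fs=fs, window=win,
                   noverlap=int(overlap * nwin))
    df = f[1] - f[0]
    return {name: pxx[(f >= lo) & (f < hi)].sum() * df
            for name, (lo, hi) in BANDS.items()}

def poincare_sd(rr):
    """SD1/SD2 of the Poincaré plot of RR_n vs. RR_{n+1}."""
    x, y = np.asarray(rr[:-1]), np.asarray(rr[1:])
    sd1 = np.sqrt(np.var(y - x) / 2.0)   # spread perpendicular to the line y = x
    sd2 = np.sqrt(np.var(y + x) / 2.0)   # spread along the line y = x (SD2)
    return sd1, sd2
```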

2.4 Feature selection

Usually, the feature extraction step creates a large number of features. However, as in almost all classification problems, not all of the extracted features are necessary for the classification task at hand, because some of the features may convey overlapping information or may not be as informative as expected. Therefore a feature selection stage, or a dimensionality reduction stage, is usually applied before the final classification algorithm [51]. This stage significantly decreases the time required for building/training a classifier, thereby increasing computational efficiency, while at the same time it may improve the generalization capability of the classifier [51].

To put it more formally, the task of feature selection for classification can be described as follows: given an initial set of N_F features, select a subset of j features, where j ≪ N_F, retaining as much of their class-discriminatory information as possible. The choice of a suitable subset of the features can allow the classifier to reach near-optimal performance, which is a key step for any machine learning algorithm.

Feature selection algorithms can be divided into three general categories [52, 53]: filters, which do not require a learning algorithm; wrappers, which use a classification algorithm as part of the selection procedure; and embedded methods, where feature selection takes place at the same time as the classifier is built. Each of these methods has its pros and cons.

In this work, a hybrid approach is used, consisting of a filtering stage followed by a stage where visual inspection (based on experts’ feedback) is used to select the number of features to be included in the subset. Both steps of the feature selection process are described in the rest of this section.

The first step involves a “filtering” process, during which a measure is used to evaluate the effectiveness of each individual feature in predicting the class of each sample/example. The features are then ranked based on that measure, from the most helpful to the least helpful one. This method is very computationally efficient and is best used in conjunction with an advanced classifier or with a subsequent fine-tuning selection, e.g. using a wrapper, to further reduce the final set.

The feature selection process becomes more challenging for problems with class imbalance. One of the best ways to tackle feature selection under class imbalance is to rank the features using the Receiver Operating Characteristic (ROC) curve and the corresponding Area Under the ROC Curve (AUC), a measure that is immune to class imbalance [54]. In [54] the AUC was approximated using a small number of trapezoids, leading to a very fast implementation. In this work, since the number of cases is relatively small, a more precise estimate is used, relying on the Mann-Whitney-Wilcoxon two-sample statistic [55, 56]:

$$ {U}_1={n}_1{n}_2+\frac{n_1\left({n}_1+1\right)}{2}-{R}_1, $$
(1)
$$ AUC=\frac{U_1}{n_1{n}_2}, $$
(2)

where n_1 is the number of examples belonging to class 1, n_2 is the number of examples belonging to class 2, and R_1 is the sum of the ranks of the samples of class 1.
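As an illustration, Eqs. (1) and (2) can be computed directly from pooled ranks. The sketch below is a possible Python implementation; the orientation-free score returned at the end is a convenience of this sketch, not something stated in the text.

```python
import numpy as np
from scipy.stats import rankdata

def feature_auc(values, labels):
    """AUC of a single feature via the Mann-Whitney-Wilcoxon statistic, Eqs. (1)-(2)."""
    x1 = values[labels == 1]                    # class 1 (e.g. abnormal cases)
    x2 = values[labels != 1]
    n1, n2 = x1.size, x2.size
    ranks = rankdata(np.concatenate([x1, x2]))  # ranks over the pooled sample
    r1 = ranks[:n1].sum()                       # rank sum of class 1
    u1 = n1 * n2 + n1 * (n1 + 1) / 2.0 - r1     # Eq. (1)
    auc = u1 / (n1 * n2)                        # Eq. (2)
    return max(auc, 1.0 - auc)                  # ignore the feature's orientation

def rank_features(X, labels):
    """Rank the columns of X (one feature per column) by decreasing AUC."""
    aucs = np.array([feature_auc(X[:, j], labels) for j in range(X.shape[1])])
    order = np.argsort(aucs)[::-1]
    return order, aucs[order]
```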

After the ranking stage, a visual inspection of the feature ranking is performed. Features are plotted in descending order of the AUC value, which is estimated by leaving out each time one randomly selected positive example (a pathological case with pH ≤ 7.05) and a dozen randomly selected negative examples (normal cases with pH > 7.05). The resulting pattern is consistent across the different trials, showing two distinct “clusters” of features consisting of three and six features, respectively. These clusters have much higher (individual) predictive value compared to the other features, cf. Figure 2. Based on these observations, three different input feature sets are tested: a) the three individually “best” features (energy in the VLF band [41], energy in the LF band [10], and SD2, the standard deviation of the Poincaré plot points along the line y = x); b) the nine highest ranked features (the first three plus ApEn(r = 0.2, m = 2), ApEn(r = 0.15, m = 2), the ratio of energies in the LF and High Frequency (HF) bands (LF/HF) [10], SampEn, STV [38], and energy in the LF band [41]); and c) all 54 features.

Fig. 2 The AUC values of all 54 features for a random training sample. Each of the two “clusters” of features with higher AUC values is marked with an ellipse. The first cluster contains, ranked from most to least important: energy in the VLF band, energy in the LF band [11], and SD2 of the Poincaré plot. The second cluster contains, ranked from most to least important: ApEn(r = 0.2, m = 2), ApEn(r = 0.15, m = 2), LF/HF, SampEn, STV-HAAN, and energy in the LF band [41]

2.5 Classification using least squares support vector machines (LS-SVMs)

As described in the previous section, an AUC-based filter selection scheme is applied to reduce the number of features and to separate those having a noticeable impact from the rest. However, in this setting the correlation between features is not considered. Therefore SVMs, a classification paradigm that is not strongly affected by the presence of correlated inputs, are selected [57] to perform the categorization/decision task. More specifically, the Least Squares version of SVMs (LS-SVMs) [58] is chosen due to the much faster training time required when moderate size problems are tackled (the computational complexity of a naïve implementation is O(N^3), where N is the size of the training set, but faster approaches exist [59]).

SVM classifiers map the data into a higher dimensional space, in which an optimal separating hyper-plane is constructed. Given a set of N training samples {(x_i, y_i), i = 1,  … , N}, where \( {x}_i\in {\mathbb{R}}^{N_f} \) (N_f being the dimension of the input space) and y_i ∈ {+1, −1} are the corresponding labels, the support vector method aims to construct a classifier of the form:

$$ {w}^T\varphi \left({x}_i\right)+ b\ge +1,\mathrm{if}\;{y}_i=+1 $$
(3)
$$ {w}^T\varphi \left({x}_i\right)+ b\le -1,\mathrm{if}\;{y}_i=-1 $$
(4)

or equivalently

$$ {y}_i\left[{w}^T\varphi \left({x}_i\right)+ b\right]\ge 1, i=1,\dots, N, $$
(5)

where φ(⋅) is a nonlinear function that maps the input space into a higher dimensional space, b is a scalar bias, and w is an unknown weight vector of the same dimension as φ(x_i).

For the standard SVM algorithm the following optimization problem is formulated:

$$ \begin{array}{l}\kern2em {min}_{w, b,{\xi}_i}{F}_1\left( w,{\xi}_i\right)=\frac{1}{2}{w}^T w+ C\sum_{i=1}^N{\xi}_i\hfill \\ {}\mathrm{Subject}\ \mathrm{to}:{y}_i\left[{w}^T\varphi \left({x}_i\right)+ b\right]\ge 1-{\xi}_i, i=1,\dots, N\hfill \\ {}\kern6em {\xi}_i\ge 0, i=1,\dots, N\hfill \end{array} $$
(6)

where the ξ_i are slack variables that allow misclassifications in the set of inequalities (e.g., due to overlapping distributions). The positive real constant C is considered a tuning parameter of the algorithm. For the LS-SVM classifier, instead of Eq. (6), the optimization problem is formulated as in [58]:

$$ \begin{array}{l}\kern3em { \min}_{w, b, e}{F}_2\left( w, e\right)=\frac{1}{2}{w}^T w+\frac{1}{2}\gamma \sum_{i=1}^N{e}_i^2\hfill \\ {}\mathrm{Subject}\ \mathrm{to}:{y}_i\left[{w}^T\varphi \left({x}_i\right)+ b\right]=1-{e}_i, i=1,\dots, N,\hfill \end{array} $$
(7)

where e_i is an error variable and γ is a regularization parameter.

The above formulation leads to the construction of a decision function of the form:

$$ y(x)=\mathit{\operatorname{sign}}\left(\sum_{i=1}^N{a}_i{y}_i K\left({x}_i, x\right)+ b\right), $$
(8)

which implies that every training data point is a support vector. K(⋅, ⋅) is a kernel function that implicitly performs the mapping from the input space to the high dimensional feature space and the a_i are the Lagrange multipliers.

In this work the RBF kernel is used:

$$ K\left({x}_i,{x}_j\right)= \exp \left(\frac{-{\left\Vert {x}_i-{x}_j\right\Vert}_2^2}{\sigma^2}\right), $$
(9)

where σ is the spread parameter of the RBF kernel.

The above formulation works well in the case of well-balanced classes. However, for cases with an imbalanced distribution between the two classes, a mechanism for compensating for the imbalance is needed [60]. One of the simplest methods relies on subsampling the majority class; however, this might discard some of the patterns that lie on the decision boundary. To avoid this problem, a second approach based on assigning unequal costs to the two classes is used [18, 60]. In the case of LS-SVMs, the formulation becomes:

$$ \begin{array}{l}\kern2em {min}_{w, b, e}{F}_2\left( w, e\right)=\frac{1}{2}{w}^T w+\frac{1}{2}\gamma \sum_{i=1}^N{v}_i{e}_i^2\hfill \\ {}\mathrm{Subject}\;\mathrm{to}\;{y}_i\left({w}^T\varphi \left({x}_i\right)+ b\right)=1-{e}_i, i=1,\dots, N,\hfill \end{array} $$
(10)

where v_i is given by [61]:

$$ {v}_i=\left\{\begin{array}{cc}\hfill \frac{N}{2{N}_p}\hfill & \hfill if\;{y}_i=+1\hfill \\ {}\hfill \frac{N}{2{N}_N}\hfill & \hfill if\;{y}_i=-1\hfill \end{array}\right\}, $$
(11)

with N_p and N_N representing the number of “positive” and “negative” training samples, respectively. In this work, a fine-tuning of the ratio between the two penalty factors is sought during the parameter selection process.

For the LS-SVM implementation, the LS-SVMlab toolbox is used (http://www.esat.kuleuven.be/sista/lssvmlab/).
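For illustration, the weighted LS-SVM of Eqs. (7)-(11) with the RBF kernel of Eq. (9) reduces to solving a single linear (KKT) system in the dual variables. The NumPy sketch below shows this; it is a minimal stand-in for the LS-SVMlab routines actually used, and the class and parameter names are illustrative.

```python
import numpy as np

def rbf_kernel(A, B, sigma):
    """K(x_i, x_j) = exp(-||x_i - x_j||^2 / sigma^2), as in Eq. (9)."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / sigma ** 2)

class WeightedLSSVM:
    """Minimal weighted LS-SVM classifier (Eqs. (7)-(11)), for illustration only."""

    def __init__(self, gamma=1.0, sigma=1.0):
        self.gamma, self.sigma = gamma, sigma

    def fit(self, X, y):
        """X: (N, N_f) feature matrix, y: labels in {+1, -1}."""
        n = y.size
        v = np.where(y == +1, n / (2.0 * np.sum(y == +1)),
                     n / (2.0 * np.sum(y == -1)))          # class weights, Eq. (11)
        omega = np.outer(y, y) * rbf_kernel(X, X, self.sigma)
        # KKT system:  [ 0   y^T             ] [b]   [0]
        #              [ y   Omega + D/gamma ] [a] = [1]
        A = np.zeros((n + 1, n + 1))
        A[0, 1:] = y
        A[1:, 0] = y
        A[1:, 1:] = omega + np.diag(1.0 / (self.gamma * v))
        sol = np.linalg.solve(A, np.r_[0.0, np.ones(n)])
        self.b, self.alpha = sol[0], sol[1:]
        self.X_, self.y_ = X, y
        return self

    def predict(self, X):
        """Decision function of Eq. (8)."""
        K = rbf_kernel(X, self.X_, self.sigma)
        return np.sign(K @ (self.alpha * self.y_) + self.b)
```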

3 Results

Due to the small number of abnormal cases (44 in total), a 44-fold stratified cross-validation is used for performance estimation. The employed cross-validation consists of an outer and an inner loop: in the inner loop the LS-SVM parameters are tuned, while in the outer loop the performance is estimated. The number of folds is set such that the best exploitation of the limited number of “abnormal” cases is achieved. More specifically, for each fold one case belonging to the abnormal set and 12 (or 11) cases belonging to the normal set are used for testing, leaving 43 abnormal and 496 (or 497) normal cases for training. The training set is normalized so that each feature has zero mean and unit standard deviation; the learned transform is then applied to the testing data.

Before testing the LS-SVM, the involved parameters (i.e. σ, the regularization parameter γ, and the imbalance factor) are tuned using the training data and a 43-fold stratified cross-validation procedure. This inner-loop procedure is repeated five times, each time reshuffling the normal cases so that, across the five repetitions, an abnormal case is never matched with the same 12 (or 11) normal cases. The whole evaluation procedure is repeated 15 times, each time reshuffling the samples corresponding to the normal cases. Figure 3 depicts the whole process.

Fig. 3 The overall procedure
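The fold construction and per-fold normalization described above can be sketched as follows; this is an illustrative reconstruction of the protocol, not the authors’ code, and the random seed and helper names are arbitrary.

```python
import numpy as np

def make_outer_folds(n_abnormal=44, n_normal=508, seed=0):
    """44 outer folds: each holds out 1 abnormal and 11-12 normal cases for testing."""
    rng = np.random.default_rng(seed)
    abnormal = rng.permutation(n_abnormal)
    normal_chunks = np.array_split(rng.permutation(n_normal), n_abnormal)
    return list(zip(abnormal, normal_chunks))

def zscore(train, test):
    """Normalize features using statistics learned on the training part only."""
    mu, sd = train.mean(axis=0), train.std(axis=0)
    return (train - mu) / sd, (test - mu) / sd
```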

The applied tuning process is used in order to select good, near-optimal parameter values according to a specific criterion/performance measure. In general, all classification measures can be estimated using the elements of the confusion matrix (Table 2), with sensitivity and specificity being among the most commonly reported values in medical settings.

Table 2 A general confusion matrix for a binary problem

Some of the most common performance measures are:

  • the overall accuracy:

$$ Accuracy=\frac{TP+ TN}{TP+ TN+ FP+ FN}, $$
(12)

  • the True Positive Rate (TP_rate), also known as Sensitivity or Recall:

$$ {TP}_{rate}=\frac{TP}{TP+ FN}, $$
(13)

  • the True Negative Rate (TN_rate), also known as Specificity:

$$ {TN}_{rate}=\frac{TN}{TN+ FP}, $$
(14)

  • the Positive Predictive Value (PPV), also known as Precision:

$$ PPV=\frac{TP}{TP+ FP}. $$
(15)

The conventional overall accuracy is not suitable for problems with high imbalance between the classes, because with imbalanced datasets it leads to the adoption of classifiers that may completely neglect the minority class [62] (in the current case a classifier that assigns everything to the negative class would have an accuracy of 508/552, or 92.3%, but would be practically useless). In order to avoid that, four alternative measures of classification performance are used, which weigh the entries of the confusion matrix differently:

  a) Balanced Error Rate (BER):

$$ BER=\frac{1}{2}\left(\frac{FN}{TP+ FN}+\frac{FP}{FP+ TN}\right), $$
(16)

  b) Geometric mean (g-mean):

$$ g\text{-}mean=\sqrt{TP_{rate}\cdot {TN}_{rate}}, $$
(17)

  c) Harmonic mean (F-measure):

$$ F\text{-}measure=\frac{2}{\frac{1}{TP_{rate}}+\frac{1}{PPV}}, $$
(18)

  d) Matthews Correlation Coefficient (MCC):

$$ MCC=\frac{TP\cdot TN- FP\cdot FN}{\sqrt{\left( TP+ FP\right)\left( TP+ FN\right)\left( TN+ FP\right)\left( TN+ FN\right)}}. $$
(19)

Of the four measures, only the BER has an inverse relationship to performance (smaller values correspond to better performance); for all the others, the higher the value, the better the classifier. BER is a common measure of performance in the case of imbalanced data sets [54]. The g-mean has also been used in the context of FHR classification [18] and is often employed in the case of imbalanced data sets. For the F-measure, the simplest form is selected, which corresponds to the harmonic mean of precision (PPV) and recall (TP_rate) and usually leads to balanced values between precision and recall [63]. Finally, the MCC is another measure that is not affected by the different sizes of the two classes [64].
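The sketch below gathers the measures of Eqs. (12)-(19) into a single helper; it is an illustrative Python function, not part of the original MATLAB pipeline, and the commented call at the end reproduces the “assign everything to the negative class” example mentioned above.

```python
import numpy as np

def imbalance_metrics(tp, tn, fp, fn):
    """Performance measures of Eqs. (12)-(19) from the confusion-matrix entries."""
    tpr = tp / (tp + fn)                              # sensitivity / recall
    tnr = tn / (tn + fp)                              # specificity
    ppv = tp / (tp + fp) if (tp + fp) else 0.0        # precision
    ber = 0.5 * (fn / (tp + fn) + fp / (fp + tn))
    denom = np.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"accuracy": (tp + tn) / (tp + tn + fp + fn),
            "sensitivity": tpr, "specificity": tnr, "PPV": ppv,
            "BER": ber, "g-mean": np.sqrt(tpr * tnr),
            "F-measure": 2 * ppv * tpr / (ppv + tpr) if (ppv + tpr) else 0.0,
            "MCC": mcc}

# A classifier labelling every case "normal" on this data set:
# imbalance_metrics(tp=0, tn=508, fp=0, fn=44)
# -> accuracy ~0.92, but g-mean 0, F-measure 0, MCC 0 and BER 0.5
```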

The performance for the different input feature sets and for the different measures is summarized in Fig. 4, where 1-BER is reported instead of BER so that higher values correspond to better performance, as for the other three measures. It should be noted that for each of the reported measures, the same criterion is used during the tuning process; in other words, the g-mean values shown in Fig. 4a correspond to an LS-SVM whose parameters were selected using the g-mean as the tuning criterion, and so on.

Fig. 4 Performance measures g-mean, F-measure, MCC, and 1-BER ((a), (b), (c), and (d), respectively) for the different input feature sets

The average values are reported in Table 3. From Fig. 4 it is evident that the best performance for all four measures is achieved using only the three individually best features (highlighted in bold), and that performance shows a decreasing trend as more features are added (note: this statement holds only for this specific ranking of the features and the specific three feature-set sizes (3, 9, 54)). Figure 5 shows the feature space for the three top ranked features, highlighting the non-separable nature of the pH-based classes. This is further supported by Fig. 6, which shows the non-separable nature of normal and abnormal cases for the three top ranked features (VLF, LF, and SD2) (note: the difference in median between normal and abnormal cases is statistically significant for all three features (p < 0.05, Wilcoxon rank sum test)). While it is difficult to tie these features to a precise underlying physiological mechanism, it is clear that VLF and SD2 correlate with the frequency of FHR decelerations, while LF is associated with neural sympathetic activity [10, 32, 41]. In terms of sensitivity and specificity, the results are summarized in Table 4, where each row corresponds to the results of the LS-SVMs having as inputs the three features and tuned using the criterion listed in the first column. The aggregated confusion matrices for the input set with only three features are presented in the appendix. From Table 4 it can be seen that the F-measure seems to lead to a different configuration of the LS-SVM, while the other three criteria lead to classifiers with similar performance.

Table 3 Average performance for the different input feature sets
Fig. 5 Visualization of the top three ranked features: VLF (energy in the VLF band [41]), LF (energy in the LF band [10]), and SD2. a 3D scatter plot using all three features; b-d pair-wise 2D scatter plots

Fig. 6 Visualization of the top three ranked features: VLF (energy in the VLF band [41]), LF (energy in the LF band [10]), and SD2 (A: Abnormal, N: Normal)

Table 4 Average sensitivity and specificity values, for the three best input features, under different tuning criteria

As also explained in the next section, comparison with the variety of methods found in the literature is not straightforward. Therefore, in order to further validate the utility of the proposed approach, and more specifically the variant that utilizes only the top three features, a comparison with a more conventional classification scheme is performed. This scheme involves a dimensionality reduction stage, a means to compensate for the class imbalance, and a simple classifier. However, this time the dimensionality reduction is not performed through feature selection but via PCA [51], the imbalance compensation is performed using the Synthetic Minority Oversampling TEchnique (SMOTE) [65], and the classifier is a linear one, the Minimum Mahalanobis Distance Classifier (MMDC) [51].

PCA has been used regularly in classification approaches involving the FHR [18, 21] and, even though it is a linear unsupervised technique, it can be very competitive with more advanced schemes when applied to real-life data [66]. SMOTE is also a method that has been widely used in FHR analysis, since in this field normal cases dominate the existing datasets. The MMDC is a linear method which, despite its simplicity, can perform embarrassingly well (for more advanced schemes) when applied to real-life data [67]. Another appealing property of the MMDC is that it is a parameter-free method. However, even for this simple classification scheme, and despite the fact that the MMDC does not have any parameters to be tuned, the other two stages need tuning: the selection of the number of retained PCs and the amount of oversampling of the abnormal class for SMOTE.
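A possible sketch of this baseline is shown below, using scikit-learn’s PCA, imbalanced-learn’s SMOTE, and a hand-rolled MMDC; the number of retained components and the oversampling ratio are placeholders for the values found by the grid search described next.

```python
import numpy as np
from sklearn.decomposition import PCA
from imblearn.over_sampling import SMOTE

class MMDC:
    """Minimum Mahalanobis Distance Classifier: assign each sample to the class
    whose mean is closest in Mahalanobis distance (pooled within-class covariance)."""
    def fit(self, X, y):
        self.classes_ = np.unique(y)
        self.means_ = np.array([X[y == c].mean(axis=0) for c in self.classes_])
        centered = np.vstack([X[y == c] - X[y == c].mean(axis=0) for c in self.classes_])
        self.prec_ = np.linalg.pinv(np.cov(centered, rowvar=False))
        return self
    def predict(self, X):
        d = [np.einsum('ij,jk,ik->i', X - m, self.prec_, X - m) for m in self.means_]
        return self.classes_[np.argmin(np.array(d), axis=0)]

def pca_smote_mmdc(X_train, y_train, X_test, n_pc=5, smote_ratio=1.0, seed=0):
    """PCA for dimensionality reduction, SMOTE for class balance, MMDC for classification."""
    pca = PCA(n_components=n_pc).fit(X_train)
    Xtr, Xte = pca.transform(X_train), pca.transform(X_test)
    Xtr, ytr = SMOTE(sampling_strategy=smote_ratio,
                     random_state=seed).fit_resample(Xtr, y_train)
    return MMDC().fit(Xtr, ytr).predict(Xte)
```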

As in the case of the LS-SVM scheme, a grid search is performed following the same procedure as described above. The same four performance measures are used and the results are depicted in Fig. 7, against the results achieved using the more elaborate classification scheme with only three input features. From Fig. 7 it is evident that the extra effort of developing the proposed classification scheme is indeed beneficial.

Fig. 7 Performance measures g-mean, F-measure, MCC, and 1-BER ((a), (b), (c), and (d), respectively) for the LS-SVM classifier having as input the aforementioned three features, compared against the MMDC-based approach

4 Discussion

In this work, a large set of available (and commonly used) features extracted from the open access CTU-UHB CTG database is examined for classification. These features originate from different domains in order to cover as much as possible of the information contained in the FHR that could be associated with the delivery outcome. The delivery outcome is quantified using the umbilical artery pH. The analysis shows that the use of only three features, combined with a powerful yet computationally efficient classifier, can achieve sensitivity and specificity values that are close to or above 70%. The balanced nature of the results is reached by taking into account the imbalanced nature of the problem during the training phase of the LS-SVM (using unequal costs for the two classes, Eq. (10)).

Even though a direct comparison with other published works is almost impossible, the proposed approach is comparable to, or even outperforms, the published literature. Compared to [34], where a similar approach was pursued with a filter selection (RELIEF algorithm) and a boosted mean prototype classifier (sensitivity equal to 64.1% and specificity equal to 65.2%), the results are better. Moreover, the current approach reaches higher performance values using a smaller number of features, thus improving computational efficiency.

Compared to the results presented in [29], where a sensitivity of 57% and a specificity of 97% are reported, the current work achieves better sensitivity but worse specificity. However, these findings should be viewed with caution, since a different data set is involved in [29]. Regarding computational complexity, the system employed in [29] has a very fast inference mechanism based on well-established medical criteria [68]. Our method, on the other hand, requires training, but once trained the response of the model is very fast, especially since it requires the extraction of only three features.

Regarding other works that use the CTU-UHB CTG database for pH-based classification, such as [69, 70], direct comparison is again not possible since a higher threshold is used (pH threshold equal to 7.2) in combination with other criteria (note: pH is a logarithmic measure and different thresholds can lead to dramatically different stratifications of the data, and hence to completely different classification performance), while the approach in [69] is of a more exploratory nature. Finally, compared to the simpler scheme using linear methods and SMOTE for compensating the data imbalance, the proposed approach is much more effective. Table 5 summarizes the performance of the aforementioned works along with the involved feature space; these figures, however, should be treated with caution since different criteria and different datasets are involved.

Table 5 Summary of recent approaches using pH as a means to class formation

The performance of the proposed method seems to be in agreement with the findings of [32], where a g-mean equal to 0.7 is anticipated for large datasets. However, more research is needed before a conclusion can be reached on the limitations of an approach based solely on FHR processing and pH-based class formation. Moreover, Fig. 5 clearly shows considerable overlap between the two classes, indicating that a perfect classification, in the current setting, may be impossible. Therefore new features, including information coming from the UC signal [71], and/or better classification algorithms are needed. This work can therefore act as a benchmark for the evaluation of new features and algorithms, since it only requires the extraction of three features and the use of a computational method that can be trained very quickly.

5 Conclusion

In this work a method for the evaluation of the FHR is proposed and tested using the open access CTU-UHB CTG database, with promising results. For the specific setting, a minimal set of three input features seems to produce the best results in terms of performance measures developed for imbalanced data sets. These results are comparable to those achieved by other methods presented in the literature, and outperform a simpler classification scheme, which is used as a “baseline” to validate the use of advanced data processing techniques. However, the lack of standardization makes a more formal comparison impossible. The proposed approach, being the most complete experimental study so far, could be used as a benchmark for future studies involving the CTU-UHB CTG open access database.

The results also seem to confirm the findings of [32], which reported difficulties in obtaining high classification performance using FHR recordings and pH-based classes on large datasets. Therefore, either other sources of information should be sought, such as the inclusion of the Maternal Heart Rate (MHR) [72], ST analysis [29], or other clinical information as part of the feature set, and/or an alternative labeling process should be considered, keeping in mind that it is not natural to have a simple (pH-based) separating line between the normal and abnormal (pathological) FHR groups. Toward the latter, a model for aggregating experts’ opinions has recently been proposed based on Latent Class Analysis (LCA) [73,74,75]. In future work we plan to investigate a hybridization of both approaches in the hope of developing more reliable decision support tools for the interpretation of CTG recordings.