Predicting community acquired bloodstream infection in infants using full blood count parameters and C-reactive protein; a machine learning study

Brouwer, Lieke; Cunney, Robert; Drew, Richard J.

doi:10.1007/s00431-024-05441-6

RESEARCH
Open access
Published: 18 April 2024

Volume 183, pages 2983–2993, (2024)

European Journal of Pediatrics


Lieke Brouwer^1,2,
Robert Cunney^3,4 &
Richard J. Drew^3,4,5


Abstract

Early recognition of bloodstream infection (BSI) in infants can be difficult, as symptoms may be non-specific, and culture can take up to 48 h. As a result, many infants receive unneeded antibiotic treatment while awaiting the culture results. In this study, we aimed to develop a model that can reliably identify infants who do not have positive blood cultures (and, by extension, BSI) based on the full blood count (FBC) and C-reactive protein (CRP) values. Several models (i.e. multivariable logistic regression, linear discriminant analysis, K nearest neighbors, support vector machine, random forest model and decision tree) were trained using FBC and CRP values of 2692 infants aged 7 to 60 days with suspected BSI between 2005 and 2022 in a tertiary paediatric hospital in Dublin, Ireland. All models tested showed similar sensitivities (range 47% – 62%) and specificities (range 85%-95%). A trained decision tree and random forest model were applied to the full dataset and to a dataset containing infants with suspected BSI in 2023 and showed good segregation of a low-risk and high-risk group. Negative predictive values for these two models were high for the full dataset (> 99%) and for the 2023 dataset (> 97%), while positive predictive values were low in both datasets (4%–20%).

Conclusion: We identified several models that can predict positive blood cultures in infants with suspected BSI aged 7 to 60 days. Application of these models could prevent administration of antimicrobial treatment and burdensome diagnostics in infants who do not need them.

What is Known: • Bloodstream infection (BSI) in infants causes non-specific symptoms and may be difficult to diagnose. • Results of blood cultures can take up to 48 hours.
What is New: • Machine learning models can contribute to clinical decision making on BSI in infants while blood culture results are not yet known.

Introduction

Detection of bacteria from blood cultures in children may be due to bloodstream infection (BSI), transient (non-significant) bacteraemia, or contamination of the blood culture by skin flora. BSIs are a leading cause of morbidity and mortality in infants world-wide [1,2,3]. The main pathogens involved in BSIs in infants are Gram-negative bacteria (GNB) (e.g., E. coli, Pseudomonas spp. and Klebsiella spp.) and Group B streptococcus (GBS) [3,4,5,6].

Diagnosis of BSI by blood culture can take up to 48 h [7]. This can result in administration of unnecessary treatment or the execution of burdensome diagnostic procedures, such as lumbar puncture.

A combination of clinical signs, biomarkers, and full blood count (FBC) parameters is used for early recognition of BSI. However, in neonates and infants, these signs and markers can be difficult to interpret, as symptoms can be non-specific and normal ranges of biomarkers and FBC parameters change during the first months of life [8]. Efforts have been put into deriving clinical algorithms that reliably predict BSI upon presentation based on these parameters [9, 10]. Though much progress has been made, the distinction between patients with and without BSI is still not perfect.

It has been suggested that machine learning could contribute to the development of algorithms that predict BSI more accurately than existing methods [11]. In several patient groups, machine learning algorithms have already shown promising results in predicting positive blood cultures (and, by extension, BSI) from clinical symptoms, biomarkers and FBC parameters [12, 13]. Some efforts have already been made to apply machine learning methods to predict BSI in infants; Ramgopal et al. showed that a random forest model can predict serious bacterial infections in infants with high specificity and sensitivity [14]. Application of such a model in clinical practice could prevent many infants without BSI from having to undergo burdensome diagnostic procedures and unnecessary treatment.

The aim of our current study was to derive an easy-to-apply algorithm that can reliably identify infants with a low risk of community acquired BSI. To do this we applied various machine learning methods on FBC and CRP data from a cohort of infants aged 7 to 60 days presenting at the emergency department in a tertiary hospital in Dublin.

Methods and materials

Study population

We performed a retrospective case-control study at Children’s Health Ireland (CHI) Temple Street, Dublin, a tertiary paediatric hospital. The CHI Research Ethics Committee (REC) approved this study (reference number: REC-194-22). Our study population was composed of all infants aged 7 to 60 days who were admitted via the emergency department of CHI, Temple Street between January 1st 2005 and December 17th 2022, and received a work-up for suspected BSI upon presentation. All infants with a positive or negative blood culture, and who had an FBC and C-reactive protein (CRP) taken as part of their initial investigations, were included. The age range 7 to 60 days was chosen because, during this period of life, the reference values for FBC parameters and CRP are relatively stable, while several reference values change considerably in the first days of life and after the first months of life. All infants with a positive blood culture were included as cases, while all infants with a negative blood culture were included as controls. Infants that did not have FBC, CRP and blood culture results from samples taken on the same day were excluded from analysis. All blood cultures were processed in the microbiological laboratory at CHI, Temple Street using an automated blood culture platform (BacT/ALERT, BioMérieux, Marcy-l'Étoile, France), and this methodology did not change during the study period.

Data collection

Data were extracted from the electronic laboratory information system at CHI Temple Street on the following parameters: age in days on date of sampling, sex, year of sampling, FBC parameters (i.e. white cell count (WCC), neutrophils (N), lymphocytes (L), monocytes (M), eosinophils (E), basophils (B), platelets (PLT), red cell count (RCC), red cell distribution width (RDW), haemoglobin (Hb), haematocrit (HCT), mean platelet volume (MPV), mean corpuscular haemoglobin (MCH), mean corpuscular haemoglobin concentration (MCHC) and mean corpuscular volume (MCV)), CRP and blood culture results. Ratios were calculated for neutrophils to lymphocytes (NLR), MPV to platelets (MPVPR), monocytes to lymphocytes (MLR) and platelets to lymphocytes (PLR). As data on clinical parameters – e.g. temperature, heart rate and treatment regime – are stored in different databases which are not straightforward to merge with the laboratory databases, we were unable to incorporate clinical parameters in our analysis. Cases with blood culture results positive for organisms classified as likely contaminants (Supplementary Material 1) or fungal pathogens were excluded from the dataset, to ensure that included cases with positive blood cultures were likely to represent bacterial BSI. The cases were subsequently grouped into three discrete groups: Gram negative bacteraemia, bacteraemia with Group B Streptococcus, and other clinically significant bacteraemia.
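The four derived ratios can be illustrated with a short sketch. The study's analysis was run in R; the Python below is only an illustration, and the class name, field names, units and example values are assumptions introduced here rather than taken from the study data.

```python
from dataclasses import dataclass


@dataclass
class BloodCount:
    """Subset of FBC values needed for the ratios (field names are illustrative)."""
    neutrophils: float  # assumed unit: x10^9/L
    lymphocytes: float  # assumed unit: x10^9/L
    monocytes: float    # assumed unit: x10^9/L
    platelets: float    # assumed unit: x10^9/L
    mpv: float          # assumed unit: fL


def derived_ratios(fbc: BloodCount) -> dict[str, float]:
    """Return NLR, MPVPR, MLR and PLR; reject non-positive denominators."""
    if fbc.lymphocytes <= 0 or fbc.platelets <= 0:
        raise ValueError("lymphocyte and platelet counts must be positive")
    return {
        "NLR": fbc.neutrophils / fbc.lymphocytes,
        "MPVPR": fbc.mpv / fbc.platelets,
        "MLR": fbc.monocytes / fbc.lymphocytes,
        "PLR": fbc.platelets / fbc.lymphocytes,
    }


# Hypothetical values, chosen only to make the arithmetic easy to follow.
example = BloodCount(neutrophils=4.0, lymphocytes=5.0, monocytes=1.0,
                     platelets=400.0, mpv=10.0)
ratios = derived_ratios(example)
```

How zero or missing denominators were handled in the original analysis is not described; raising an error is simply one conservative choice.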

Data analysis

Cut-off values for FBC outliers were decided based on the distribution patterns of the FBC variables and clinical experience. Infants with outliers in any of the FBC variables that were likely to be derived from underlying conditions such as malignancies were excluded from further analysis. Infants that were missing CRP values, values for any FBC variable or data on sex or age were excluded from further analysis. Differences in the baseline characteristics sex and age between the cases and controls were assessed by Chi-squared test and Student’s t-test, respectively. Exploratory univariate and multivariate analyses were performed using the ggplot2 package [15] in R statistical software version 4.2.2 [16] to construct violin plots and a heatmap. To calculate the association between the independent variables age, sex, FBC parameters and CRP and the dependent variable blood culture result, univariable logistic regression was performed using the gtsummary package [17] in R statistical software.

To be able to train our models using reasonably balanced data, the controls in our dataset were randomly subsampled to reach a case control ratio of 1:3. The subsampled dataset was then randomly split into a training set containing 70% of the data and a test set containing 30% of the data. A multivariable logistic regression model was built based on the training set in R. The model was built based on the results of the univariable model, including all variables with a p-value of 0.2 or less, and subsequently removing variables using the top-down strategy to obtain a model in which all independent variables were significant (p-value < 0.05).
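The subsampling and splitting step can be sketched as follows. This is a Python stand-in for an analysis that was done in R; the function name, the seed and the placeholder records are assumptions, and a simple (non-stratified) random split is assumed because the text does not say whether the split was stratified.

```python
import random


def subsample_and_split(cases, controls, ratio=3, train_fraction=0.7, seed=0):
    """Subsample controls to `ratio` per case, then split into train and test sets."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n_controls = min(len(controls), ratio * len(cases))
    sampled_controls = rng.sample(controls, n_controls)
    # Label cases 1 and controls 0, pool them and shuffle before splitting.
    pooled = [(row, 1) for row in cases] + [(row, 0) for row in sampled_controls]
    rng.shuffle(pooled)
    cut = round(train_fraction * len(pooled))
    return pooled[:cut], pooled[cut:]


# Placeholder records standing in for the 76 cases and 2616 controls.
cases = [f"case_{i}" for i in range(76)]
controls = [f"control_{i}" for i in range(2616)]
train, test = subsample_and_split(cases, controls)
```

With 76 cases, an exact 1:3 ratio gives 228 controls and 304 infants in total, of which about 213 would fall in the training set and 91 in the test set under these assumptions.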

The training set was normalized, and a linear discriminant analysis (LDA) model was built based on the training set using the MASS package [18] in R. A K-nearest neighbour (KNN) model and a support vector machine (SVM) with linear kernel were fitted to the normalized training set using the class [18] and e1071 [19] packages respectively in R. A decision tree and a random forest were built based on the non-normalized training set using the rpart package [20] and randomForest package [21] respectively in R. All analyses were performed including variables with a p-value of 0.2 or less in the univariable regression model, to minimize excluding variables that would make a valuable contribution to the models, as well as to minimize including variables that would not be of importance and would create noise in the model. Additionally, an LDA model and decision tree model were built based on the grouping of pathogens (GNB versus GBS versus controls) in the training set.

All models were subsequently used to make predictions in the test set, and parameters for the prediction (sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV) and accuracy) were calculated using the epiR package [22]. For each model, the area under the receiver operating characteristic curve (AUROC) was calculated using the pROC package [23].
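To make the AUROC concrete, the sketch below computes it as the probability that a randomly chosen culture-positive infant receives a higher model score than a randomly chosen culture-negative infant, with ties counted as one half. This is the standard rank-based (Mann-Whitney) formulation written in Python; it is not the pROC implementation used in the study, and the labels and scores are hypothetical.

```python
def auroc(labels, scores):
    """AUROC as P(score of a positive > score of a negative); ties count one half."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical predicted probabilities for two cases (1) and three controls (0).
labels = [1, 1, 0, 0, 0]
scores = [0.9, 0.4, 0.5, 0.2, 0.1]
value = auroc(labels, scores)  # 5 of the 6 case-control pairs are ordered correctly
```

The pairwise loop is quadratic in the number of infants, which is acceptable for a test set of this size; a rank-based implementation would be preferable for larger data.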

The decision tree model and the random forest model that were trained using the subsampled dataset were subsequently used to make predictions for the entire dataset (i.e. 76 cases and 2616 controls). Sensitivity, specificity, NPV and PPV were calculated.

Applying the model to a 2023 dataset

After finalisation of the models, 206 infants between the age of 7 and 60 days with a work-up for suspected BSI at CHI Temple Street between January 1st 2023 and September 27th 2023 were included as a second cohort. A prediction was made on the presence of bacteraemia in these children using the trained decision tree model and random forest model. These predictions were subsequently compared to the outcome of the blood culture, and accuracy, sensitivity, specificity, NPV and PPV were calculated.

Results

Dataset construction and descriptive analyses

In total, 2876 infants between 7 and 60 days of age presenting between 2005 and 2022 at the emergency department of CHI Temple Street, had received a work-up for suspected BSI. For 179 (6.2%) infants, the blood culture grew a likely contaminant, and these infants were subsequently excluded from analysis. A further 4 (0.1%) infants were excluded as their FBC results contained outliers. One infant (0.03%) was excluded as their sex was reported as unknown.

Of the remaining 2692 infants, 76 (2.9%) had a positive blood culture and were thus labelled cases, while 2616 (97.1%) had a negative blood culture and were thus labelled controls.

Sex distribution was comparable between the case (34% female) and control group (44% female) (p = 0.11). The median age was 25 days for cases (interquartile range (IQR) 17–39 days) and 34 days for controls (IQR 21–46 days) (p = 0.003). Between 32 and 274 infants were included in each calendar year, with lower numbers of inclusions during the earlier years in our study period, and during the first two years of the SARS-CoV-2 pandemic (2020–2021), while the highest number of inclusions was in 2022 (Fig. 1). Distributions of CRP and the FBC variables in cases and controls are shown in Fig. 2. The heat plot (Fig. 3) shows that WCC, neutrophils and lymphocytes were mutually correlated, as were RCC, Hb, HCT, MCV and MCH.

Logistic regression models

In the univariable logistic regression analysis, age, CRP, E, L, N, PLT, NLR, MPVPR and PLR were significantly associated with blood culture results (p-value < 0.05) (Table 1).

Table 1 Univariable logistic regression performed on the full dataset (N = 2692). Odds ratios (OR) with 95% confidence intervals (CIs) and p-values are shown, and refer to the odds of having a positive blood culture result

Working top-down from a logistic regression model including all parameters with a p-value of 0.2 or lower in the univariable analysis, we derived a multivariable logistic regression model that included age, CRP, lymphocytes, NLR and MLR as significantly associated with the blood culture results.

Predictive algorithms

The multivariable logistic regression, LDA, KNN, SVM, decision tree and random forest models were all able to predict bacteraemia from FBC parameters and CRP in infants in the subsampled test set with accuracies between 80 and 86%. All models showed moderate to high AUROCs (0.72 – 0.82), and high specificities (ranging between 85 and 95%), but rather low sensitivities, ranging from 47 to 61%. PPV was lowest for the decision tree model at 56.5%, while it was highest for the logistic regression model at 75%. NPVs for all models were similar, ranging from 85 to 89% (Table 2). Across the multivariable logistic regression model, LDA, random forest and decision tree, the variables CRP, lymphocytes and NLR seemed to be of particular importance.

Table 2 Area under the receiver operating characteristic curve (AUROC), accuracy (acc), sensitivity (sens), specificity (spec), positive predictive values (PPV) and negative predictive values (NPV) for the multivariable logistic regression model, linear discriminant analysis model, K-nearest neighbors model, support vector machine model, random forest model and decision tree model

The decision tree model (Fig. 4) shows a root split based on NLR, and lower-level splits for monocytes, age and NLR. The largest bin contains 73% of the study population, of whom 92% are culture negative. Another low-risk group consists of those with NLR ≥ 1.6 and < 3.4, and age ≥ 34 days, comprising 6% of the dataset. In addition, there are three high-risk groups: those with NLR < 1.6 and monocytes < 0.29 × 10⁹/L (4%), those with NLR ≥ 1.6 and age < 34 days (14%) and those with NLR ≥ 3.4 and age ≥ 34 days (3%).
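Because the tree consists of a few concrete cut-offs, its splits can be written out as nested conditions. The sketch below is a Python reading of the thresholds quoted in the text, not the fitted rpart object. The function name and return labels are hypothetical, and the leaf for NLR < 1.6 with monocytes ≥ 0.29 × 10⁹/L is assumed to be the large low-risk bin, since it is the only leaf not explicitly described.

```python
def classify_risk(nlr: float, monocytes: float, age_days: float) -> str:
    """Return "low" or "high" following the splits quoted in the text.

    monocytes is expected in x10^9/L. The low-risk label for
    NLR < 1.6 with monocytes >= 0.29 is an inference, not a quoted rule.
    """
    if nlr < 1.6:
        # Root split on NLR, then a split on monocytes.
        return "high" if monocytes < 0.29 else "low"
    if age_days < 34:
        # NLR >= 1.6 in infants younger than 34 days.
        return "high"
    # NLR >= 1.6 and age >= 34 days: second NLR split at 3.4.
    return "low" if nlr < 3.4 else "high"


# Hypothetical infants, one per leaf.
leaf_labels = [
    classify_risk(1.0, 0.50, 20),  # low (assumed largest bin)
    classify_risk(1.0, 0.20, 20),  # high
    classify_risk(2.0, 0.50, 20),  # high
    classify_risk(2.0, 0.50, 40),  # low
    classify_risk(3.4, 0.50, 40),  # high
]
```

Writing the rule this way shows why the tree can be interpreted at the bedside: three measurements and at most three comparisons determine the risk group.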

Segregation of pathogen groups

LDA analysis on the distinct groups of pathogens showed good segregation of GBS and GNB versus controls (Fig. 5a). The variables contributing most to segregation of pathogen groups were CRP, MLR, PLR and neutrophils.

A single tree model on the distinct groups of pathogens showed good segregation of GBS and GNB versus controls in the training set (Fig. 5b). In the test set, it showed an overall accuracy of 89%. The variables that the segregation was based on were CRP, PLR, and monocytes.

Application of the decision tree to the full dataset

When applied to the full dataset, the single tree model classified a total of 2342 (87%) of the children as low-risk, with a high negative predictive value of 99.1% (95% CI 98.6% – 99.4%). It classified a total of 350 infants as high-risk, with a positive predictive value of 15.4% (95% CI 11.8% – 19.6%). The overall accuracy remained high at 88.2%, while the AUROC dropped to 0.57.

The false-negative group contained 22 infants, of whom four had GBS (n = 25 in the entire dataset), 12 had GNB (n = 36 in the entire dataset), four had Staphylococcus aureus (n = 6 in the entire dataset), one had Haemophilus influenzae and one had Enterococcus faecalis. The false-positive group contained 296 infants, for whom we have no further clinical details.
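The reported figures for the decision tree on the full dataset can be cross-checked by rebuilding the confusion matrix from the counts given in the text (76 cases, 350 infants classified as high-risk, 2342 as low-risk, 22 false negatives). The Python sketch below does this arithmetic; the function and variable names are illustrative.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard 2x2 diagnostic accuracy measures from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
    }


# Counts reported in the text for the decision tree on the full dataset.
total_cases = 76
high_risk = 350
low_risk = 2342
false_negatives = 22

tp = total_cases - false_negatives  # cases flagged as high-risk
fp = high_risk - tp                 # controls flagged as high-risk
tn = low_risk - false_negatives     # controls flagged as low-risk
metrics = diagnostic_metrics(tp, fp, false_negatives, tn)
```

These counts reproduce the reported 296 false positives, the PPV of 15.4%, the NPV of 99.1% and the accuracy of 88.2%. They also imply a sensitivity of about 71% and a specificity of about 89% on the full dataset, values that are derived here rather than reported in the text.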

When applied to the full dataset, the accuracy of the random forest model remained high at 89.5% while the AUROC dropped to 0.60. The model classified 12.7% of the infants as high risk, with a PPV of 19.6%. It classified 87.3% of the infants as low risk, with an NPV of 99.6%. Of the nine infants that were false negative, six had a blood culture positive for Escherichia coli, two for GBS and one for Staphylococcus aureus.

Application of the models to the 2023 dataset

For six infants in the 2023 cohort, the blood culture grew a likely contaminant, and they were excluded from the dataset. The remaining dataset contained 197 infants, of whom seven had a positive blood culture. The pathogens cultured in this cohort were Enterococcus faecalis (n = 1), Escherichia coli (n = 3), Staphylococcus aureus (n = 1) and Streptococcus agalactiae (n = 2). The decision tree model classified 95 (48%) infants as high-risk with a PPV of 4.2%. It classified 102 (52%) infants as low-risk, with an NPV of 97.1%. The random forest model classified 77 (39%) infants as high-risk, with a PPV of 6.5%. It classified 120 (61%) infants as low-risk, with an NPV of 98.3% (Table 2). The overall accuracies (52.3% and 62.4%) and the AUROCs (0.58 and 0.59) were low for the decision tree and random forest model, respectively.

Discussion

In this study, we aimed to derive an easy-to-apply algorithm that can reliably identify infants aged 7–60 days with a low risk of positive blood cultures (and, by extension, BSI) presenting to an Emergency Department in a tertiary paediatric hospital setting in Ireland. This specific age group was chosen as the normal ranges of biomarkers and FBC parameters – which fluctuate after birth and during the first months of life – are rather stable in this age group.

All models tested in our study predicted positive blood cultures with similar sensitivity, specificity, NPV and PPV in a subsampled dataset with a case control ratio of 1:3. However, in the absence of dedicated decision support tools, implementation of these models would not be easy to establish in clinical practice. The decision tree model is an exception to this, as it makes use of a set of concrete cut-off values and can thus be interpreted at the bedside. We therefore selected the decision tree model for testing on the full dataset and the 2023 dataset, as well as the random forest model for comparison.

Both the decision tree model and random forest model proved to be good predictors for positive blood cultures in the full dataset with a high NPV, but low PPV, though the AUROC was rather low. In the 2023 cohort, the predictive value of both models was slightly lower. However, the NPVs were still high at 97.1% and 98.3% respectively. This is important, as an ideal predictive model would have a high NPV, approaching 100% [11]. In the 2023 cohort, both models show low PPVs, under 10%. The low PPV and high NPV are in line with findings of other reports of models predicting positive blood cultures in both infant and adult populations. Despite a low PPV, these models are thought to be able to make a valuable contribution to the clinical decision-making process [12,13,14].
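One general reason why PPV is low and NPV high when few infants are culture positive is that both depend on prevalence. The Python sketch below applies Bayes' rule at a fixed sensitivity and specificity. It uses the values implied by the decision tree counts reported for the full dataset (76 cases, 22 false negatives, 2616 controls, 296 false positives), and compares the full-dataset prevalence with the 25% prevalence of a 1:3 subsample. Holding sensitivity and specificity constant across datasets is a simplifying assumption made for illustration only.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (PPV, NPV) for a test applied at a given prevalence."""
    tp = sensitivity * prevalence
    fn = (1 - sensitivity) * prevalence
    fp = (1 - specificity) * (1 - prevalence)
    tn = specificity * (1 - prevalence)
    return tp / (tp + fp), tn / (tn + fn)


# Sensitivity and specificity implied by the reported full-dataset counts.
sens = (76 - 22) / 76
spec = (2616 - 296) / 2616

# Prevalence in the full dataset versus a 1:3 case-control subsample.
ppv_full, npv_full = predictive_values(sens, spec, 76 / 2692)
ppv_balanced, npv_balanced = predictive_values(sens, spec, 0.25)
```

Under these assumptions, the PPV is about 15% at the full-dataset prevalence and about 68% at a prevalence of 25%, while the NPV moves in the opposite direction (about 99% versus about 90%). This is consistent with the pattern of higher PPVs and lower NPVs seen in the subsampled test set.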

Though our models approach an NPV of 100%, they may not be able to identify all infants with BSI; some infants may be at an early stage of their infection, and FBC and biochemical parameters may not have shifted much at that point in time. The low PPV of our models may be due to infants in our cohort having similar clinical syndromes, such as localised infections not resulting in BSI, or viraemia. Additionally, a portion of the infants may have been diagnosed with culture negative sepsis [24], or a blood culture taken later in the course of disease might have come up positive. As we did not check for culture results from other sample types or follow up cultures, we do not know how much these scenarios have influenced our models’ parameters.

Ideally, a model such as those presented in this study would have clinical parameters – e.g. temperature and heart rate – incorporated in addition to the laboratory parameters. Unfortunately, we were unable to incorporate such clinical parameters. Furthermore, we were unable to compare the predictions of the models to the judgement of physicians, as we did not have information on the initiated treatment. However, models like the ones presented in this study should never be the sole basis for clinical decision making. They should instead be used by physicians as a tool to further support clinical decision making, in addition to clinical expertise. The models presented in this study will therefore mainly be of benefit to clinical decision-making in infants without a clear indication for start of antimicrobial therapy, such as infants who do not appear particularly unwell, or who do not show clear signs of an infection source. In these cases, the models presented here can further support a low risk of BSI, and an observational approach without immediate start of treatment or further invasive procedures can be considered until blood culture results are available.

Furthermore, we have shown that different pathogen groups may induce different patterns in FBC and CRP shifts. Therefore, changes in circulation of pathogens over time could influence the accuracy of a model. During and after the SARS-CoV-2 pandemic, we have seen changes in circulation of many pathogens [25, 26]. This could further explain why our models, trained on data collected before and during the pandemic, underperform in predicting bloodstream infections in a post-pandemic dataset.

In conclusion, we have shown that both a random forest model and decision tree model, trained on data from 2005–2022 in a tertiary hospital in Dublin, Ireland, can predict positive blood cultures in infants aged 7 to 60 days with a high NPV. While the random forest model performs slightly better, the decision tree model is easy to implement and could be of direct assistance in clinical practice. While the PPV of these models is low, these models can support practicing clinicians in recognizing low-risk patients, for whom a reserved attitude towards starting treatment and further diagnostic procedures may be appropriate.

A future validation study is necessary to specifically assess children who do not have a clear clinical indication for starting therapy, such as infants who do not appear particularly unwell or those without a clear source of infection. In addition, further development of these models is desirable to increase the PPV, thereby reducing the number of false positives. Separate models that predict positive blood cultures for distinct pathogen groups may prove more accurate compared to a single model predicting overall positive blood cultures as presented here. Furthermore, it will be important to continuously train these models, to ensure that they remain applicable in changing microbiological landscapes. Lastly, as the models presented here were trained on infants aged 7 to 60 days presenting at the emergency department, the application of these models is limited to infants in this age range with suspected community acquired BSI, and may not be applicable to any other patient groups.

Data Availability

Clinical data used for this study were extracted from the electronic laboratory information system at CHI Temple Street, Dublin, Ireland. This data is not publicly available to maintain patients’ privacy in line with the European General Data Protection Regulation. Anonymized data is available on request.

References

1. World Health Organization (WHO) (2023) Newborn infections. Available from: https://www.who.int/teams/maternal-newborn-child-adolescent-health-and-ageing/newborn-health/newborn-infections
2. World Health Organization (WHO) (2023) Group B Streptococcus Vaccine: full value vaccine assessment. Available from: https://www.who.int/publications/i/item/9789240037526
3. Wojkowska-Mach J et al (2019) Neonate Bloodstream Infections in Organization for Economic Cooperation and Development Countries: An Update on Epidemiology and Prevention. J Clin Med 8(10):1750
4. Hallmaier-Wacker LK et al (2022) Incidence and aetiology of infant Gram-negative bacteraemia and meningitis: systematic review and meta-analysis. Arch Dis Child 107(11):988–994
5. HSE Health Protection Surveillance Centre (2019) Annual Epidemiological Report; Invasive Streptococcus Group B Infection in Ireland 2018. Dublin HSE HPSC
6. Ramply B, Vavasseur C, Knowles S (2022) Bacteriological Profiles in Early-Onset-Sepsis (EOS) and Late-Onset-Sepsis (LOS) in Neonates. IMG 115(8):123
7. Kuzniewicz MW et al (2020) Time to Positivity of Neonatal Blood Cultures for Early-onset Sepsis. Pediatr Infect Dis J 39(7):634–640
8. Henry E, Christensen RD (2015) Reference Intervals in Neonatal Hematology. Clin Perinatol 42(3):483–497
9. Molloy EJ, Bearer CF (2022) Paediatric and neonatal sepsis and inflammation. Pediatr Res 91(2):267–269
10. Kerste M et al (2016) Application of sepsis calculator in newborns with suspected infection. J Matern Fetal Neonatal Med 29(23):3860–3865
11. Celik IH et al (2022) Diagnosis of neonatal sepsis: the past, present and future. Pediatr Res 91(2):337–350
12. Fleuren LM et al (2020) Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive Care Med 46(3):383–400
13. Mooney C et al (2021) Predicting bacteraemia in maternity patients using full blood count parameters: A supervised machine learning algorithm approach. Int J Lab Hematol 43(4):609–615
14. Ramgopal S et al (2020) Machine Learning To Predict Serious Bacterial Infections in Young Febrile Infants. Pediatrics 146(3):123
15. Wickham H (2016) ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag, New York
16. R Core Team (2022) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, https://www.R-project.org/
17. Sjoberg DD et al (2021) Reproducible Summary Tables with the gtsummary Package. R Journal 13(1):570–580
18. Venables WN, Ripley BD (2002) Modern Applied Statistics with S. Springer, New York
19. Meyer D et al (2023) e1071: Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071), TU Wien, R package version 1.7–13
20. Therneau T, Atkinson B (2022) rpart: Recursive Partitioning and Regression Trees, R package version 4.1.19
21. Liaw A, Wiener M (2022) Classification and Regression by randomForest, R News 2(3):18–22
22. Stevenson M et al (2023) epiR: Tools for the Analysis of Epidemiological Data, R package version 2.0.57
23. Robin X et al (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12:77
24. Cantey JB, Prusakov P (2022) A Proposed Framework for the Clinical Management of Neonatal “Culture-Negative” Sepsis. J Pediatr 244:203–211
25. Alaib H et al (2023) Frequency and Seasonal Variations of Viruses Causing Respiratory Tract Infections in Children Pre- and Post-COVID-19 Pandemic in Riyadh (2017–2022). Cureus 15(1):e33467
26. Hayes LJ et al (2023) Impact of the COVID-19 pandemic on the circulation of other pathogens in England. J Med Virol 95(1):e28401

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

Public Health Laboratory, HSE, Cherry Orchard Hospital, Dublin, Ireland
Lieke Brouwer
European Public Health Microbiology Training Programme (EUPHEM), European Centre for Disease Prevention and Control, Stockholm, Sweden
Lieke Brouwer
Irish Meningitis and Sepsis Reference Laboratory, Children’s Health Ireland at Temple Street, Dublin, Ireland
Robert Cunney & Richard J. Drew
Department of Clinical Microbiology, Royal College of Surgeons in Ireland, Dublin, Ireland
Robert Cunney & Richard J. Drew
Clinical Innovation Unit, Rotunda Hospital, Dublin, Ireland
Richard J. Drew

Contributions

LB, RC, and RD contributed to study conception and design. LB performed data collection and data analysis and wrote a first draft of the manuscript. LB, RC, and RD reviewed the first draft of the manuscript. LB prepared a final draft of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lieke Brouwer.

Ethics declarations

Ethical approval

This study was conducted in accordance with the Declaration of Helsinki and the protocol was approved by the CHI Research Ethics Committee (REC) (reference number: REC-194-22).

Consent to participate

As this was a retrospective study and all data was anonymized, consent of participants was not required.

Competing interest

The authors declare no competing interests.

Additional information

Communicated by Peter de Winter

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 12 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Brouwer, L., Cunney, R. & Drew, R.J. Predicting community acquired bloodstream infection in infants using full blood count parameters and C-reactive protein; a machine learning study. Eur J Pediatr 183, 2983–2993 (2024). https://doi.org/10.1007/s00431-024-05441-6

Received: 03 November 2023
Revised: 04 January 2024
Accepted: 17 January 2024
Published: 18 April 2024
Issue Date: July 2024
DOI: https://doi.org/10.1007/s00431-024-05441-6
