Predictive biomarkers of COVID-19 prognosis identified in Bangladesh patients and validated in Japanese cohorts

Uno, Kazuko; Hasan, Abu; Nakayama, Emi E.; Rahim, Rummana; Harada, Hiromasa; Kaneko, Mitsunori; Hashimoto, Shoji; Tanaka, Toshio; Matsumoto, Hisatake; Fujimiya, Hitoshi; Shioda, Tatsuo; Rahman, Mizanur; Yoshizaki, Kazuyuki

doi:10.1038/s41598-024-63184-8

Predictive biomarkers of COVID-19 prognosis identified in Bangladesh patients and validated in Japanese cohorts

Article
Open access
Published: 03 June 2024

Volume 14, article number 12713, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Predictive biomarkers of COVID-19 prognosis identified in Bangladesh patients and validated in Japanese cohorts

Download PDF

Kazuko Uno¹,
Abu Hasan²,
Emi E. Nakayama³,
Rummana Rahim²,
Hiromasa Harada⁴,
Mitsunori Kaneko⁵,
Shoji Hashimoto⁶,
Toshio Tanaka⁷,
Hisatake Matsumoto⁸,
Hitoshi Fujimiya⁹,
Tatsuo Shioda³,
Mizanur Rahman² &
…
Kazuyuki Yoshizaki¹⁰

662 Accesses
83 Altmetric
Explore all metrics

Abstract

Despite high vaccination rates globally, countries are still grappling with new COVID infections, and patients diagnosed as mild dying at home during outpatient treatment. Hence, this study aim to identify, then validate, biomarkers that could predict if newly infected COVID-19 patients would subsequently require hospitalization or could recover safely with medication as outpatients. Serum cytokine/chemokine data from 129 COVID-19 patients within 7 days after the onset of symptoms in Bangladesh were used as training data. The majority of patients were infected with the Omicron variant and over 88% were vaccinated. Patients were divided into those with mild symptoms who recovered, and those who deteriorated to moderate or severe illness. Using the Lasso method, 15 predictive markers were identified and used to classify patients into these two groups. The biomarkers were then validated in a cohort of 194 Covid patients in Japan with a predictive accuracy that exceeded 80% for patients infected with Delta and Omicron variants, and 70% for Wuhan and Alpha variants. In an environment of widespread vaccination, these biomarkers could help medical practitioners determine if newly infected COVID-19 patients will improve and can be managed on an out-patient basis, or if they will deteriorate and require hospitalization.

Cytokine signature and COVID-19 prediction models in the two waves of pandemics

Article Open access 21 October 2021

Stratification of hospitalized COVID-19 patients into clinical severity progression groups by immuno-phenotyping and machine learning

Article Open access 17 February 2022

Comparing machine learning screening approaches using clinical data and cytokine profiles for COVID-19 in resource-limited and resource-abundant settings

Article Open access 28 June 2024

Introduction

The COVID-19 pandemic has infected over 700 million people and resulted in more than 7 million deaths worldwide as of September 2023. Prior to the availability of vaccines, approximately 80% of infected patients were asymptomatic or mildly ill, while 15%, mainly the elderly, developed severe pneumonia, and 5% progressed to the fatal acute respiratory distress syndrome (ARDS)¹. Severe COVID-19 has also been associated with various complications, including vasculitis, thrombosis, cerebral infarction, myocardial damage, and multiple organ failure^2,3. Risk factors for severe COVID-19 symptoms include underlying conditions such as cardiovascular disease, hypertension, diabetes, chronic lung disease, and chronic kidney disease, as well as demographic factors like age, obesity, and smoking^4,5,6,7.

In light of the above, hospitalization rates across the globe was extremely high, leading to shortages in not only hospital beds, but also medical personnel, ventillators and other essential medical equipment. In Japan for example, at the beginning of the pandemic, mortality rate was high and all COVID patients were hospitalized which caused a massive over-run on medical facilities. As a solution, the government mandated that some private hotels were to be used to house infected patients; this lasted until May 2023 well after vaccination had started.

Mortality rates gradually decreased when global vaccination began in early 2021, by end of that year vaccination rates stood at about 80% in Japan and 90% in individuals aged over 50 (https://www.mhlw.go.jp/stf/seisakunitsuite/bunya/kenkou_iryou/kenkou/kekkaku-kansenshou/yobou-sesshu/syukeihou_00002.html). However, the emergence of the Omicron and other variants presented new challenges and in August 2021 Japan had a doubling of infections with the span of a week even in rural areas. The initial Omicron variant, BA.1, showed reduced affinity for lung epithelial cells but higher transmissibility, resulting in attenuated disease severity^8,9,10. Subsequent Omicron variants such as BA.5 and XBB regained affinity for the lungs, and the effectiveness of treatments with anti-S protein monoclonal antibodies declined due to escape mutations^11,12. Antiviral drugs such as Remdesivir, Ritonavir-boosted Nirmatrelvir and Molnupiravir¹³ were available but the number of deaths in Japan in 2022 exceeded the number of cases seen in 2020–2021, thereby overwhelming hospital capacity. As a result, due to new infections skyrocketing, even the makeshift hospitals housed inside hotels, exceeded maximum capacity. Hence since early 2022 in Japan, only cases that were judged as being severe were hospitalized, leading to a well-needed steady decrease in the number patients who were admitted. By September 2022, out-pattient treatment had become commonplace.

Outpatient care however presented other major challenges including high death rates among the elderly and among patients who were considered as mild on their first hospital visit. Data from the Japanese Ministry of Health Labour and Welfare in 2022, showed that among patients who died during home care, 58% were 80 years and older, and only 20% were confirmed as being unvaccinated. Additionally, 42% of these patients who died while being treated on an out-patient basis were initially diagnosed as mild cases (https://www.mhlw.go.jp/content/10900000/001021500.pdf).

In this context where COVID patients can still experience severe disease progression despite the widespread use of vaccines, having a predictive model that can easily can distinguish patients at high risk of deterioration with just a blood sample would be highly beneficial.

Patients with severe COVID-19 have exhibited elevated levels of inflammatory cytokines and chemokines^4,5,7,14,15, especially Interleukin 1 (IL-1), Tumor Necrosis Factor alpha (TNFα), and IL-6^16,17,18,19. This has led to the hypothesis that ARDS in COVID-19 is driven by a cytokine storm^20,21. With this in mind, in this study, we quantified cytokines/chemokines/soluble receptors in serum samples from COVID-19 patients in Bangladesh who visited the hospital during the omicron stage within 7 days on the onset of symptoms. We used machine learning techniques to successfully identify a combination of 15 biomarkers that could predict if patients would subsequently deteriorate to severe COVID-19 or if they would have only mild symptoms that could be treated with out-patient care.

Results

Predicting patients’ outcome using pretreatment serum cytokine/chemokine levels within 7 days of onset of COVID-19 from Bangladesh data

The serum from patients in Bangladesh collected December 2021–December 2022 was used to train this decision model for several reasons (Table 1). First, the Japanese serum of COVID patients covers a period of approximately 2 years, during which several COVID-19 variants (Wuhan, Alpha, Delta, and Omicron) emerged, raising concern about the model’s reliability due to the diversity of the data. In contrast, the Bangladesh data was limited to a single strain, and serum sample were collected immediately after the onset of the symptoms as described in the Materials and Methods (Table 2). For these reasons, the Bangladeshi samples were more likely to satisfy the uniformity required for the learning model, and therefore the biomarkers idenitified in the Bangladeshi sample were used to predict recovery or deterioration in Japanese patients.

Table 1 Demographic of COVID-19 patients.

Full size table

Table 2 Timing of sampling of patient specimens.

Full size table

The prediction model was constructed by categorizing patients in two groups. The first group was labeled “no hospitalization required” because they were only mildly ill (N = 64 patients) throughout the course of the disease and did not develop pneumonia (although some wealthy patients were hospitalized due to anxiety). The second group (N = 65) was labeled “hospitalization required”, because although they were only mildly ill initially, they subsequently developed pneumonia. Additionally, data from 23 healthy individuals were added to the “no hospitalization required” group to ensure that the model could reliably predict that hospitalization is not necessary when the data from healthy subjects were entered. When the number of patients in each group was known, variables were selected using Logistic LASSO regression²² with the weight of the inverse of the ratio of the number of each on the objective variable side, and we identified a stable 15-biomarker model with an AUC of 0.9321 (Fig. 1A).

As the number of patients in each group is known, the variables were selected using Logistic LASSO regression²² with weights on the objective variable side that are the inverse of the respective headcount ratios, and 15 predictive markers were identified.

Table 3 presents an overview of the performance metrics for this logistic regression model. For the training data (Bangladesh), the model showed a specificity of approximately 91.95%, a sensitivity of 84.85%, and an overall accuracy of 88.89%. Both the negative predictive value (NPV) and the positive predictive value (PPV) were found to be 88.89%. When validated with the Japanese dataset, the results were similar, although the model displayed a lower specificity of about 87.32% and a lower sensitivity of 86.30%. With an accuracy of 86.81%, it was also slightly lower than the training dataset. The NPV was 86.11%, while the PPV were both at 87.50%. These metrics suggest that the logistic regression model performance was consistent across the two diverse datasets, demonstrating robust specificity, sensitivity, and accuracy in both training and validation phases.

Table 3 Performance Metrics of the Logistic Regression Model for Both Training and Validation Datasets.

Full size table

The 15 predictive biomarkers extracted from the Bangladesh data are: BAFF, CTACK, Eotaxin, HGF, IL-6, IL-13, M-CSF, MCP-3, MMP-3, Osteocalcin, SDF-1a, TNF-R1, TNF-R2, TRAIL and TWEAK (Table 4). The data for Bangladesh and Japanese patients, divided into healthy, mild, and severe cases are shown in Fig. 2. BAFF, CTACK, HGF, IL-6, M-CSF, MMP-3, TNF-R1, and TNF-R2 were higher in the severe group than in the mild group, while TRAIL, osteocalcin, and TWEAK showed a decreasing trend. The patients’ final outcomes and the percentage of those outcomes that were correctly predicted by the model are shown in Table 5.

Table 4 The best combination of predictive markers.

Full size table

Table 5 Percentage of correct predictions for each wave of COVID-19 in Japan by markers extracted from the Bangladesh model.

Full size table

Attempting to predict prognosis in Japanese patients by extracted predictive markers

The Bangladesh training data set (Table 4) were fitted to the data of Japanese patients to predict their prognosis and compare the predictions to patients’ actual clinical outcomes. As shown in Fig. 1B, the results yielded an AUC value of 0.9219 suggesting that the extracted biomarkers made accurate predictions of the prognosis of patients in Japan (Table 5). Comparing the biomarker predictions with the actual clinical outcomes of Japanese subjects, particularly in the second and fourth waves, showed that a high percentage of subjects (26.3% and 39.5%, respectively) were judged as “mild” initially, even though their actual outcome was “severe/moderate”. However, the percentage of correct predictions increased in the 5th and 6th waves. In addition, 60% of the respondents whose predictions were inaccurate were aged 70 years or older, with 40% being over 80 years of age. This suggests that elderly patients are at higher risk of deterioration and should be carefully monitored, even if they initially present with only mild symptoms^23,24.

In waves 5 and 6, after the vaccine was widely available in Japan, the accuracy rate of the predictive model exceeded 80%. In Bangladesh, approximately 88% of patients were vaccinated, and n Japan, the vaccine became available to the elderly on May 24, 2021, and to ages 12 and older in August. By August, more than 80% of the elderly had been vaccinated. Our results indicate that the predictive biomarker panel is particularly useful in determining the risk of severe disease in COVID-19 patients in environments where public vaccination rates are high.

Discussion

Since late 2021, the development and widespread use of COVID-19 vaccines and therapeutics has reduced the severity of the disease. As a result, many patients are cured with medication and home care, however, some later become seriously ill and require hospitalization. It is therefore important to determine early in the course of the disease whether patients will recover from the mild symptoms or will worsen and need to be hospitalized. We therefore measured serum cytokine/chemokine/soluble receptor levels in untreated COVID-19 patients presenting at the MILD stage within 7 days of the onset of the disease. The patients were then identified whether they improved as they were or deteriorated to moderate/severe, from which deterioration markers were calculated using the least absolute shrinkage selection operator (LASSO) regression analysis method. We believe that cytokines/chemokines/soluble receptors in the serum reflect a variety of conditions, including the patient's underlying disease, and therefore can be used to determine the worsening status even in the absence of clinical background information.

Data from Evercare Hospital Dhakab in Bangladesh, where patients had nearly identical viral strains, was used as training data for the analysis. From the data 15 biomarkers were extracted: BAFF, CTACK, Eotaxin, HGF, IL-6, IL-13, M-CSF, MCP-3, MMP-3, Osteocalcin, SDF-1a, TNF-R1, TNF-R2, TRAIL and TWEAK (Table 4)²⁵. Based on this group of biomarkers, we predicted patient prognosis by applying it to the cytokines/chemokines/soluble receptors data of Japanese patients. The Bangladesh data was collected over 9 months, whereas the Japanese data was collected over a 2-year span. The Japanese data included patients from wave 2 to wave 6 during which time the viral strains were evolving. Therefore, when calculated, the correct response rate from wave 2 to wave 4 was slightly lower, especially in wave 4, where there were more alpha strains (Table 5). This may be because the alpha strain is highly infectious and carries a high risk of mortality.

Interestingly, during waves 2–4, the model tended to underestimate the severity of disease progression in elderly patients, with 60% of the incorrectly predicted patients being over 70 years of age, and 40% being over 80 years old. These cases were initially predicted by the bio-markers to have a mild outcome but patients actually became severe; disease severity was also incorrectly predicted in older patients whose cytokine levels did not significantly increase. Based on these results, it can be said that the biomarkers selected in this study are very effective for predicting the prognosis of COVID-19 patients after widespread vaccination among the public and that caution should be exercised with elderly patients, even if their symptoms are initially classified as mild.

Data from Evercare Hospital Dhakab, Bangladesh, showed that CTACK, G-CSF, HGF, IL-2Ra, IL-6, IL-8, IL-10, IL-12p40, M-CSF, PDGF, SCF, BAFF, CD-30, CD-163, sgp130, IL-11, MMP-3, Pentraxin-3, TNF-R1, TNF-R2 and TSLP were significantly higher in the severe group than in the mild group. The inclusion of many of the markers that differ between severe and mild cases, such as CTACK, HGF, IL-6, M-CSF, BAFF, MMP-3, TNF-R1 and TNF-R2. In this study, differential markers identified for mild and severe COVID-19 cases support the validity of these markers.

As shown in Fig. 2, BAFF, CTACK, HGF, IL-6, M-CSF, MCP-3, MMP-3, TNF-R1 and TNF-R2 tended to increase when patients worsened. We theorize that there may be an underlying mechanism that connects these biomarkers with COVID severity. Among these markers some are associated with inflammation, mortality or lung damage. For example, IL-6, an important mediator of cytokine release syndrome (CRS) toxicity^{23,26,27,28,29,30,31,32,33,34}, signals inflammatory response leading to coagulation, similar to HGF³⁵. Additonally, expression levels of HGF and MCP-3 were reported to correlate positively with the Murray score used to assess the severity of lung injury in acute respiratory distress syndrome (ARDS)^14,36,37,38. Other cytokines such as MMP3 also play an important role in lung pathological processes such as ARDS, ALI, and lung fibrosis^39,40,41. Predictors of motality include soluble tumor necrosis factor receptor 1 and 2 which have been reported as predictive markers of death in patients with severe COVID-19^42,43. As well, serum TWEAK levels rose during the first week of patients being in intensive care unit (ICU), whereas a decline to baseline values were observed in the second week post-ICU admission (p = 0.032) but not for patients who died while in hospital²⁵. An analysis of receiver-operator characteristics demonstrated that serum TWEAK at the time patients were admitted to ICU is a significant predictor of in-hospital mortality (AUC = 0.689, p = 0.019) in that TWEAK showed a decreasing trend when patients worsened (Fig. 2). Finally BAFF levels hinted at significantly higher risk of both in-hospital and 30-day mortality⁴⁴. To summarize, many of the predictive bio-markers we identified were associated with symptoms typical of COVID-19 in particular lung damage which exacerbates symptoms in COVID patients compared to other viral diseases such as influenza.

While several cytokines and chemokines have been identified as markers for COVD-19 exacerbation, there are still few reports on markers for COVID-19 in patients that have been vaccinated. This study, successfully identified a panel of 15 biomarkers that could predict, with particularly high accuracy, whether vaccinated COVID-19-infected individuals would subsequently have mild symptoms or worsen to medium or severe states that required inpatient treatment. This was done by quantifying serum cytokines within seven days of COVID-19 onset.

A major strength of this study includes the use of a standardized, well-characterized dataset from Bangladesh to develop the predictive model, and the subsequent validation of the model using a diverse dataset from Japan, which spanned multiple COVID-19 variants. The consistent performance of the model across these two distinct populations suggests its robustness. As well, another strength of this predictive model is that blood cytokines and chemokines can be used to predict patient outcome without other clinical markers or details relating to patients’ history of other health issues that may affect the underlying disease status of the patient. Our previous studies have shown that serum cytokine/chemokine levels also reflect the patient's overall health, including the state of any underlying diseases^45,46 and this allows the model to have potential for broader application. As well, even though high percentages of the population is vaccinated, the future of COVID-19 is not certain, and there are still moderate to severe cases presenting at hospitals in addition to mortality among patients diagnosed as mild cases. Our predictive model could help to alleviate hospital congestion by quickly identifying patients that can recover safely as out-patients, while identifying mild cases that will subsequently worsen so that these patients can be hospitalized and receive preventative treatment. This is important particurly in Japan, since as an aging nation, it has a high percentage of patients over 70 who are vulnerable to deterioration. As well, similar to other least developed nations, public resources have been drained in Bangladesh due to the pandemic, hence being able to ascertain with high accuracy which COVID-19 cases are appropriate for out-patient treatment versus those that require hospitalization can go a long way in using the country’s limited and strained resources efficiently, while still protecting the health of citizens.

Materials and methods

Subjects

Patients with COVID-19 in Bangladesh

Patients with clinically suspected SARS-CoV-2 infection who visited Evercare Hospital Dhaka between December 25, 2021 and September 21, 2022 were considered for this study. The study was approved by the Ethical Practice Committee of Evercare Hospital Dhaka (approval number ERC 33/2022-01) and the Research Ethics Committee of the Research Institute for Microbial Diseases, Osaka University, Japan (No. 2021-3).

From the patients considered, 129 confirmed COVID-19 patients who were within 7 days of onset were included. Most of these cases in Bangladesh coincide with the sixth and part of the seventh wave in Japan. Patients were recruited at the first visit to the hospital, blood serum was collected, and the attending physician classified the outcomes as mild, moderate, or severe according to the criteria established by the World Health Organization (WHO)⁴⁷ (Table 1), at the time of discharge. Disease severity was not determined on the patients’ first visit, rather it was on their case sheets at the time they were discharged from the hospital. Laboratory confirmed Mild COVID-19 cases were those with one or more symptoms (e.g., fever, cough, runny nose, fatigue, headache, nausea, vomiting, diarrhea, chest pain, abdominal pain, and loss of taste or smell), but lacked shortness of breath, dyspnea on exertion, and abnormal radiological findings. Laboratory confirmed Moderate COVID-19 cases were those with pneumonia, with oxygen saturation > 93% and may have required low oxygen support. Severe cases developed COVID-19 pneumonia and required hospitalization; patients had dyspnea, respiratory frequency ≥ 30 breaths /min, blood oxygen saturation ≤ 93% on room air, lung infiltrates > 50%, and may have required mechanical ventilation and/or ICU support.

All patients visited the hospital with mild symptoms; 64 subsequently remained in mild condition, 46 declined to moderate, and 19 deteriorated to severe stage . The mean time of hospital visit from onset of symptoms was 2.3 ± 0.12 days for mild patients, 2.3 ± 0.14 days for moderate, and 3.0 ± 0.37(mean ± SE) days for severe patients. COVID-19 were diagnosed based on the PCR tests and the date of onset and vaccination status was recorded by the doctor with the patients’ interview. Most of the patients’ laboratory findings were reported preciously (ref.⁴⁸).

Patients with COVID-19 in Japan

This sample consisted of 197 patients with clinical suspicion of SARS-CoV-2 infection who were admitted to Habikino hospital, and Tokushukai Hospital from the end of June 2020 to the middle of June 2022^49,50. All patients provided blood samples on their first visit and written informed consent and the study was approved by the Ethics Committee of Osaka Habikino Medical Center (Approved ID: 150-7), Tokushukai Hospital (TGE01547) and Louis Pasteur Center for Medical Research (LPC.29). This study followed the principles of the Declaration of Helsinki, and was approved by the institutional review board of Osaka University Hospital (No-885). Data for healthy subjects were obtained from Louis Pasteur Center for Medical Research (LPC.8 and LPC.25).

In Japan, the disease severity of patients was determined at hospital admission according to The Guideline for Medical Treatment of COVID-19 (https://www-mhlw-go-jp/content/000785119-pdf). COVID-19 is classified as Mild, Moderate I, Moderate II, and Severe. However, for comparison with Bangladesh, Moderate II and Severe cases were combined and considered as Severe. The severity of illness here refers to the final outcome of the patient and not their condition at the time they were admitted to hospital. The breakdown of patients, the severity classification and age distribution are shown in Table 1. As indicated in Table 1, 197 untreated COVID-19 patients who visited Habikino and Tokushukai Hospitals within 7 days of onset were included. The date of onset was determined by the doctor based on the results of PCR tests and interviews with the patient. The mean number of days to onset were Mild: 3.08 ± 0.32, Moderate: 3.24 ± 0.45, and Severe: 3.71 ± 0.21 days. Ninety one healthy Japanese subjects with an average age of 63.6 ± 1.9 years were also included.

Cytokines/chemokines/soluble receptors assay

Cytokines, chemokines, and soluble receptors were quantified using Bio-Plex 200, a multiplex cytokine array system (Bio-Rad Laboratories, CA, USA) according to the manufacturer's instructions. Blood sera from healthy subjects and COVID-19 patients were collected and centrifuged at 1600 g for 10 min. Serum samples were frozen at − 80 °C until they were analyzed. We simultaneously quantified cytokines, chemokines, and soluble receptors. The Bio-Plex Human Cytokine 48-Plex Panel and Inflammation Panel (Bio-Rad Laboratories, CA, USA) was used to simultaneously quantify 78 items: CTACK, Eotaxin, FGF basic, G-CSF, GM-CSF, GRO-α, HGF, IFN-α2, IFN-γ, IL-1α, IL-1β, IL-1ra, IL-2, IL-2Rα, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-12(p40), IL-12(p70), IL-13, IL-15, IL-16, IL-17, IL-18, IP-10, LIF, MCP-1(MCAF), MCP-3, M-CSF, MIF, MIG, MIP-1α, MIP-1β, β-NGF, PDGF-BB, RANTES, SCF, SCGF-β, SDF-1α, TNF-α, TNF-β, TRAIL, VEGF) and inflammation panel (37 plex: APRIL, BAFF, CD30, CD163, Chitinase, sgp130, IFN-α 2, IFN-β, IFN-γ, sIL-6Ra, IL-10, IL-11, IL-12(p40), IL-12 (p70), IL-19, IL-20, IL-22, IL-26, IL-27, IL-28A, IL-29, IL-32, IL-34, IL-35, LIGHT, MMP-1, MMP-2, MMP-3, Osteocalcin, Osteopontin, Pentraxin-3, sTNF-R1, sTNF-R2, TSLP, TWEAK. The following were excluded from the inflammation panel for data analysis due to overlap with 48-plex: IFN-α2, IFN-γ, IL-2, IL-8, IL-10, IL-12(p40), and IL-12(p70). Patients’ samples were quantified several times over a 2-year period. Since there were lot-to-lot and measurement-to-measurement errors, these data were corrected based on the values of healthy subjects. The Bangladeshi samples were simultaneously measured with healthy Japanese subjects as a control, and the values of the healthy subjects were used as a reference to correct for lot-to-lot and inter-measurement errors; this was then used to make comparisons with the serum samples of patients in Japan. Biomarkers with large inter-kit errors and low measurement sensitivity were excluded from the analysis: IL-19, IL-20, IL-26, IL-28A, IL-29, IL-35, LIGHT.

Statistical analysis

The distribution of cytokine/chemokine/soluble receptor values in healthy controls was analyzed to determine whether the raw values or log-transformed values were more normally distributed. All parameters had log-transformed values that were more normally distributed (data not shown), and so these were used in our analysis⁴⁵. The t-test results for the data used in Bangladesh and Japan, categorized by mild, moderate, and severe disease are shown in Fig. 2. To correct for inter-measurement error, Japanese healthy controls were used as reference and adjusted accordingly. To allow for multiple comparisons, the p-values were corrected by the Holm using the p.adjust function in the stats package of the R language.

ANOVA was performed and quantitative data was presented as means ± SEM. The significance of the difference between the groups was evaluated using Dunnett’s test with a value of p < 0.05 considered significant. All statistical analyses were carried out with JMP 20.0 Statistical Software (JMP Statistical Discovery LLC, NC, USA).

The aim of the study was to use cytokines/chemokines/soluble receptors data, collected within 7 days of COVID onset, to predict whether a patient would subsequently deteriorate to a moderate disease state or worse and require hospitalization. To achieve this, we employed a binary logistic regression model. To refine the predictors, we utilized the Least Absolute Shrinkage and Selection Operator (LASSO) regression to select relevant cytokines as candidate markers²². The selection of the optimal number of variables was guided by Leave-One-Out Cross Validation (LOO CV). LOO-CV is known to be over-trained, however, in this case, the correct generalization performance could be evaluated using validation data, without the need to consider variations due to partitioning (such as k-fold).

Due to the disproportionate distribution of disease severities in the Bangladeshi COVID-19 patients, we applied weights to the Logistic LASSO regression model to minimize potential bias in the data. In line with the WHO severity classification, the groups that did not require hospitalization were defined as those with mild illness and recovered, while the groups that were hospitalization and required medical treatment were defined as moderate and severe. In addition, healthy individuals were added to the “no hospitalization group” so that the model would correctly predict that hospitalization was not necessary. Here, the number of persons in each group is clinically known. Characteristic markers for the smallest group, especially for the most severely ill patients, may be buried. Therefore, we determined the ratio of the number of people in each breakdown and weighted the objective variable by its reciprocal (the glmnet function of the glmnet package has a weight argument, which can be used to assign weights to the objective variable).

The performance of the logistic regression model was assessed using both training and validation datasets. This included determining the Area Under the Curve (AUC) from the Receiver Operating Characteristic (ROC) curve, and calculating performance metrics such as specificity and sensitivity. All analyses were conducted using the R language v4.2 (https://www.r-project.org/), with the glmnet 4.1-7 package supporting variable selection and logistic regression analysis. The pROC package version 1.18.4 was utilized to evaluate the model's performance through the ROC curve and AUC.

Data availability

The data that support the findings of this study are available from Osaka Habikino Medical Center, Tokushukai Hospital and Evercare Hospital Dhaka. However, the data is not publicly available since it was obtained under license for the current study. Upon reasonable request howoever, the data may be obtained from the corresponding author Kazuko Uno with permission from the respective institutions.

References

Sugiyama, M. et al. Serum CCL17 levels becomes a predictive marker to distinguish mild/moderate and severe/critical disease in patients with COVID-19. Gene 766, 145145. https://doi.org/10.1016/j.gene.2020.145145 (2021).
Article CAS PubMed Google Scholar
Cao, X. COVID-19: Immunopathology and its implications for therapy. Nat. Rev. Immunol. 20(5), 269–270 (2020).
Article CAS PubMed PubMed Central Google Scholar
Huang, C. et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395(10223), 497–506 (2020).
Article CAS PubMed PubMed Central Google Scholar
Herold, T. et al. Elevated levels of IL-6 and CRP predict the need for mechanical ventilation in COVID-19. J. Allergy Clin. Immunol. 146(1), 128–136-e124 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mehta, P. et al. COVID-19: Consider cytokine storm syndromes and immunosuppression. Lancet 395(10229), 1033–1034 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ruan, Q. et al. Clinical predictors of mortality due to COVID-19 based on an analysis of data of 150 patients from Wuhan, China. Intensive Care Med. 46(5), 846–848 (2020).
Article CAS PubMed PubMed Central Google Scholar
Wu, C. et al. Risk factors associated with acute respiratory distress syndrome and death in patients with coronavirus disease 2019 pneumonia in Wuhan, China. JAMA Intern. Med. 180(7), 934–943 (2020).
Article CAS PubMed Google Scholar
Pulliam, J. R. C. et al. Increased risk of SARS-CoV-2 reinfection associated with emergence of Omicron in South Africa. Science 376(6593), eabnb4947 (2022).
Article Google Scholar
Meng, B. et al. Altered TMPRSS2 usage by SARS-CoV-2 Omicron impacts infectivity and fusogenicity. Nature 603(7902), 706–714 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Suzuki, R. et al. Attenuated fusogenicity and pathogenicity of SARS-CoV-2 Omicron variant. Nature 603(7902), 700–705 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Hoffmann, M. et al. Omicron subvariant BA.5 efficiently infects lung cells. Nat. Commun. 14(1), 3500 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Aggarwal, A. et al. SARS-CoV-2 Omicron BA.5: Evolving tropism and evasion of potent humoral responses and resistance to clinical immunotherapeutics relative to viral variants of concern. EBioMedicine 84, 104270 (2022).
Article CAS PubMed PubMed Central Google Scholar
Schilling, W. H. K. et al. Antiviral efficacy of molnupiravir versus ritonavir-boosted nirmatrelvir in patients with early symptomatic COVID-19 (PLATCOV): An open-label, phase 2, randomised, controlled, adaptive trial. Lancet Infect. Dis. https://doi.org/10.1016/S1473-3099(23)00493-0 (2023).
Article PubMed PubMed Central Google Scholar
Yang, Y. et al. Plasma IP-10 and MCP-3 levels are highly associated with disease severity and predict the progression of COVID-19. J. Allergy Clin. Immunol. 146(1), 119–127-e114 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhang, X. et al. Viral and host factors related to the clinical outcome of COVID-19. Nature 583(7816), 437–440 (2020).
Article ADS CAS PubMed Google Scholar
Chen, N. et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: A descriptive study. The Lancet 395(10223), 507–513 (2020).
Article CAS Google Scholar
Chen, X. et al. Detectable serum severe acute respiratory syndrome coronavirus 2 viral load (RNAemia) is closely correlated with drastically elevated interleukin 6 level in critically ill patients with coronavirus disease 2019. Clin. Infect. Dis. 71(8), 1937–1942 (2020).
Article ADS CAS PubMed Google Scholar
Giamarellos-Bourboulis, E. J. et al. Complex immune dysregulation in COVID-19 patients with severe respiratory failure. Cell Host Microbe 27(6), 992–1000 (2020).
Article CAS PubMed PubMed Central Google Scholar
Huang, C. et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395, 497–506. https://doi.org/10.1016/s0140-6736(20)30183-5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Hirano, T. & Murakami, M. COVID-19: A new virus, but a familiar receptor and cytokine release syndrome. Immunity 52(5), 731–733 (2020).
Article CAS PubMed PubMed Central Google Scholar
McGonagle, D. et al. The role of cytokines including interleukin-6 in COVID-19 induced pneumonia and macrophage activation syndrome-like disease macrophage activation syndrome-like disease. Autoimmun. Rev. 19(6), 102537 (2020).
Article CAS PubMed PubMed Central Google Scholar
Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 58(1), 267–288 (1996).
Article MathSciNet Google Scholar
Lee, D. W. et al. Current concepts in the diagnosis and management of cytokine release syndrome. Blood 124(2), 188–195. https://doi.org/10.1182/blood-2014-05-552729 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hirahata, K. et al. Characteristics of long COVID: Cases from the first to the fifth wave in greater Tokyo, Japan. J. Clin. Med. https://doi.org/10.3390/jcm11216457 (2022).
Article PubMed PubMed Central Google Scholar
Mikacic, M. et al. Dynamic of serum TWEAK levels in critically Ill COVID-19 male patients. J. Clin. Med. 11(13), 3699 (2022).
Article CAS PubMed PubMed Central Google Scholar
Coomes, E. A. & Haghbayan, H. Interleukin-6 in Covid-19: A systematic review and meta-analysis. Rev. Med. Virol. 30(6), 1–9 (2020).
Article CAS PubMed Google Scholar
Nakayama, E. E. et al. Anti-nucleocapsid antibodies enhance the production of IL-6 induced by SARS-CoV-2 N protein. Sci. Rep. 12, 8108. https://doi.org/10.1038/s4598-022-12252-y (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Galvan-Roman, J. M. IL-6 serum levels predict severity and response to tocilizumab in COVID-19: An observational study. J. Allergy Clin. Immunol. 147(1), 72-80e78 (2021).
Article CAS PubMed Google Scholar
Chi, Y. et al. Serum cytokine and chemokine profile in relation to the severity of coronavirus disease 2019 in China. J. Infect. Dis. 222(5), 746–754. https://doi.org/10.1093/infdis/jiaa363 (2020).
Article CAS PubMed Google Scholar
Ling, L. et al. Longitudinal cytokine profile in patients with mild to critical COVID-19. Front. Immunol. https://doi.org/10.3389/fimmu.2021.763292 (2021).
Article PubMed PubMed Central Google Scholar
Liu, Y. et al. An inter-correlated cytokine network identified at the center of cytokine storm predicted COVID-19 prognosis. Cytokine https://doi.org/10.1016/j.cyto.2020.155365 (2021).
Article PubMed PubMed Central Google Scholar
Kwon, J.-S. et al. Factors of severity in patients with COVID-19: Cytokine/chemokine concentrations, viral load, and antibody responses. Am. J. Trop. Med. Hyg. 103(6), 2412–2418. https://doi.org/10.4269/ajtmh.20-1110 (2020).
Article CAS PubMed PubMed Central Google Scholar
Queiroz, M. A. F. et al. Cytokine profiles associated with acute COVID-19 and long COVID-19 syndrome. Fron. Cell. Infect. Microbiol. https://doi.org/10.3389/fcimb.2022.922422 (2022).
Article Google Scholar
Lucas, C. et al. Longitudinal analyses reveal immunological misfiring in severe COVID-19. Nature 584, 463–469. https://doi.org/10.1038/s41586-020-2588-y (2020).
Article CAS PubMed PubMed Central Google Scholar
Zaira, B. et al. Correlation between hepatocyte growth factor (HGF) with D-dimer and interleukin-6 as prognostic markers of coagulation and inflammation in long COVID-19 survivors. Curr. Issues Mol. Biol. 45(7), 5725–5740 (2023).
Article CAS PubMed PubMed Central Google Scholar
Gelzo, M. et al. Matrix metalloproteinases (MMP) 3 and 9 as biomarkers of severity in COVID-19 patients. Sci. Rep. 12(1), 1212 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, L. et al. Scoring cytokine storm by the levels of MCP-3 and IL-8 accurately distinguished COVID-19 patients with high mortality. Sign. Transduct. Target Ther. 5(1), 292 (2020).
Article MathSciNet CAS Google Scholar
Perreau, M. et al. The cytokines HGF and CXCL13 predict the severity and the mortality in COVID-19 patients. Nat. Commun. 12(1), 4888 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, H. S. & Kim, W. J. The role of matrix metalloproteinase in inflammation with a focus on infectious diseases. Int. J. Mol. Sci. 23(18), 10546 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kadry, R. et al. Pharmacological inhibition of MMP3 as a potential therapeutic option for COVID-19 associated acute respiratory distress syndrome. Infect. Disord. Drug Targets 21(6), e170721187996 (2021).
Article CAS PubMed PubMed Central Google Scholar
Shi, S. et al. Matrix metalloproteinase 3 as a valuable marker for patients with COVID-19. J. Med. Virol. 93(1), 528–532 (2021).
Article CAS PubMed Google Scholar
Tegethoff, S. A. et al. TNF-related apoptosis-inducing ligand, interferon gamma-induced protein 10, and C-reactive protein in predicting the progression of SARS-CoV-2 infection: A prospective cohort study. Int. J. Infect. Dis. 122, 178–187 (2022).
Article CAS PubMed PubMed Central Google Scholar
Choreno-Parra, J. A. et al. Clinical and immunological factors that distinguish COVID-19 from pandemic influenza A(H1N1). Front. Immunol. 12, 593595 (2021).
Article CAS PubMed PubMed Central Google Scholar
Mansouri, L. et al. Role of kidney function and concentrations of BAFF, sPD-L1 and sCD25 on mortality in hospitalized patients with COVID-19. BMC Nephrol. 23(1), 299 (2022).
Article CAS PubMed PubMed Central Google Scholar
Uno, K. et al. Pretreatment prediction of individual rheumatoid arthritis patients’ response to anti-cytokine therapy using serum cytokine/chemokine/soluble receptor biomarkers. PLoS One 10(7), e0132055. https://doi.org/10.1371/journal.pone.0132055.eCollection (2015).
Article PubMed PubMed Central Google Scholar
Uno, K. et al. Cytokine/chemokine changes in plasma of patients with MPO-ANCA RPGN: Before and after IVIg therapy. ADC Lett. Infect. Dis. Control 4(2), 41–43 (2017).
Google Scholar
World Health Organization. Clinical Management of COVID-19. Geneva, Switzerland: WHO. WHO Reference Number:WHO/2019-nCoV/clinical/2020.5 (2020).
Hasan, A. et al. Enhancement of IL-6 production induced by SARS-CoV-2 nucleocapsid protein and Bangladeshi COVID-19 patients’ sera. Viruses 15, 2018. https://doi.org/10.3390/v15102018 (2023).
Article CAS PubMed PubMed Central Google Scholar
Kayano, T. et al. Number of averted COVID-19 cases and deaths attributable to reduced risk in vaccinated individuals in Japan. Lancet Reg. Health West. Pac. 28, 100571 (2022).
PubMed PubMed Central Google Scholar
Baba, H. et al. Statistical analysis of mortality rates of coronavirus disease 2019 (COVID-19) patients in Japan across the 4C mortality score risk groups, age groups, and epidemiological waves: A report from the nationwide COVID-19 cohort. Open Forum Infect. Dis. 10, ofac638 (2023).
Article PubMed Google Scholar

Download references

Acknowledgements

This work was supported by JSPS KAKENHI Grant (Grant-in-Aid for Scientific Research (C)) Number JP.20K10460. We thank Center for Infectious Disease Education and Research, CiDER for their support (JM00000160). A.H. is a recipient of a scholarship from the RONPAKU Program of Japan Society for the Promotion of Science (ID no. R12209). We are grateful to Ms. L. Russell for the substantive editing of this manuscript.

Author information

Authors and Affiliations

IFN and Host-Defense Research Laboratory, Louis Pasteur Center for Medical Research, Kyoto, Kyoto, 606-8225, Japan
Kazuko Uno
Evercare Hospital Dhaka, Plot-81, Block-E, Bashundhara R/A, Dhaka, 1229, Bangladesh
Abu Hasan, Rummana Rahim & Mizanur Rahman
Research Institute for Microbial Diseases, Osaka University, Suita, Osaka, 565-0781, Japan
Emi E. Nakayama & Tatsuo Shioda
Yao Tokushukai General Hospital, Yao, Osaka, 581‑001, Japan
Hiromasa Harada
Suita Tokushukai Hospital, Suita, Osaka, 565‑0814, Japan
Mitsunori Kaneko
Osaka Prefectural Hospital Organization Osaka Habikino Medical Center, Habikino, Osaka, 583‑8588, Japan
Shoji Hashimoto
Kinki Central Hospital, Itami, Hyogo, 664-8533, Japan
Toshio Tanaka
Trauma and Acute Critical Care Center, Osaka University, Suita, Osaka, 565‑0871, Japan
Hisatake Matsumoto
Dynacom Co., Ltd., Chiba, Chiba, 261-7125, Japan
Hitoshi Fujimiya
Department of Organic Fine Chemicals, Institute of Scientific and Industry Research, Osaka University, Suita, Osaka, Japan
Kazuyuki Yoshizaki

Authors

Kazuko Uno
View author publications
You can also search for this author in PubMed Google Scholar
Abu Hasan
View author publications
You can also search for this author in PubMed Google Scholar
Emi E. Nakayama
View author publications
You can also search for this author in PubMed Google Scholar
Rummana Rahim
View author publications
You can also search for this author in PubMed Google Scholar
Hiromasa Harada
View author publications
You can also search for this author in PubMed Google Scholar
Mitsunori Kaneko
View author publications
You can also search for this author in PubMed Google Scholar
Shoji Hashimoto
View author publications
You can also search for this author in PubMed Google Scholar
Toshio Tanaka
View author publications
You can also search for this author in PubMed Google Scholar
Hisatake Matsumoto
View author publications
You can also search for this author in PubMed Google Scholar
Hitoshi Fujimiya
View author publications
You can also search for this author in PubMed Google Scholar
Tatsuo Shioda
View author publications
You can also search for this author in PubMed Google Scholar
Mizanur Rahman
View author publications
You can also search for this author in PubMed Google Scholar
Kazuyuki Yoshizaki
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: K.U., E.E.N and K.Y. Investigation: K.U., E.E.N., A.H. Data analysis; K.U. and H.F. Resources: R.U., R.R., H.H., M.K., S.H., T.T, H.M., Writing-Original Draft: K.U. and E.E.N.; Writing-Review and Editing: K.Y, Supervision: K.Y, Project Administration: K.Y. and Tat S; Funding Acquisition: Tat.S, K.U. and K.Y.

Corresponding author

Correspondence to Kazuko Uno.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Uno, K., Hasan, A., Nakayama, E.E. et al. Predictive biomarkers of COVID-19 prognosis identified in Bangladesh patients and validated in Japanese cohorts. Sci Rep 14, 12713 (2024). https://doi.org/10.1038/s41598-024-63184-8

Download citation

Received: 01 November 2023
Accepted: 27 May 2024
Published: 03 June 2024
DOI: https://doi.org/10.1038/s41598-024-63184-8
Springer Nature Limited

Predictive biomarkers of COVID-19 prognosis identified in Bangladesh patients and validated in Japanese cohorts

Abstract

Similar content being viewed by others

Cytokine signature and COVID-19 prediction models in the two waves of pandemics

Stratification of hospitalized COVID-19 patients into clinical severity progression groups by immuno-phenotyping and machine learning

Comparing machine learning screening approaches using clinical data and cytokine profiles for COVID-19 in resource-limited and resource-abundant settings

Introduction

Results

Predicting patients’ outcome using pretreatment serum cytokine/chemokine levels within 7 days of onset of COVID-19 from Bangladesh data

Attempting to predict prognosis in Japanese patients by extracted predictive markers

Discussion

Materials and methods

Subjects

Patients with COVID-19 in Bangladesh

Patients with COVID-19 in Japan

Cytokines/chemokines/soluble receptors assay

Statistical analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Predictive biomarkers of COVID-19 prognosis identified in Bangladesh patients and validated in Japanese cohorts

Abstract

Similar content being viewed by others

Cytokine signature and COVID-19 prediction models in the two waves of pandemics

Stratification of hospitalized COVID-19 patients into clinical severity progression groups by immuno-phenotyping and machine learning

Comparing machine learning screening approaches using clinical data and cytokine profiles for COVID-19 in resource-limited and resource-abundant settings

Introduction

Results

Predicting patients’ outcome using pretreatment serum cytokine/chemokine levels within 7 days of onset of COVID-19 from Bangladesh data

Attempting to predict prognosis in Japanese patients by extracted predictive markers

Discussion

Materials and methods

Subjects

Patients with COVID-19 in Bangladesh

Patients with COVID-19 in Japan

Cytokines/chemokines/soluble receptors assay

Statistical analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation