1 Introduction

Burkitt lymphoma (BL) is an aggressive non-Hodgkin lymphoma that represents a significant health problem in African children (López et al., 2022). The risk factors of BL in sub-Saharan Africa (SSA) include Plasmodium falciparum (the parasite that causes malaria) (Derkach et al., 2019) and Epstein-Barr virus (EBV) (Bornkamm, 2009). These infections may influence BL risk by triggering the hallmark translocation of c-MYC into the vicinity of immunoglobulin genes on chromosomes 14, 2, or 22 (IGMYC)(Alaggio et al., 2022).

BL can be treated successfully in high-income countries (Ozuah et al., 2020), however, cure rates are low in SSA because of limited diagnostic capacity (Chamba et al., 2023), which leads to significant delay and often compels local clinicians to base treatment decisions on clinical diagnosis. The association of BL with P. falciparum and EBV holds promise for innovative, potentially less invasive, faster point-of-care diagnosis of BL, but few studies have evaluated malaria markers, with most focusing on EBV plasma markers (Coghill et al., 2020; Donati et al., 2006; Westmoreland et al., 2017; Xian et al., 2021). For example, in one study that evaluated serological markers for EBV infection, immunoglobulin G antibodies against four EBV peptides (BHRF1, BMRF1, BBLF1, and BZLF1) were associated with moderate discrimination of BL cases from controls, based on an area under the curve (AUC) of 0.76 (95% confidence interval: 0.6 to 0.9) (Coghill et al., 2020). Three studies of EBV DNA in plasma reported better discrimination of BL cases from controls using study-defined EBV DNA thresholds (Donati et al., 2006; Westmoreland et al., 2017; Xian et al., 2021) of which one achieved an AUC of 0.94 (95% CI: 0·85–1·00) in a study of 25 Ugandan children with BL compared to 25 controls (Xian et al., 2021).

Metabolomics offer a powerful platform for investigating biology and biomarkers of risk for multiple conditions (Andrade et al., 2020; Moaddel et al., 2022), yet, to date only limited metabolomic data are available in relation to the risk of BL in humans. In mice, measured serum metabolites discriminate mice implanted with Raji cells (a BL-derived cee line) from control wild-type mice based on differential abundance of metabolites (Yang et al., 2017). Altered metabolite profiles, notably, differences in metabolites in the glycolytic (Na et al., 2019), cholesterol, and lipid pathways (Abdrabou et al., 2021), have been used to demonstrate differences in patients with severe malaria as well as those with asymptomatic infection with low parasitemia (< 1000 parasites/µL) versus those with undetectable parasitemia (Faucher et al., 2002). These associations may have possible relevance for BL in which alterations in lipid metabolism have been reported (Ambrosio et al., 2012) and might be correlated with EBV-mediated effects (Li et al., 2004).

Here, we carried out a pilot study where we measured plasma metabolites in children from Uganda with and without BL (Peprah et al., 2020) in order to obtain their baseline metabolomic profiles, to assess associations with BL, given that both malaria and EBV, which are established environmental and endogenous risk factors of BL, have been associated with disruptions in metabolite levels. We aimed to examine the potential diagnostic utility of blood-based metabolomics for childhood BL.

2 Methods

2.1 Study population

For the present investigation, we selected specimens from 25 male cases and 25 male controls aged 6–11 years enrolled from two districts in Uganda as part of the Epidemiology of Burkitt lymphoma in East African children and minors (EMBLEM) study (Peprah et al., 2020; Legason et al., 2017). Blood samples from BL cases were collected prior to any chemotherapy (including steroids) received in the hospital. BL cases confirmed by local histology/cytology (61% of cases), otherwise by clinical features, imaging, and laboratory results, were enrolled as previously described (Peprah et al., 2020). Controls were enrolled from 100 random representative villages selected in the study area in Uganda, with frequency matching to the age and sex distribution of historical cases in the area (Maziarz et al., 2017). The controls were healthy children based on the fact they were enrolled in their homes at a time where they were living normal lives without symptoms of BL, no history of any cancer diagnosis, and had no complaints requiring medical care. So, the controls were assumed to be cancer free individuals. Venous blood specimens were collected in EDTA from children diagnosed with BL (Peprah et al., 2020) for clinical tests and research. Research blood samples were separated into plasma, buffy coat, and red blood cell fractions and stored frozen at -80o C until testing (Maziarz et al., 2017). P. falciparum infection was determined using microscopy of thick-film blood smears or rapid diagnostic tests (RDT) to detect the P. falciparum-specific malaria histidine-rich protein 2 (Pf-HRP2) and the pan-lactate dehydrogenase (pLDH) antigen shared by other Plasmodia that parasitize humans. The weighted P. falciparum prevalence (PfPR) in these districts was estimated to vary from 55 to 75% (Maziarz et al., 2017) and the average annual cumulative P. falciparum infections per child in the study regions was estimated using Malaria Atlas Project data (Weiss et al., 2019) to vary from 300 to 400 (Broen et al., 2023). The prevalence of stool parasitic infection, evaluated through the Kato method, was minimal (< 1–2%), thus it was excluded from the analytical consideration of this comorbidity.

2.2 Determination of plasma metabolite concentrations

Aliquots (50 µl) of previously unthawed plasma, from samples collected between 8:00–11:00 AM before breakfast, to minimize post-prandial distortions of circulating metabolites, were retrieved and shipped on dry ice for analysis. Metabolites were extracted from plasma (10 µl) and concentrations obtained using the MxP 500 (Biocrates Life Science AG, Austria) following the manufacturer’s protocol. Metabolites were measured using a Nexera High Performance Liquid Chromatography (HPLC) system (Shimadzu) coupled to a 6500 QTRAP® mass spectrometer (AB Sciex) with an electrospray ionization source as previously described (Moaddel et al., 2022). Samples were analyzed by flow injection analysis-tandem mass spectrometry (FIA-MS/MS) and liquid chromatography-tandem mass spectrometry (LC-MS/MS) using a 6500 + QTRAP® instrument. Briefly, two UHPLC methods were run using the MxP Quant 500 Column System for the LC-MS/MS methods with solvent A as water containing 0.2% formic acid) and solvent B as acetonitrile containing 0.2% formic acid. Analytes in the LC-MS/MS part are quantified using either external 7-point calibration curves with labeled standards or internally with labeled standards. Zero samples were used for analyte LODs for LC-MS/MS. For the FIA-MS/MS method, the FIA plate was run at a flow rate of 30 µL/min with FIA solvent as the mobile phase. Analytes in the FIA-MS/MS part are quantified using internal standards. Zero samples were used for analyte LODs for FIA Method 1 and MetIDQ was used to provide LODs for FIA method 2. Concentrations were calculated using the Analyst/MetIDQ software and reported in µmol/L. Data were quantified using appropriate mass spectrometry software (Sciex Analyst®) and imported into Biocrates MetIDQ™ software for further analysis. The data were normalized to internal quality controls. Laboratory staff were blinded to case-control status of the samples. In the quality control, the coefficients of variations (CVs) were < 10% for the majority of the measured metabolites and < 20% for the measured lipids, with few exceptions; 15 metabolites had CV > 30%, none of which were associated with BL in our study.

2.3 Statistical analysis

We analyzed metabolites with values  the limit of detection (LOD) in at least 24% (> 12) of the subjects tested, resulting in 187 metabolites being excluded (Supplementary Tables 1 and 2) and 440 metabolites being studied (Supplementary Table 3). For these 440 metabolites, we set values that were below the LOD (“as missing values”) to one half the value of LOD of that metabolite. Results were log-transformed for analysis.

Unconditional logistic regression models were fit to estimate odds ratios (ORs) and 95% confidence intervals (95% CIs) for the association between BL and one standard deviation (SD) of the log-transformed value of the plasma metabolite. Primary models were adjusted for age at enrollment in single-year increments. Age is positively correlated with cumulative exposure to P. falciparum and naturally acquired immunity. Models were further adjusted for P. falciparum infection as a factor (negative or positive). Sensitivity analyses were performed that stratified by median age at enrollment, P. falciparum infection (negative and positive) and median hemoglobin concetration to assess effect modification of the primary association. Given the small sample size, we calculated the false-discovery rate (FDR) to identify potentially important patterns (FDR ≤ 0.001). We performed forward stepwise logistic regression models to select a subset of metabolites that were independently associated with BL using a p 0·0003 (top 12 metabolites) criterion to enter into the model and a p < 0.05 to remain in the model. We assessed the performance of malaria-related variables (age, P. falciparum infection, and hemoglobin concentration) versus models also including the metabolites identified in the stepwise models to discriminate between children with BL versus those without BL using the Receiver Operating Curve (ROC), with the AUC a as summary performance measure.

Associations between BL and biochemical pathways for 21 metabolite-sub-classes that together comprise the 440 metabolites were assessed. We calculated p-values of association for each pathway using a parametric bootstrap method, such that within each bootstrap replication, p-values were generated from a vector of score test statistics from an estimated covariance matrix with a multivariate normal distribution (mean = 0), based on 100,000 permutations. We also performed a principal components analysis (PCA) using the varimax rotation approach. The pathway score by the first generated principal component for each class was used as a predictor in unconditional logistic regression to assess the associations between BL and the obtained pathway score, controlling for age at enrollment, P. falciparum infection and hemoglobin concetration . Pathway associations were subjected to Bonferroni correction based on 21 chemical class tests performed, i.e., P = 0.0024 (0·05/21). Finally, to improve the interpretation of our metabolite data, we performed exploratory analysis using calculated custom sums and ratios based on predefined values trademarked by Biocrates (MetaborIndicator™) to assess biological pathways (e.g., inflammation) and then examined the association between BL and the custom sums and ratios in the 440 metabolites. The ratio analysis was not adjusted for multiple comparisons.

All statistical analyses were performed using SAS version 9·4 (SAS Institute, Cary, NC), and R version 3·6 ·1 (R Development Core Team, Vienna, Austria).

3 Results

We studied 25 children with BL and 25 children without BL (Table 1). The mean age of the participants with recorded age (n = 46) was 8 years; the exact age was not known for four children. There were no differences between cases and controls with respect to height, weight, or body mass index (Table 1). P. falciparum infection was detected in seven (28%) cases and 11 (44%) controls (P = 0·38), but none of the participants had malaria symptoms. Mean hemoglobin concentration was lower in BL cases than cancer-free controls (10.0 versus 12·4 g/dL, P < 0·0001), consistent with the notion that BL cases are exposed to more malaria and are therefore more likely to have mild anemia (defined as < 11 g/dL) (White, 2018).

Table 1 Characteristics of Burkitt lymphoma cases and cancer-free controls in the study1

The primary analysis identified 42 plasma metabolites associated with BL, including three amino acids and 39 lipids (Table 2), based on an FDR ≤ 0·001 (Supplementary Tables 4 and 5). Most of these metabolites (n = 41) were associated with elevated ORs for BL (4·47 − 25·2) and included triglycerides (TAG), phosphatidylcholines (PCs), diglycerides, hexosylceramides, amino acids and biogenic amines (Table 2). The TAGs were generally positively associated with BL status (Table 2). Putrescine was the only a biogenic amine associated with BL, and only one metabolite, homoarginine, was associated with decreased ORs for BL (OR = 0.1, 95% CI: 0.03, 0.35) (Table 2). Five metabolites, including TAG (18:0_38:6), alpha-aminobutyric acid (AABA), ceramide (d18:1/20:0), PC ae C40:6 and PC ae C38:6, were top signals associated with BL status (ORs = 6.9 to 14.7, P < 2.4✕10− 4). These associations were not changed in sensitivity analyses adjusting for asymptomatic P. falciparum infection (Table 2). There were no material differences in the estimates of associations across subgroups based on median age at enrollment, P. falciparum infection (negative and positive) and median hemoglobin concentration, but the small sample size led to high confidence intervals in the subgroups. Two metabolites associated with elevated ORs of BL, namely, TAG (18:0_38:6) and AABA, were selected using stepwise logistic regression. Models that included these two metabolites, baseline age, P. falciparum infection and hemoglobin concentration discriminated BL cases from controls with an AUC of 0·97 (97% CI: 0·94, 1·00). This model was a significant improvement over the model that only included P. falciparum infection, baseline age, and hemoglobin level (AUC = 0·85, 95% CI: 0·74, 0·96; P for difference = 0·043, Fig. 1).

Table 2 Odds ratios and 95% CIs (per 1-SD) for the association of endemic Burkitt lymphoma and plasma metabolites reaching the FDR < 0.001 threshold based on 25 case-cancer-free control pairs
Fig. 1
figure 1

Receiver operating curve (ROC) and area under the curve (AUC) as a summary of performance of two metabolites identified in stepwise models plus malaria variables (age, P. falciparum status, and hemoglobin conetration ) or when using the malaria variables alone to discriminate Burkitt Lymphoma cases from controls

Results for red ROC curve: AUC = 0·85 (95% CI: 0·74, 0·96) for risk factors including age at enrollment, P. falciparum status and hemoglobin concetration  (low versus high). Results for the blue curve: AUC = 0·97 (95% CI: 0·94, 1·00) for the above-mentioned risk factors and two metabolites selected from the forward stepwise regression models, including triacylglyceride (18:0_38:6) and AABA (alpha-aminobutyric acid) (AUC 0·58 versus 0·97, P value for difference = 0·043)

We observed statistically significant positive associations for BL with high ratio of cystine/cysteine (OR per 1-SD = 3·18, 95% CI 1·06, 9·52, P = 0·039) and low ratio of the sum of all lysophosphatidylcholines (LPCs) to the sum of all PCs (OR per 1-SD = 0·28, 95% CI 0·10, 0·78, P = 0·015, Table 3). Pathway and PC analyses showed significant positive associations of BL with nucleobase metabolites (OR = 15.60), dihydroceramides (OR = 4.36), and ceramides (OR = 2.79), as well as an inverse association of BL with LPCs (OR = 0.25) (Table 4).

Table 3 Association between metabolite ratios and Burkitt lymphoma based on 25 case-control pairs
Table 4 Pathway analysis and principal components analysis (PCA) for the association between chemical sub-classes of plasma metabolites and Burkitt lymphoma in 25 case-control pairs a

4 Discussion

Our findings demonstrate the ability to detect and quantify many plasma metabolites in children with and without BL in Uganda. We found that two metabolites, AABA and TAG (18:0_38:6), whose levels were detected mostly in BL cases, performed reliably to discriminate BL cases from controls in our study. Here we show that concentrations of these two plasma metabolites, plus P. falciparum infection, age and hemoglobin concetration significantly improved the discrimination of BL cases from controls with an AUC of 0.97, compared to an AUC of 0.85 when using only P. falciparum infection, age, and hemoglobin concetration. The AUC of 0.97 also is also higher than that obtained using plasma anti-EBV antibodies (AUC = 0.76) (Coghill et al., 2020), but comparable to that in studies that used EBV DNA copy number (AUC = 0.94) (Mulama et al., 2014; Westmoreland et al., 2017; Xian et al., 2021). Our results suggest that blood-based metabolites could provide another laboratory approach to facilitate the diagnosis of BL, which has hitherto only been shown in mice (Yang et al., 2017).

Our analysis is preliminary with notable limitations: the cross-sectional design limits temporal inferences; the sample size is small; we studied only boys; we performed multiple comparisons. The initial study design aimed to examine associations between environmental and endogenous risk factors for BL. However, our scope encompasses the investigation of the metabolomics profiles of BL, given that both environmental and endogenous factors that have been associated with disruptions in metabolite levels. While, the lack of prospective study to validate our findings, lack of data on comorbidities, and lack of data on diet are limitations, this was meant as a pilot study to test the feasibility of measuring metabolites in our sample set and to generate new hypotheses about metabolite associations with BL. Cross-sectional metabolite profiles give a broad “snapshot” view of differences in the body’s metabolism, which our pilot study suggests warrant closer study to examine the pathophysiology and risk profile of BL. AABA has been associated with early colorectal cancer (Ni et al., 2014), for example, and is a key intermediate in the synthesis of ophthalmic acid, a tripeptide analog of glutathione (Kar et al., 2017). It is possible that endogenous increases in AABA might be a biomarker of ophthalmic acid and associated oxidative damage due to altered cellular redox balance (Soga et al., 2006). Such a mechanism is consistent with our finding for the results assessed by the cystine/cysteine ratio (Banjac et al., 2008). However, increased AABA concentrations can also result from exogenous factors, particularly a diet rich in thiamin and vitamins B2, B3, and B6, as suggested in murine dietary studies (Kar et al., 2017). Diets rich in polyphenols may affect metabolite profiles by altering gut microbiota to favor Clostridum sporogenes (Gao et al., 2020), which can metabolize tryptophan to indole 3-propionic acid (IPA) (Peron et al., 2022). In our study, IPA showed a trend towards decreased ORs for BL (Supplementary Table 4), consistent with its reported anti-inflammatory and anti-oxidant activity (Negatu et al., 2020). In fact, IPA supplementation has been linked to cytostatic and anti-neoplastic properties in breast cancer (Sari et al., 2020).

Putrescine was associated with an increased OR for BL and is consistent with the role of deregulated polyamine metabolism in the development of cancer at all stages (Palmer, & Wallace, 2010). Decreasing intracellular concentrations of polyamines can produce cell stasis and/or cell death; ornithine decarboxylase, an enzyme that converts ornithine to putrescine, could be targeted therapeutically as it is upregulated in several cancers. Moreover, α-difluoromethylornithine (DFMO) (eflornithine), an irreversible inhibitor to ornithine decarboxylase, has been shown to be effective in primary prevention of colorectal cancer (Palmer, & Wallace, 2010). However, cancer cells can compensate for a decreased in polyamine synthesis by increasing uptake of exogenous polyamines via their polyamine transport system (PTS), including sodium dependent transporters for increasing uptake of putrescine (Palmer, & Wallace, 2010).

The high ratio of the sum of LPCs/the sum of PCs is considered a surrogate of phospholipase A2 (PLA2) activity (Park et al., 2012). PLA2 activity is typically associated with intravascular inflammation in cardiovascular disease (Lavi et al., 2008), as well as with brain swelling in cerebral malaria (Pappa et al., 2015). The inverse association with BL, which is opposite that with acute malaria, suggests that the pathophysiology of malaria in BL is different from that in acute malaria syndromes (Vadas et al., 1993). BL is associated with dysregulation of MYC, which is known to transactivate many genes, including of genes involved in fatty acid biosynthesis (Morrish et al., 2010); thus, associations with lipids could reflect reverse-causality. However, given that plasma lipids are elevated in people infected with P. falciparum (Faucher et al., 2002), and with recent reports indicating that lipids reprogram the Krebs cycle to become the preferred source of carbon (versus glucose or glutamine) and increases histone H3 and H4 acetylation and gene activation (McDonnell et al., 2016), it is possible that altered lipids may influence chromatin access in B cells with IG::MYC translocations (McDonnell et al., 2016) and contribute to progression to BL (Basso & Dalla-Favera, 2015; Robbiani et al., 2015). The inverse association between homoarginine, which is a competitive substrate for nitric oxide synthase (NO), with BL status is consistent with NO modulated transcriptional activity of the c-MYC gene promoter (Park, & Wei, 2003).

5 Conclusion

In conclusion, we present preliminary results indicating that plasma metabolites are altered in children with BL compared to healthy controls, with concentrations of two metabolites TAG (18:0_38:6) and AABA as potential biomarkers of the presence of BL in children in Uganda. The findings warrant further examination, given the potential utility of differential abundance of metabolites to reduce diagnostic delays of BL in SSA.