Pancreatic carcinoma, pancreatitis, and healthy controls: metabolite models in a three-class diagnostic dilemma

Leichtle, Alexander Benedikt; Ceglarek, Uta; Weinert, Peter; Nakas, Christos T.; Nuoffer, Jean-Marc; Kase, Julia; Conrad, Tim; Witzigmann, Helmut; Thiery, Joachim; Fiedler, Georg Martin

doi:10.1007/s11306-012-0476-7

Pancreatic carcinoma, pancreatitis, and healthy controls: metabolite models in a three-class diagnostic dilemma

Original Article
Open access
Published: 06 November 2012

Volume 9, pages 677–687, (2013)
Cite this article

Download PDF

You have full access to this open access article

Metabolomics Aims and scope Submit manuscript

Pancreatic carcinoma, pancreatitis, and healthy controls: metabolite models in a three-class diagnostic dilemma

Download PDF

Alexander Benedikt Leichtle¹,
Uta Ceglarek²,
Peter Weinert³,
Christos T. Nakas⁴,
Jean-Marc Nuoffer⁸,
Julia Kase⁵,
Tim Conrad⁶,
Helmut Witzigmann⁷,
Joachim Thiery² &
…
Georg Martin Fiedler^9,2

2921 Accesses
34 Citations
1 Altmetric
Explore all metrics

Abstract

Metabolomics as one of the most rapidly growing technologies in the “-omics” field denotes the comprehensive analysis of low molecular-weight compounds and their pathways. Cancer-specific alterations of the metabolome can be detected by high-throughput mass-spectrometric metabolite profiling and serve as a considerable source of new markers for the early differentiation of malignant diseases as well as their distinction from benign states. However, a comprehensive framework for the statistical evaluation of marker panels in a multi-class setting has not yet been established. We collected serum samples of 40 pancreatic carcinoma patients, 40 controls, and 23 pancreatitis patients according to standard protocols and generated amino acid profiles by routine mass-spectrometry. In an intrinsic three-class bioinformatic approach we compared these profiles, evaluated their selectivity and computed multi-marker panels combined with the conventional tumor marker CA 19-9. Additionally, we tested for non-inferiority and superiority to determine the diagnostic surplus value of our multi-metabolite marker panels. Compared to CA 19-9 alone, the combined amino acid-based metabolite panel had a superior selectivity for the discrimination of healthy controls, pancreatitis, and pancreatic carcinoma patients \( [ {\text{volume under ROC surface}}\;\left( {\text{VUS}} \right) = 0. 8 9 1 { }\left( { 9 5\,\% {\text{ CI }}0. 7 9 4- 0. 9 6 8} \right)]. \) We combined highly standardized samples, a three-class study design, a high-throughput mass-spectrometric technique, and a comprehensive bioinformatic framework to identify metabolite panels selective for all three groups in a single approach. Our results suggest that metabolomic profiling necessitates appropriate evaluation strategies and—despite all its current limitations—can deliver marker panels with high selectivity even in multi-class settings.

A systematic review on metabolomics-based diagnostic biomarker discovery and validation in pancreatic cancer

Article 10 August 2018

A new panel of pancreatic cancer biomarkers discovered using a mass spectrometry-based pipeline

Article Open access 09 November 2017

Discrimination of pancreatic cancer and pancreatitis by LC-MS metabolomics

Article Open access 01 April 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Pancreatic cancer is the fourth leading cause of cancer death in the United States, and most patients diagnosed with pancreatic cancer develop clinical symptoms usually late in the course of the disease (Lowenfels and Maisonneuve 2006). Therefore, only 20 % of patients can be treated with a potentially curative therapy and only about 3–5 % survive at least 5 years (Michl et al. 2006). For these patients, time, especially the so called ‘biomarker lead time’ between the onset of asymptomatic cancer still localized to the organ of origin and clinical diagnosis (Konforte and Diamandis 2012), is crucially important (Hazelton and Luebeck 2011). Although recent modeling studies have illustrated that blood-based biomarkers might provide a successful tool for the early detection and differentiation of premalignant lesions, substantial methodological enhancements of unanticipated extent (Burgess 2012) are still required. Yachida et al. (2010) demonstrated a latency of about 17 years from the initiating mutation to pancreatic cancer death. Similarly, Hori and Gambhir (2011) stated “that shedding rates of current clinical blood biomarkers are likely 10⁴-fold too low to enable detection of a developing tumor within the first decade of tumor growth” and suggested to increase sensitivity and specificity by introducing multi-marker panels of up to 10 biomarkers. In a proof-of-principle study for evaluating the utility of multiplexed circulating biomarkers, Brand et al. (2011) investigated the selectivity of 83 proteins and their combinations. Two panels consisting of CA 19-9, ICAM-1, and OPG, as well as CA 19-9, CEA, and TIMP-1 were found to discriminate pancreatic cancer patients from healthy control subjects and from patients with benign pancreatic conditions, respectively. Since the cohorts in this study were compared separately, an integral model encompassing all three disease states was not obtained. Whereas Brand et al. (2011) focused on known tumor markers, tumor-associated peptides, etc., other studies have employed several of the emerging “-omics” subspecialties, such as proteomics (Fiedler et al. 2009), transcriptomics (Zhang et al. 2010), and—as the probably closest to the “bedside” (Van and Veenstra 2009)—metabolomics (Bathe et al. 2011; Ceglarek et al. 2009; Nishiumi et al. 2010; OuYang et al. 2011; Urayama et al. 2010; Zhang et al. 2011). The latter bears the chance to learn from the intricacies that have plagued “-omics” researchers over the last years, standardization (Van and Veenstra 2009), data processing (Blekherman et al. 2011) and data interpretation (Kholodenko et al. 2012), amongst others.

In this study, we addressed these challenges by using a three-class study design. We collected highly standardized samples of pancreatic cancer patients, subjects with pancreatitis, and healthy controls. Following tandem mass spectrometric metabolite profiling, we evaluated the differences between groups and applied Bayesian methodology to identify multi-metabolite models as “meta-markers”, which are selective for each of the three study groups and provide improved diagnostic performance compared to CA 19-9, the conventional tumor marker.

2 Materials and methods

2.1 Patients and samples

We recruited patients suffering from pancreatic cancer (\( n = 40 \)), healthy controls (\( n = 40 \)), and patients hospitalized due to acute pancreatitis (\( n = 2 6 \)) at the University Hospital of Leipzig in the context of previously published studies (Fiedler et al. 2009; Leichtle et al. 2012). We collected cubital vein fasting samples of cancer patients and healthy controls in two independent sets. Additionally, we collected fasting serum samples of 26 patients with pancreatitis as inflammatory control group (A, C, and D, \( n_{\text{total}} = 10 6 \); Table 1). We adjusted subjects according to age and gender and performed blood sampling from patients before the initiation of specific therapy. Diagnosis of pancreatic cancer was confirmed by histologic examination in all cases. Healthy controls showed no evidence of actual disease in physical examination and routine laboratory testing [alkaline phosphatase, bilirubin, C-reactive protein, CA 19-9, CEA, creatinine, γ-glutamyltransferase, transaminases (Roche Modular, Germany)]. Pancreatitis patients were diagnosed clinically without proof of pancreatic carcinoma during the study period. Serum samples were collected and stored (at −80 °C) using standardized techniques and protocols (Baumann et al. 2005).

Table 1 Baseline data for the two study sets (A and C) as well as the pancreatitis cohort (D)

Full size table

2.2 Chemicals, standards and consumables

Methanol and isopropanol (gradient grade) were purchased from Merck (Darmstadt, Germany). The amino acid isotope labelled standard kit (NSK-A, Cambridge Isotope Laboratories, Andover, USA) was used as internal standard. Water (HPLC grade) was obtained from J. T. Baker (Deventer, Netherlands). The derivatization reagent 3n butanolic HCl was made in-house by mixing 4:1 v/v of 1-butanol (for spectroscopy) from Merck (Darmstadt, Germany) and acetyl chloride (p.a.) from Sigma-Aldrich (Steinheim, Germany). 96-well polypropylene microtiter plates were purchased from Greiner Bio-One (Frickenhausen, Germany). Sampling material was obtained from Sarstedt (Nümbrecht, Germany). For sample storage 450 μL CryoTubes™ were purchased from Sarstedt.

2.3 Sample pretreatment

Sample derivatization was performed according to our previously described protocols (Brauer et al. 2011). Shortly, serum samples were diluted 1:10 with methanol for protein precipitation. After centrifugation we placed 10 μL of the supernatant into 96-well polypropylene microtiter plates and diluted it with 100 μL of the internal standard solution. Following evaporation at 70 °C for 40 min, we added 60 μL of 3n butanolic-HCl for derivatization at 65 °C for 18 min. The residual solution was again evaporated at 70 °C for 40 min and then reconstituted with 150 μL of the mobile phase (1/1 v/v isopropanol/water). After 15 min of gentle shaking of the microtiter plate at room temperature, we analyzed the samples by flow injection analysis (FIA)-tandem mass spectrometry (MS/MS).

2.4 Tandem mass spectrometry

An API 3000 MS/MS (Applied Biosystems, Germany) equipped with a Turbo Ion Spray Source (TIS) in combination with an HTC Pal autosampler and a PE 200 microgradient pump was used for flow injection analysis (FIA). 25 μL of the sample were directly injected at a flow rate of 80 μL/min in an analysis time of 1.5 min. We detected amino acids by a neutral loss scan of 102 in the mass range of 130–280 or multiple reaction monitoring (MRM). Quantitative analysis using internal standards for 26 amino acids was performed using ChemoView™ 1.4.2 (Applied Biosystems, Germany). A comprehensive overview of mass transitions, internal standards, and performance data for the different amino acids is summarized in Brauer et al. (2011).

2.5 Bioinformatic analysis

Statistical analyses were conducted (unless otherwise stated) using R for Windows (Version 2.14.2) and its related CRAN packages (http://cran.r-project.org/). We tested data for normality by the Anderson–Darling test (nortest) and the gender distribution in the subgroups by the binomial test (stats). The homogeneity of variances of the quantitative routine laboratory data was evaluated with the approximative Fligner–Killeen test (stats), whereas the paired differences were investigated by Games–Howell testing (script source: http://aoki2.si.gunma-u.ac.jp/R/src/tukey.R). Three missing CA 19-9 values were imputed by multiple imputation (MI) with 3 chains of imputation (which were averaged thereafter), a \(\hat{\text{R}}\) value of 1.1, and bootstrap as random imputation method until conversion (after 7111 iterations). Three pancreatitis samples with non-random missing data as a consequence of insufficient sample volume were excluded from further analysis. The selectivity of single amino acid concentrations was assessed in all disease states simultaneously via the volume under ROC surface (VUS), which is the three-dimensional analogue of AUROC analysis (Nakas and Yiannoutsos 2004). The VUS’ and their associated confidence intervals were calculated nonparametrically using \( B = 2 ,000 \) bootstraps and 50 k subdivisions on the amino acid concentrations (DiagTest3Grp). As we assumed a high degree of collinearity in the amino acid concentrations, we computed Kendall’s τ as well as its Hochberg-adjusted significance (ltm, corrplot) and plotted the correlation matrix (Fig. 1). Based on our previous results indicating that marker models comprising combinations of different amino acids and/or CA 19-9 might be superior to single amino acids and/or CA 19-9 with respect to their selectivity (Leichtle et al. 2012), we also evaluated combined models. These models consisted on the one hand of the conventional tumor marker CA 19-9 combined with different principal components (PC1, PC2, …) of the different amino acid concentrations to control for multicollinearity, which is a significant constraint on variable selection (Leigh 1988). On the other hand we also combined CA 19-9 with mere amino acid concentrations to avoid potentially unnecessary transformation steps prior to modeling. After Yeo–Johnson transformation (car) of the amino acid concentrations, we generated PCs (princomp and factoMineR), from which the first six PCs had eigenvalues >1.0 and cumulatively covered 78.7 % of the variance. For the modeling in a three-state design, we merged sample set A with C and obtained one tumor group, one control group and one pancreatitis group (set D). In the second step we used CA 19-9 alone and combined with the PCs of the amino acid concentrations as well as with mere concentrations for Bayes-averaged multinomial logit modeling [mlogitBMA, bic.mlogit(mlogitBMA)] using Begg and Gray approximation. We validated the latter model by CAR [‘Correlation-Adjusted (marginal) coRrelation’] scores (care) assuming empirical values of 1.0, 0.3, and 0.1 as responses of the pancreatic carcinoma, the healthy control, and the pancreatitis groups, respectively, in a CAR model truncated to a number of variables comparable to the penalized multinomial logit model. We computed the VUS for the four predictors, namely CA 19-9, the PCA-based mlogitBMA-predictor (PCA), the amino acid concentration-based mlogitBMA-predictor (AA), and the amino acid concentration-based CAR-model-predictor (CAR) analogously to the VUS values of the amino acid concentrations. The ROC surface plots (Fig. 2) were drawn using MATLAB (The MathWorks, Natick, MA, USA). Since significant differences of the VUS between predictors and CA 19-9 do not imply inferiority or superiority, we performed non-inferiority and superiority testing applying bootstrap techniques on Δ_VUS (\( B_{\text{outer}} = 1 ,000 \), boot, \( B_{\text{inner}} = 1 ,000 \), DiagTest3Grp, UBELIX Cluster of the University of Bern). We constructed the corresponding CIs and tested for the overlap of Δ₀ and the Δ_VUS’ CI according to the methods proposed by Liu et al. (2006), Tunes da Silva et al. (2009), and Lesaffre (2008) at a predefined δ level of 5 % which was considered to be medically reasonable designing this and a previous study (Leichtle et al. 2012). We visualized the performance data in a forest plot [Fig. 3, forestplot(rmeta), R version 2.15.0, cf. Mascha (2010)].

2.6 Ethics

The study was approved by the Ethics Committee of the Medical Faculty of the University of Leipzig (Reg. No. 013-2005) and it fulfills the requirements of the Helsinki declaration. All study subjects gave written informed consent to participate in the study.

3 Results and discussion

3.1 Descriptives

We collected serum samples of 40 (20 males/20 females) pancreatic carcinoma patients, 26 (22 males/4 females, \( P_{\text{binomial}} = 0.000 5 \)) pancreatitis patients, and 40 (20 males/20 females) healthy controls. Table 1 displays the distributions of age, BMI, UICC cancer staging of the patients, the CA 19-9, bilirubin, and HbA1c concentrations in the three different sample sets. Of 26 amino acid concentrations, we found only four (arginine, glutamic acid, phenylalanine, and tryptophan) unaltered between the study groups (Table 2). Several amino acid concentrations were non-normally distributed. In order to detect sample set-specific alterations in the values, we also compared the amino acid concentrations of the tumor patients and the healthy control group between the two sample sets and found no significant differences (Table 2).

Table 2 Inter-group significance (P values) of the differences in CA 19-9 and amino acid concentrations as evaluated by Games–Howell testing and the homogeneity of variances as determined by Fligner–Killeen testing (column P)

Full size table

3.2 Correlations

We evaluated the multicollinearity of the amino acid concentrations by generating their correlation matrix (Kendall’s τ and its Hochberg-adjusted significance). Kendall’s τ values ranged between −0.516 (aspartic acid with glutamine) and 0.709 (threonine with glutamine), the P values between 0.001 and 0.97 (Fig. 1).

3.3 Modeling

We hypothesized that combinatory markers including several amino acids and/or CA19-9 might have additive or even multiplicative effects and thus be superior to single amino acids and/or CA 19-9 in diagnostics of pancreatic cancer (Brand et al. 2011). Therefore, in addition to evaluating the single VUS of the amino acid concentrations, we also generated models based on CA 19-9 combined with PCs and Bayesian multinomial logit model averaging (mlogitBMA) as well as on CA 19-9 conjoined with amino acid concentrations. Furthermore, we used models based on CAR scores to evaluate their three-class selectivity in comparison with that of single amino acid concentrations and CA 19-9 as a validation method for the mlogitBMA models. The best PC-based mlogitBMA-model comprised CA 19-9 and PC2. The best amino acid concentration-based mlogitBMA-model contained CA 19-9 and aspartic acid, which both were also contained in the truncated amino acid concentration-based CAR score-model. To evaluate the selectivity of both, amino acid concentrations and modeled predictors, we computed their volume under the ROC surface (VUS) (Table 3). The VUS of the amino acid concentrations spanned from 0.180 (arginine, 95 % CI 0.106–0.270) to 0.850 (glutamine, 95 % CI 0.761–0.929). The VUS of CA19-9 was 0.528 (95 % CI 0.400–0.654), and the predictors PCA, AA, and CAR had VUSs of 0.604 (95 % CI 0.446–0.745), 0.891 (95 % CI 0.794–0.968), and 0.871 (95 % CI 0.776–0.952), respectively. For a random classifier, the VUS could be geometrically determined (Landgrebe and Duin 2007) with a value of \( 1.\bar{6}. \) To illustrate the selectivity of CA 19-9 and the predictors, we generated true class rate-plots depicting the ROC surface (Fig. 2).

Table 3 Volume under receiver operator characteristics curve (VUS) and 95 % confidence intervals of the amino acid and CA 19-9 concentrations as well as of a random classifier [cf. Landgrebe and Duin (2007)] with respect to the discriminatory power between pancreatic cancer patients, healthy controls, and pancreatitis patients

Full size table

3.4 Non-inferiority and superiority testing

Since it is generally accepted that significant difference alone might not be an adequate measure of non-inferiority or superiority, we sequentially tested for both with an a priori-defined acceptance criterion (equivalence limit) of \( \delta = 5\,\% \) Δ_VUS. We computed the lower and upper limits of the (100 − 2δ) % bootstrap confidence interval of the estimated Δ_VUS as proposed by Liu et al. (2006). The results are shown in Fig. 3. Following the criteria by Mascha (2010) we deduced non-inferiority for predictors AA and CAR compared to CA 19-9. Furthermore, superiority of the AA and CAR predictors over CA 19-9 alone was derived in a second step, since the lower CI of Δ_VUS was positive.

“The ideal biological marker(s) for cancer risk assessment and early detection must have high sensitivity and specificity, be found in a biosample obtained using minimally invasive procedures, and be analyzed using a highthroughput, cost-effective assay.” These requirements stated by Van and Veenstra (2009) are challenging to fulfill. Particularly, in the case of pancreatic cancer diagnostics this challenge is even bigger due to the number of differential diagnoses, which are difficult to discern from malignant disease even for experienced clinicians (Gong et al. 2012). Furthermore, chronic pancreatitis patients also have a 15-fold higher risk than the general population to develop pancreatic cancer (Huggett and Pereira 2011). In order to identify biomarkers capable of discriminating different disease states, we designed a study including pancreatic cancer patients and healthy controls of two independently collected sample sets as well as an additional group of pancreatitis patients, since the principal feasibility of the metabolomic approach to pancreatic cancer was recently shown (Bathe et al. 2011; Tesiram et al. 2012; Zhang et al. 2012). Samples were processed following highly standardized preanalytical protocols and applying a routinely used tandem mass spectrometric technique. By comparing the sample groups, we found 22 of 26 amino acids altered in at least one out of ten possible comparisons. The number of different metabolites is comparable to that given by Bathe et al. (2011) who found 22 of 58 metabolites significantly different between malignant and benign pancreatic disease applying ¹H NMR and 2D NMR spectroscopy, with OuYang et al.’s (2011) ¹H NMR spectroscopy results showing significant alterations of at least 8 metabolites between only 17 pancreatic carcinoma patients and 25 healthy controls. It is consistent with Urajama et al.’s (2010) combined GC/TOF-MS, LC/ESI-MS, and LC/LTQ-Orbitrap study revealing 26 significantly different metabolites in a comparison of 5 pancreatic cancer samples and 5 mixed pancreatitis/healthy controls, and with Nishiumi et al.’s (2010) GC–MS investigations based on 21 pancreatic cancer patients and 9 healthy volunteers identifying 18 of 60 metabolites as significantly different. In addition to the inter-class comparisons of the different sample sets, we also evaluated the inter-sample set differences in the respective classes (cancer_A − cancer_C and control_A − control_C) and found no significant differences. Since the sample groups were homogeneous, we preferred a joint analysis in the modeling approach over a split-half design to keep the degree of random error as low as possible (Knottnerus and Muris 2003; Ransohoff and Gourlay 2010). Although the previously published metabolome profiling studies of pancreatic carcinoma are heterogeneous regarding the used mass-spectrometric techniques and the studied metabolites, they all share canonical variance-based evaluation strategies with two-class comparisons. Additionally, only one of the studies (Bathe et al. 2011) assessed the selectivity (e.g. AUROC or VUS analyses) of the marker metabolites. Our aim was to perform a comprehensive data analysis that also allows a clear interpretation of the diagnostic value of the markers (Leichtle et al. 2012). To this end, we implemented four unexampled features in our bioinformatic pipeline: (1) The computation of three-class VUS values of the single amino acid concentrations as an integral selectivity measure, (2) a Bayesian multinomial logit model averaging procedure to extend the previously used binomial logistic regression modeling (Leichtle et al. 2012) on the three-class study design to generate multi-marker panels (including CA 19-9), (3) the VUS-based analysis of the panel predictors, and, finally, (4) their non-inferiority and superiority determination. The VUS values of the single amino acid concentrations ranged from 0.18 slightly above a random classifier to 0.85 (glutamine), which is close to the best panel predictors. As none of the previous metabolite profiling studies on pancreatic cancer performed VUS analysis, we can only rely on the utterly inconsistent P values they present, in Urayama et al.’s (2010) case 0.000021, or in Nishiumi et al.’s (2010) 0.97 for glutamine. CA 19-9 alone reached the 10th rank, which is probably attributable to its low selectivity between benign and malign pancreatic diseases (Fig. 2a). Since our previous investigations (Leichtle et al. 2012) indicated a high degree of multicollinearity in the amino acid concentrations, which is known to impede many feature selection techniques (Jesneck et al. 2009; Leigh 1988), we set up a Kendall’s correlation matrix to visualize the multicollinearity and its significance. As expected, the full range of correlation spanned from −0.516 to 0.709, which supported the inclusion of the frequently recommended PC-based analysis approach, albeit it has been shown that variance-based techniques might not always yield optimal predictors (Leichtle et al. 2012). To compute robust predictive multi-metabolite marker panels, we combined CA 19-9 and the PCs as well as CA 19-9 and the mere amino acid concentrations and used a Bayesian multinomial logit model averaging procedure for our categorical three-class study design (Robin et al. 2009). The first model consisted of CA 19-9 and PC2 providing a two-marker “panel” predictor (PCA) with a VUS of 0.604. The omission of PC1 and preference of PC2 with less contribution to explained variance during the mlogitBMA procedure is an astounding finding possibly reflecting a predilection of variables sharing covariance with CA 19-9. The second model based on amino acid concentrations included CA 19-9 and aspartic acid providing a two-marker “panel” predictor (AA) with a VUS of 0.891. Our results indicate that CA 19-9 provides the selectivity mainly for the discrimination between healthy controls and pancreatic cancer patients (Table 2), whilst aspartic acid predominantly contributed to the identification of pancreatitis patients. Nishiumi et al. (2010) reported a borderline significant P value of 0.075 for aspartic acid, whereas Urayama et al. (2010), OuYang et al. (2011) and Bathe et al. (2011) did not mention significant differences. Our results and panel predictors, however, require extremely cautious interpretation since in a previous study an analytical variability >25 % was observed for aspartic acid (Brauer et al. 2011). On the other hand, regarding the substantial impact of especially pancreatic disease on nutrition, it was not unexpected to find models different to those of our previous study on colorectal cancer (Leichtle et al. 2012). The mechanisms disturbing amino acid homeostasis and enabling the discrimination of pancreatic cancer patients from pancreatitis patients on the basis of metabolite profiles are not entirely elucidated. Schrader et al. (2009) suggested—apart from malnutrition—mainly inflammatory effects and pointed at the inverse relationship between the circulating amino acid concentrations and the degree of inflammation present e.g. in hemodialysis patients. Whether increased tumor-associated proteolytic activity (Findeisen and Neumaier 2012) contributes not only to the generation of specific peptide decay profiles, but also to the specificity of the amino acid profiles is still unknown. To validate our results and the Bayesian modeling approach, we also applied model selection techniques based on CAR scores (Zuber and Strimmer 2011) as a non-Bayesian linear alternative. Since our study covered three—more or less—independent classes, we could neither rely on a binary (CAT score) nor on a metric (CAR score) response. Therefore we assumed empirical values of 1.0, 0.3, and 0.1 as “responses” of the respective groups while acknowledging that such a procedure might be somewhat artificial and not necessarily justified by the underlying pathophysiology. To gain a comparable number of model variables as in the penalized mlogitBMA-model and thereby an at least limited comparability, we used a two-predictor CAR model including CA 19-9 and aspartic acid. The CAR panel predictor had a VUS of 0.871 similar to the value obtained with the Bayesian modeling approach. As the final evaluation step, we performed a two-step non-inferiority and superiority testing based on the bootstrapped Δ_VUS and on a ± δ equivalence range of 5 % as outlined in a previous study (Leichtle et al. 2012). CA 19-9’s VUS ± δ served as reference we tested the other predictors’ Δ_VUS against. In the first step, we observed non-inferiority only for the AA and CAR panel predictors, but not for the PCA panel predictor, whereas in the second step, we determined superiority of AA and CAR panel predictors (Fig. 3). These encouraging results indicate an improved selectivity of the models compared to CA 19-9 alone. Our study has several limitations to be considered. First, we merged the sample sets A and C to keep the degree of random error as low as possible in our modeling analysis (Knottnerus and Muris 2003). However, the “external” validity of the results could not thus be evaluated (Ransohoff and Gourlay 2010). Therefore, subsequent studies are necessary in order to assess the generalizability of our predictor models. Second, due to high preanalytical standardization and refinement of our bioinformatic methodology, the variability of the analytical method itself might have become the main source of bias. With our study design and evaluation strategy, we probably have reached an analytical boundary, that still requires substantial improvements (Hori and Gambhir 2011). Therefore, new analytical techniques are necessary to reach both, superior sensitivity and stability at the same time. The third limitation of the study originates from the strong penalization of our Bayesian modeling approach: The predictor panels generated by the mlogitBMA procedure were both two-component panels consisting of CA 19-9 and another variable. Especially in the case of PC-based modeling and the selection of the second PC while leaving out the first, a considerable amount of selectivity might have been lost. On the other hand, the amino acid concentration-based model was superior in selectivity [without taking misclassification costs into account (Klawonn et al. 2011)], suggesting that PCs might not serve as optimal modeling variables when Occam’s razor is strictly availed. Finally, rather than proposing a superior diagnostic metabolite model or “meta-marker” our results suggest that our bioinformatic framework combined with a methodology refined to sufficient sensitivity and stability might provide a valuable diagnostic tool for metabolic profiling even in the three-class differentiation dilemma of health, inflammation, and malignancy.

4 Short summary

Multi-marker panels have been suggested to improve the selectivity of pancreatic cancer diagnostics and its differentiation from various benign lesions. However, a comprehensive framework for the statistical evaluation of marker panels in a multi-class setting has not yet been established.

Using a disease model encompassing pancreatic cancer, pancreatitis, and healthy controls, 106 standardized serum samples, and metabolic profiling, we generated models to discriminate between the three study groups.

Multi-marker models are superior to the conventional tumor marker CA 19-9 in simultaneously differentiating between pancreatic cancer, pancreatitis, and healthy controls.

Our comprehensive bioinformatic approach provides a novel framework to address a common diagnostic challenge, and thus paves the way for biomarker validation in a clinical three-class setting.

Abbreviations

¹H NMR:: Proton nuclear magnetic resonance (spectrometry)
2D NMR:: 2-Dimensional nuclear magnetic resonance (spectrometry)
AU(RO)C:: Area under the (receiver operator characteristics) curve
mlogitBMA:: Bayesian multinomial logit model averaging
CA 19-9:: Carbohydrate antigen 19-9
CAR scores:: ‘Correlation-adjusted (marginal) correlation’ scores
CEA:: Carcinoembryonic antigen
GC–MS:: Gas chromatography mass spectrometry
GC/TOF–MS:: Gas chromatography/time of flight mass spectrometry
ICAM-1:: Intracellular adhesion molecule 1
LC/ESI-MS:: Liquid chromatography/electrospray ionization mass spectrometry
LC/LTQ-Orbitrap:: Liquid chromatography/linear ion trap combined with an orbitrap (mass spectrometry)
OPG:: Osteoprotegerin
PC(A):: Principal component (analysis)
TIMP-1:: TIMP metallopeptidase inhibitor 1
VUS:: Volume under (ROC) surface

References

Bathe, O. F., Shaykhutdinov, R., Kopciuk, K., Weljie, A. M., McKay, A., Sutherland, F. R., et al. (2011). Feasibility of identifying pancreatic cancer based on serum metabolomics. Cancer Epidemiology Biomarkers & Prevention, 20, 140–147.
Article CAS Google Scholar
Baumann, S., Ceglarek, U., Fiedler, G. M., Lembcke, J., Leichtle, A., & Thiery, J. (2005). Standardized approach to proteome profiling of human serum based on magnetic bead separation and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Clinical Chemistry, 51, 973–980.
Article PubMed CAS Google Scholar
Blekherman, G., Laubenbacher, R., Cortes, D. F., Mendes, P., Torti, F. M., Akman, S., et al. (2011). Bioinformatics tools for cancer metabolomics. Metabolomics : Official Journal of the Metabolomic Society, 7, 329–343.
Article CAS Google Scholar
Brand, R. E., Nolen, B. M., Zeh, H. J., Allen, P. J., Eloubeidi, M. A., Goldberg, M., et al. (2011). Serum biomarker panels for the detection of pancreatic cancer. Clinical Cancer Research : an Official Journal of the American Association for Cancer Research, 17, 805–816.
Article CAS Google Scholar
Brauer, R., Leichtle, A., Fiedler, G. M., Thiery, J., & Ceglarek, U. (2011). Preanalytical standardization of amino acid and acylcarnitine metabolite profiling in human blood using tandem mass spectrometry. Metabolomics : Official Journal of the Metabolomic Society, 7, 344–352.
Article CAS Google Scholar
Burgess, D. J. (2012). Biomarkers: Major mathematical hurdles for biomarker-based screening. Nature Reviews Cancer, 12, 3.
Article CAS Google Scholar
Ceglarek, U., Leichtle, A., Brugel, M., Kortz, L., Brauer, R., Bresler, K., et al. (2009). Challenges and developments in tandem mass spectrometry based clinical metabolomics. Molecular and Cellular Endocrinology, 301, 266–271.
Article PubMed CAS Google Scholar
Fiedler, G. M., Leichtle, A. B., Kase, J., Baumann, S., Ceglarek, U., Felix, K., et al. (2009). Serum peptidome profiling revealed platelet factor 4 as a potential discriminating Peptide associated with pancreatic cancer. Clinical Cancer Research, 15, 3812–3819.
Article PubMed CAS Google Scholar
Findeisen, P., & Neumaier, M. (2012). Functional protease profiling for diagnosis of malignant disease. Proteomics. Clinical Applications, 6, 60–78.
Article PubMed CAS Google Scholar
Gong, P. L., Liu, T. T., & Shen, X. Z. (2012). Differentiation of autoimmune pancreatitis with pancreatic carcinoma remains a challenge to physicians. Journal of Digestive Diseases, 13, 267–273.
Article PubMed CAS Google Scholar
Hazelton, W. D., & Luebeck, E. G. (2011). Biomarker-based early cancer detection: Is it achievable? Science Translational Medicine, 3, 109fs9.
Article PubMed Google Scholar
Hori, S. S., & Gambhir, S. S. (2011). Mathematical model identifies blood biomarker-based early cancer detection strategies and limitations. Science Translational Medicine, 3, 109ra116.
Article PubMed Google Scholar
Huggett, M. T., & Pereira, S. P. (2011). Diagnosing and managing pancreatic cancer. The Practitioner, 255(21–5), 2–3.
Google Scholar
Jesneck, J. L., Mukherjee, S., Yurkovetsky, Z., Clyde, M., Marks, J. R., Lokshin, A. E., et al. (2009). Do serum biomarkers really measure breast cancer? BMC cancer, 9, 164.
Article PubMed Google Scholar
Kholodenko, B., Yaffe, M. B., & Kolch, W. (2012). Computational approaches for analyzing information flow in biological networks. Science Signaling, 5, re1.
Article PubMed Google Scholar
Klawonn, F., Höppner, F., May, S. (2011) An alternative to ROC and AUC analysis of classifiers advances in intelligent data analysis X in gama, In J. E. Bradley, J. Hollmén (Eds.). Berlin/Heidelberg: Springer. pp. 210–221.
Knottnerus, J. A., & Muris, J. W. (2003). Assessment of the accuracy of diagnostic tests: the cross-sectional study. Journal of Clinical Epidemiology, 56, 1118–1128.
Article PubMed CAS Google Scholar
Konforte, D., & Diamandis, E. P. (2012). Is early detection of cancer with circulating biomarkers feasible? Clinical Chemistry.
Landgrebe, T., & Duin, R. P. W. (2007). A simplified volume under the ROC hypersurface. SAIEE Africa Research Journal, 98, 94–100.
Google Scholar
Leichtle, A., Nuoffer, J.-M., Ceglarek, U., Kase, J., Conrad, T., Witzigmann, H., et al. (2012). Serum amino acid profiles and their alterations in colorectal cancer. Metabolomics, 8, 643–653.
Article PubMed CAS Google Scholar
Leigh, J. P. (1988). Assessing the importance of an independent variable in multiple regression: is stepwise unwise? Journal of Clinical Epidemiology, 41, 669–677.
Article PubMed CAS Google Scholar
Lesaffre, E. (2008). Superiority, equivalence, and non-inferiority trials. Bulletin of the NYU Hospital for Joint Diseases, 66, 150–154.
PubMed Google Scholar
Liu, J. P., Ma, M. C., Wu, C. Y., & Tai, J. Y. (2006). Tests of equivalence and non-inferiority for diagnostic accuracy based on the paired areas under ROC curves. Statistics in Medicine, 25, 1219–1238.
Article PubMed Google Scholar
Lowenfels, A. B., & Maisonneuve, P. (2006). Epidemiology and risk factors for pancreatic cancer. Best Practice & Research Clinical Gastroenterology, 20, 197–209.
Article Google Scholar
Mascha, E. J. (2010). Equivalence and noninferiority testing in anesthesiology research. Anesthesiology, 113, 779–781.
Article PubMed Google Scholar
Michl, P., Pauls, S., & Gress, T. M. (2006). Evidence-based diagnosis and staging of pancreatic cancer. Best Practice & Research Clinical Gastroenterology, 20, 227–251.
Article Google Scholar
Murdoch, D. J., & Chow, E. D. (1996). A graphical display of large correlation matrices. The American Statistician, 50, 178–180.
Google Scholar
Nakas, C. T., & Yiannoutsos, C. T. (2004). Ordered multiple-class ROC analysis with continuous measurements. Statistics in Medicine, 23, 3437–3449.
Article PubMed Google Scholar
Nishiumi, S., Shinohara, M., Ikeda, A., Yoshie, T., Hatano, N., Kakuyama, S., et al. (2010). Serum metabolomics as a novel diagnostic approach for pancreatic cancer. Metabolomics: Official Journal of the Metabolomic Society, 6, 518–528.
Article CAS Google Scholar
OuYang, D., Xu, J., Huang, H., & Chen, Z. (2011). Metabolomic profiling of serum from human pancreatic cancer patients using 1H NMR spectroscopy and principal component analysis. Applied Biochemistry and Biotechnology, 165, 148–154.
Article PubMed Google Scholar
Ransohoff, D. F., & Gourlay, M. L. (2010). Sources of bias in specimens for research about molecular markers for cancer. Journal of Clinical Oncology : Official Journal of the American Society of Clinical Oncology, 28, 698–704.
Article Google Scholar
Robin, X., Turck, N., Hainard, A., Lisacek, F., Sanchez, J. C., & Muller, M. (2009). Bioinformatics for protein biomarker panel classification: What is needed to bring biomarker panels into in vitro diagnostics? Expert Review of Proteomics, 6, 675–689.
Article PubMed CAS Google Scholar
Schrader, H., Menge, B. A., Belyaev, O., Uhl, W., Schmidt, W. E., & Meier, J. J. (2009). Amino acid malnutrition in patients with chronic pancreatitis and pancreatic carcinoma. Pancreas, 38, 416–421.
Article PubMed CAS Google Scholar
Tesiram, Y. A., Lerner, M., Stewart, C., Njoku, C., & Brackett, D. J. (2012). Utility of nuclear magnetic resonance spectroscopy for pancreatic cancer studies. Pancreas, 41, 474–480.
Article PubMed Google Scholar
Tunes da Silva, G., Logan, B. R., & Klein, J. P. (2009). Methods for equivalence and noninferiority testing. Biology of Blood and Marrow Transplantation: Journal of the American Society for Blood and Marrow Transplantation, 15, 120–127.
Article Google Scholar
Urayama, S., Zou, W., Brooks, K., & Tolstikov, V. (2010). Comprehensive mass spectrometry based metabolic profiling of blood plasma reveals potent discriminatory classifiers of pancreatic cancer. Rapid Communications in Mass Spectrometry: RCM, 24, 613–620.
Article PubMed CAS Google Scholar
Van, Q. N., & Veenstra, T. D. (2009). How close is the bench to the bedside? Metabolic profiling in cancer research. Genome Medicine, 1, 5.
Article PubMed Google Scholar
Yachida, S., Jones, S., Bozic, I., Antal, T., Leary, R., Fu, B., et al. (2010). Distant metastasis occurs late during the genetic evolution of pancreatic cancer. Nature, 467, 1114–1117.
Article PubMed CAS Google Scholar
Zhang, L., Farrell, J. J., Zhou, H., Elashoff, D., Akin, D., Park, N. H., et al. (2010). Salivary transcriptomic biomarkers for detection of resectable pancreatic cancer. Gastroenterology, 138(949–57), e1–e7.
Google Scholar
Zhang, H., Wang, Y., Gu, X., Zhou, J., & Yan, C. (2011). Metabolomic profiling of human plasma in pancreatic cancer using pressurized capillary electrochromatography. Electrophoresis, 32, 340–347.
Article PubMed Google Scholar
Zhang, L., Jin, H., Guo, X., Yang, Z., Zhao, L., Tang, S., et al. (2012). Distinguishing pancreatic cancer from chronic pancreatitis and healthy individuals by (1)H nuclear magnetic resonance-based metabonomic profiles. Clinical Biochemistry, 45, 1064–1069.
Article PubMed CAS Google Scholar
Zuber, V., & Strimmer, K. (2011). High-dimensional regression and variable selection using CAR scores. Statistical Applications in Genetics and Molecular Biology, 10, 34.
Article Google Scholar

Download references

Acknowledgments

The authors would like to thank David Gurtner from the UBELIX Linux High Performance Computing Cluster of the University of Bern for his outstanding technical support, Xavier Robin from the Department of Structural Biology and Bioinformatics of the Biomedical Proteomics Research Group at the University of Geneva for inspiring discussions, and Johanna Sistonen from the Pharmacogenomics and Drug Metabolism Group of the Center of Laboratory Medicine at the Inselspital Bern for her thorough revision of the manuscript.

Conflict of interest

None.

Open Access

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Author information

Authors and Affiliations

Center of Laboratory Medicine, University Institute of Clinical Chemistry, Inselspital—Bern University Hospital, Inselspital INO F 502/UKC, 3010, Bern, Switzerland
Alexander Benedikt Leichtle
Institute of Laboratory Medicine, Clinical Chemistry, and Molecular Diagnostics, University Hospital Leipzig, 04103, Leipzig, Germany
Uta Ceglarek, Joachim Thiery & Georg Martin Fiedler
Leibniz Supercomputing Centre, Bavarian Academy of Sciences and Humanities, Boltzmannstr. 1, 85748, Garching, Germany
Peter Weinert
Laboratory of Biometry, University of Thessaly, Fytokou Str., N. Ionia, 38446, Magnesia, Greece
Christos T. Nakas
Department of Hematology, Oncology and Tumor Immunology, Campus Virchow Clinic, and Molekulares Krebsforschungszentrum, Charité—Universitätsmedizin, Augustenburger Platz 1, 13353, Berlin, Germany
Julia Kase
Department of Mathematics, Free University of Berlin, Arnimallee 6, 14195, Berlin, Germany
Tim Conrad
Clinic of Visceral Surgery, University Hospital Leipzig, Liebigstr. 20, 04103, Leipzig, Germany
Helmut Witzigmann
Center of Laboratory Medicine, University Institute of Clinical Chemistry, Inselspital—Bern University Hospital, Inselspital INO F 610/UKC, 3010, Bern, Switzerland
Jean-Marc Nuoffer
Center of Laboratory Medicine, University Institute of Clinical Chemistry, Inselspital—Bern University Hospital, Inselspital INO F 603/UKC, 3010, Bern, Switzerland
Georg Martin Fiedler

Authors

Alexander Benedikt Leichtle
View author publications
You can also search for this author in PubMed Google Scholar
Uta Ceglarek
View author publications
You can also search for this author in PubMed Google Scholar
Peter Weinert
View author publications
You can also search for this author in PubMed Google Scholar
Christos T. Nakas
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Marc Nuoffer
View author publications
You can also search for this author in PubMed Google Scholar
Julia Kase
View author publications
You can also search for this author in PubMed Google Scholar
Tim Conrad
View author publications
You can also search for this author in PubMed Google Scholar
Helmut Witzigmann
View author publications
You can also search for this author in PubMed Google Scholar
Joachim Thiery
View author publications
You can also search for this author in PubMed Google Scholar
Georg Martin Fiedler
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alexander Benedikt Leichtle.

Additional information

Alexander Benedikt Leichtle and Uta Ceglarek have contributed equally to this study.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Leichtle, A.B., Ceglarek, U., Weinert, P. et al. Pancreatic carcinoma, pancreatitis, and healthy controls: metabolite models in a three-class diagnostic dilemma. Metabolomics 9, 677–687 (2013). https://doi.org/10.1007/s11306-012-0476-7

Download citation

Received: 07 September 2012
Accepted: 15 October 2012
Published: 06 November 2012
Issue Date: June 2013
DOI: https://doi.org/10.1007/s11306-012-0476-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Pancreatic carcinoma, pancreatitis, and healthy controls: metabolite models in a three-class diagnostic dilemma

Abstract

Similar content being viewed by others

A systematic review on metabolomics-based diagnostic biomarker discovery and validation in pancreatic cancer

A new panel of pancreatic cancer biomarkers discovered using a mass spectrometry-based pipeline

Discrimination of pancreatic cancer and pancreatitis by LC-MS metabolomics

1 Introduction