Migrating from partial least squares discriminant analysis to artificial neural networks: a comparison of functionally equivalent visualisation and feature contribution tools using jupyter notebooks

Mendez, Kevin M.; Broadhurst, David I.; Reinke, Stacey N.

doi:10.1007/s11306-020-1640-0

Migrating from partial least squares discriminant analysis to artificial neural networks: a comparison of functionally equivalent visualisation and feature contribution tools using jupyter notebooks

Original Article
Open access
Published: 21 January 2020

Volume 16, article number 17, (2020)
Cite this article

Download PDF

You have full access to this open access article

Metabolomics Aims and scope Submit manuscript

Migrating from partial least squares discriminant analysis to artificial neural networks: a comparison of functionally equivalent visualisation and feature contribution tools using jupyter notebooks

Download PDF

10k Accesses
33 Citations
14 Altmetric
Explore all metrics

Abstract

Introduction

Metabolomics data is commonly modelled multivariately using partial least squares discriminant analysis (PLS-DA). Its success is primarily due to ease of interpretation, through projection to latent structures, and transparent assessment of feature importance using regression coefficients and Variable Importance in Projection scores. In recent years several non-linear machine learning (ML) methods have grown in popularity but with limited uptake essentially due to convoluted optimisation and interpretation. Artificial neural networks (ANNs) are a non-linear projection-based ML method that share a structural equivalence with PLS, and as such should be amenable to equivalent optimisation and interpretation methods.

Objectives

We hypothesise that standardised optimisation, visualisation, evaluation and statistical inference techniques commonly used by metabolomics researchers for PLS-DA can be migrated to a non-linear, single hidden layer, ANN.

Methods

We compared a standardised optimisation, visualisation, evaluation and statistical inference techniques workflow for PLS with the proposed ANN workflow. Both workflows were implemented in the Python programming language. All code and results have been made publicly available as Jupyter notebooks on GitHub.

Results

The migration of the PLS workflow to a non-linear, single hidden layer, ANN was successful. There was a similarity in significant metabolites determined using PLS model coefficients and ANN Connection Weight Approach.

Conclusion

We have shown that it is possible to migrate the standardised PLS-DA workflow to simple non-linear ANNs. This result opens the door for more widespread use and to the investigation of transparent interpretation of more complex ANN architectures.

The application of artificial neural networks in metabolomics: a historical perspective

Article 18 October 2019

Informative metabolites identification by variable importance analysis based on random variable combination

Article 17 May 2015

Feature selection using distributions of orthogonal PLS regression vectors in spectral data

Article Open access 22 January 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Within a biological system, metabolite concentrations are highly interdependent (Dunn et al. 2011). As such, the usefulness of multivariate data analysis in metabolomics stems from the need to extract biological information from inherently complex covariant data, where metabolite interaction is as important as individual changes in concentration. Historically, partial least squares (PLS), a.k.a. projection to latent structures (Wold 1975; Wold et al. 1993), has been the standard multivariate machine learning (ML) method used to construct predictive models to classify metabolite profiles. The underlying theory of PLS, and its utility to metabolomics, has been documented many times (Geladi and Kowalski 1986; Gromski et al. 2015; Wold et al. 1993, 2001). A key benefit of PLS is the ability to visualise (via a latent variable score plot) the projected metabolomic relationship (clustering) between individual samples before classification.

There are many machine learning (ML) alternatives to PLS, several of which have been applied to metabolomics data. The most popular include support vector machines (Steinwart and Christmann, 2008), random forests (Breiman 2001), and artificial neural networks (Bishop 1995; Wilkins et al. 1994); however, despite coexisting for a similar length of time, none of these methods have gained the popularity of PLS. A survey of publications listed on the Web of Science using the keywords metabolite*, metabolom* or metabonom* reveals that up to and including 2018, 2224 publications list the use of PLS as a key term, whereas the alternatives were listed < 500 times (combined number). The key to the popularity of PLS over alternative methods can be distilled into a single word—interpretability. Historically, the primary aim of machine learning (ML) has been accurate prediction, not statistical inference (Mendez et al. 2019a). As such, methods for statistically interpreting either the similarities between each individual metabolite profile, or the importance of individual metabolites across multiple samples, have been a secondary consideration. The ability for PLS to visualise and infer statistical confidence intervals upon the latent relationships within and between sample classes, together with the fact that a PLS model can be reduced to a simple linear regression (and thus exposed to multiple well established post-hoc statistical tests), means that it sits alone as an effective hybrid prediction-inference algorithm for high dimensional data (Eriksson et al. 2013; Wold 1975; Wold et al. 1993).

Artificial neural networks (ANNs) are also of particular interest because in their simplest form, as with PLS, they can be considered as a combination of dimensionality reduction and multiple linear regression. In fact, for a linear ANN, with a single hidden layer, the only difference between ANN and PLS is the manner in which the constituent model parameters are optimised (Fig. 1). ANNs can be generally considered a projection-based method which share a structural equivalence with PLS (Mendez et al. 2019a). With non-linear ANNs the projection to latent structures ethos is preserved but now non-linear, rather than linear, latent structures can be modelled.

ANNs were first applied to metabolomic profiling ca. 1992 by Goodacre et al. (1992). At that time, due to lack of compute power and poor software availability, ANNs were very slow to train and considered difficult to interpret. As such, by the early 2000s they had been widely disregarded and relegated to an intellectual curiosity not considered able to provide meaningful biological insight (Goodacre 2003). With recent advancements in computational power, the availability of easily accessible yet powerful open-source packages (e.g. TensorFlow and PyTorch), and the general success within industry and other research fields, the reintroduction of ANNs warrants renewed investigation. We recently showed that ANNs have similar predictive ability to PLS across multiple diverse metabolomics data sets (Mendez et al. 2019c). However, within the domain of metabolomics, if ANNs are to become a truly viable alternative to PLS it will be necessary to develop similar standardised and robust methods for data visualisation, evaluation, and statistical inference (Mendez et al. 2019a).

Recently, the increased availability of well curated open-source software libraries, particularly from R and Python programming communities, has increased the availability and utility of many ML methods, including ANNs. Moreover, the massive increase in available computer power has reduced compute times such that methods previously intractable due to computational expense, such as bootstrap confidence intervals (Efron 1988), have enabled non-parametric statistical inference to be derived for previously considered uninterpretable ‘black box’ methods. This opens the door for the development of an ANN framework comparable to that of PLS-DA.

The aim of this study is to migrate the standardised optimisation, visualisation, evaluation, and statistical inference techniques commonly used in a PLS-DA binary classification over to a non-linear, single hidden layer, ANN algorithm, and then conduct a direct comparison of utility. We provide two functionally equivalent workflows (PLS-DA vs. ANN) implemented using the Python programming language, and presented as open-access Jupyter Notebooks (https://cimcb.github.io/MetabProjectionViz/). The workflows were applied to two previously published metabolomics datasets by Chan et al. (2016) and Ganna et al. (2016), but are written to be used with any data set suitably formatted following previous guidelines (Mendez et al. 2019b). Both workflows include cross-validated hyperparameter optimisation, latent variable projection scores plots, classification evaluation using receiver operator characteristic curves, bootstrap resampling for statistical inference of feature contribution and generalisability of prediction metrics.

2 Methods

2.1 Partial least squares discriminant analysis (PLS-DA)

PLS-DA (Wold 1975; Wold et al. 1993) is a widely used multivariate ML algorithm used for classifying and interpreting metabolomics data, especially applicable when the number of metabolites (independent variables) is much larger than the number of data points (samples). PLS uses the projection to latent space approach to model the linear covariance structure between two matrices (X and Y). If the X matrix is thought of as a set of N data points in M-dimensional space (where, N = number of samples, and M = number of metabolites), and Y is a binary vector (length N) describing the class of each samples (e.g. case = 1 and control = 0), and if we consider the algorithm geometrically, the PLS algorithm rotates and projects X into a lower K dimensional space (typically K = 2 or 3), represented by the scores matrix T, such that discrimination (covariance) between the two labelled groups in the subspace is maximised (Eriksson et al. 2013). For this study, PLS-DA models was optimised using the iterative SIMPLS algorithm (de Jong, 1993). T can be derived from X using Eq. (1), where W, the X-weight matrix, describes how the X-variables are linearly combined, or geometrically rotated, to form the score vectors, $t_{1} \,t_{2} \, \ldots \,t_{K}$.

$${\mathbf{T}}\,{\mathbf{ = }}\,{\mathbf{XW}}$$

(1)

The predicted classification (Y*) can then be calculated from T using Eq. (2), where C is the Y-weights matrix describing how the Y vector is rotated to map to the covariance described by T.

$${\mathbf{Y}}^{*\,} \, = \,{\mathbf{TC^{\prime}}}$$

(2)

These matrix equations, Eq. (1) and Eq. (2), can be combined and simplified to a single linear regression, Eq. (3), where B_PLS is a vector of coefficient values.

$${\mathbf{Y}}^{*\,} \, = \,{\mathbf{TC^{\prime}}}$$

$${\mathbf{Y}}^{*\,} \, = \,{\mathbf{XWC^{\prime}}}$$

$${\mathbf{Y}}^{*\,} \, = \,{\mathbf{XB}}_{{{\mathbf{PLS}}}}$$

(3)

This matrix equation, Eq. (3), can also be described as a single linear regression in standard form, Eq. (4), where $\beta_{0} \,...\,\beta_{N}$ is a vector of linear coefficients.

$$y^{*} = \beta_{0} + \beta_{1} x_{1} + \beta_{1} x_{2} + \ldots + \beta_{M} x_{M}$$

(4)

2.1.1 PLS-DA optimisation

The optimal number of latent variables, K, is determined such that the T matrix is just sufficient to accurately describe the underlying latent structure in X but not so large as to also model random correlation and produce a model that is a poor classification tool for new X-data (see cross-validation in Sect. 3.4). In machine learning terminology any parameter which is used to define a model’s structure, or an optimisation algorithm characteristic, is known as a hyperparameter. Thus, the number of latent variables is the single PLS-DA hyperparameter.

2.1.2 PLS-DA evaluation

In order to provide some level of independent model evaluation it is common practice to split the source data set into two parts: training set and test set (typically, 2/3 training and 1/3 test). Once the optimal number of latent variables has been determined using the training data only (${\mathbf{X}}_{{{\mathbf{train}}}}$ and ${\mathbf{Y}}_{{{\mathbf{train}}}}$), the resulting model, ${\mathbf{Y}}^{*\,} \, = \,{\mathbf{XB}}_{{{\mathbf{PLS}}}}$, is then independently evaluated by applying the test data (${\mathbf{X}}_{{{\mathbf{test}}}}$; suitably transformed and scaled) to the model, ${\mathbf{Y}}_{{{\mathbf{Test}}}}^{*} \, = \,{\mathbf{X}}_{{{\mathbf{test}}}} {\mathbf{B}}_{{{\mathbf{PLS}}}}$. A measure of the predictive ability of the model can then be calculated by comparing the training prediction $({\mathbf{Y}}_{{{\mathbf{train}}}}^{*} )$ to the expected training outcome (Y_train), and the test prediction (Y ^*_test ) to the expected test outcome (Y_test).

While true effectiveness of a model can only be assessed using test data (Westerhuis et al. 2008; Xia et al. 2013), for small data sets it is dangerous to use a single random data split as the only means of model evaluation, as the random test data set may not accurately represent the training data set (Mendez et al. 2019c). An alternative is to use bootstrap resampling. Bootstrap resampling is a method for calculating confidence intervals using random sampling with replacement (DiCiccio and Efron 1996; Efron 1981, 2000). The theoretical details of this methodology are beyond the scope of this paper. Briefly, this technique allows the accurate estimation of the sampling distribution of almost any statistic using repeated random sampling. Each random sample selects ~ 2/3 of the data points (called the in-bag sample) leaving ~ 1/3 (the out-of-bag sample).

Bootstrapping can be used to calculate confidence measurements for the evaluating the optimal ML model configuration for a given metabolomics data set (Broadhurst and Kell 2006; Mendez et al. 2019b; Xia et al. 2013). A model with fixed hyperparameter values is retrained on data, randomly sampled with replacement (in-bag), and then evaluated on the unused data (out-of-bag) for r resamples (typically r = 100). The predicted outcome from each in-bag bootstrap resample as well as other outputs, including the predicted outcome, latent scores, latent loadings, and feature contribution metrics are stored after each resampling. The out-of-bag prediction of classification is also stored, as this can be considered an unbiased estimate of the model’s performance when shown new data. Using these stored outputs, 95% confidence intervals are calculated using the commonly-used bias-corrected and accelerated (BCa) method; this method adjusts the percentiles to account for the bias and skewness in the bootstrap distribution (Efron 1987). Following bootstrap resampling, a measure of generalised prediction of each model is calculated as the median and 95% confidence intervals of the in-bag and out-of-bag predictions.

2.1.3 PLS-DA visualisation

For a given PLS-DA model it is common practice to visualise the projection of X into the latent variable space to provide a generalised understanding of the metabolomic relationship (clustering) between individual samples before classification. For this, the scores matrix, T, described in Eq. (1), can be represented as a scatter plot (scores plot) such that each axis of the plot represents a column of the T-matrix. For example, a scatter plot of t₁ vs. t₂ will represent the projections of X onto the first two latent variables (i.e. each data point represents a projection of a given sample’s metabolite profile). It is in this latent variable space that one would expect to see different metabotypes cluster. The associated weight vectors (columns of W) can also be visualised individually and interpreted as an indication of how the X-variables are linearly combined to create each score vector, Eq. (5).

$$\begin{array}{*{20}c} {t_{1} = w_{0,1} + w_{1,1} x_{1} + w_{2,1} x_{2} + \ldots + w_{M,1} x_{M} } \\ {t_{2} = w_{0,2} + w_{1,2} x_{2} + w_{2,2} x_{2} + \ldots + w_{M,2} x_{M} } \\ \ldots \\ {t_{K} = w_{0,K} + w_{1,K} x_{1} + w_{2,K} x_{2} + \ldots + w_{M,K} x_{M} } \\ \end{array} \,(5)$$

For a single optimised model, latent scores plots can be generated for training, cross-validation, and test X-data sets independently. This is a useful method for determining if overtraining has occurred (see supplementary Jupyter Notebooks).

2.1.4 PLS-DA variable contribution

For PLS-DA, there are two common methods used to estimate variable contribution. First, as discussed, a PLS-DA model can be reduced to a single multiple linear regression, Eq. (3), thus feature contribution can be inferred directly from the model’s regression coefficients, B_PLS. Second, for more of a focus on the importance of the X-variables on the latent projection, the variable influence on projection (VIP) scores can be calculated using Eq. (6) (Favilla et al. 2013). VIP is the weighted,$,w_{i}^{2}$ combination of the sum of squares of Y explained by each latent variable, $SSY_{i}$, normalised to the cumulative sum of square, $SSY_{cum}$,

where $M$ is the total number of metabolites, and $K$ is the total number of latent variables.

$${\mathbf{VIP}} = \sqrt {M \times \frac{{\mathop \sum \nolimits_{i = 1}^{K} w_{i}^{2} \times SSY_{i} }}{{SSY_{cum} }}}$$

(6)

The average VIP score is equal to 1 because the sum of squares of all VIP scores is equal to the number of variables in X. Thus, if all X-variables have the same contribution to the model, they will have a VIP score equal to 1. VIP scores larger than 1 indicate the most relevant variables. Bootstrap resampling (Sect. 2.1.2) can be applied to calculate 95% confidence intervals for both the B_PLS coefficient values and VIP scores, from which estimates of significant contribution to the model can be determined.

2.2 Artificial neural network (ANN)

ANNs consist of layered weighted networks of interconnected mathematical operators (neurons). The most prevalent ANN is the feed-forward neural network. Here, each neuron acts as a weighted sum of the outputs of the previous layer (or input data) transformed by an activation function (typically linear or logistic function). This is described in Eq. (7), using notation from Fig. 1a, where ${t}_{j}$ is the output for the j^th neuron in the hidden layer, ${f}_{0}$ is the activation function, $x$ is a vector of input variables (x₁, x₂, …, x_M), ${w}_{i,j}$ is the weight from input variable, x_i, to the neuron, and w_0,j is a constant offset value.

$$t_{j} = f_{0} \left( {w_{0,j} + \mathop \sum \limits_{i = 1}^{M} w_{i,j} \times x_{i} } \right)$$

(7)

A neuron with a linear activation function connected to multiple input variables is mathematically equivalent to a linear regression with multiple independent variables, Eq. (8), where w_0,j … w_N,j is a vector of linear coefficients.

$$t_{j} = w_{0,j} + w_{1,j} x_{1} + w_{2,j} x_{2} + \cdots + w_{M,j} x_{M}$$

(8)

A neuron with a logistic activation function, f₀ (), is equivalent to the multivariate logistic regression describe in Eq. (9).

$$t_{j} = \frac{1}{{1 + e^{{ - \left( { w_{0,j} + \mathop \sum \nolimits_{i = 1}^{M} w_{i,j} \times x_{i} } \right)}} }}$$

(9)

An ANN with a single linear hidden layer and a single linear output neuron is mathematically equivalent to a PLS-DA model (Fig. 1). Replacing all the linear neurons with logistic neurons in the two-layer ANN results in a complex non-linear projection-based discriminant model. For this study, we use a two-layer ANN with logistic activation functions in both layers.

2.2.1 ANN optimisation

During ANN training, the interconnection weights between each layer of neurons are optimised using an iterative algorithm known as back-propagation. This algorithm has been described in detail elsewhere (Bishop 1995). The effectiveness of this optimisation method is dependent on a set of hyperparameters. A two-layer feedforward ANN has 5 hyperparameters: 1 parameter to determine the model structure, the number of neurons in the hidden layer (equivalent to number of latent variables) and 4 parameters that characterise the learning process. These determine the rate and momentum of traversing local error gradients (specifically learning rate, momentum, and decay of the learning rate over time) and the number of times the back-propagation is applied to the ANN (the number of training epochs). For this study, preliminary explorative analysis indicated that hyperparameters: momentum, decay, epochs could be set to a constant value (0.5, 0 and 400 respectively) with little variation on performance. This reduced the number of tuneable hyperparameters to: (i) the number of neurons in the hidden layer, and (ii) the learning rate.

2.2.2 ANN evaluation

Model evaluation using a test set and model evaluation using bootstrap resampling is identical to that described in Sect. 2.1.2. except replacing the PLS-DA prediction, Y^*, with the ANN equivalent.

2.2.3 ANN visualisation

For an equivalent representation of the PLS-DA projection to latent space, we provide a projection to neuron space. Each hidden neuron represents a transformed weighted sum of the X-variables (Eq. 7). Thus, for each pairwise combination of neurons, plotting the weighted sum before transformation provides a similar means to PLS-DA for visualising and interpreting any clustering between individual samples before classification. Similarly, associated weight vectors can also be visualised individually and interpreted as an indication of how the X-variables are linearly combined to create each neuron scores vector before transformation.

2.2.4 ANN variable contribution

For ANN, several variable contribution metrics have been proposed (Olden et al. 2004); however, the two most comparable metrics to the PLS-DA B_PLS coefficients and VIP scores are the Connection Weight Approach (CWA) (Olden and Jackson 2002) and Garson’s Algorithm (GA) (Garson 1991), respectively. Similar to B_PLS, for a two-layer ANN with linear activation functions (Fig. 1b), feature contribution can be inferred directly from a model’s linear coefficients, B_ANN, as shown in Eq. (10), where C is the weights for the hidden-output layer, and W is the weights for the input-hidden layer.

$${\mathbf{CWA}} = {\mathbf{B}}_{{{\mathbf{ANN}}}} = {\mathbf{CW}}$$

(10)

This equation can be used to calculate variable contribution for two-layer non-linear ANNs, renamed as CWA, and describes relative (and directional) metabolite contribution.

While VIP may not be directly applied to non-linear ANNs, a similar measure of weighted absolute relative contribution of each metabolite per neuron can be calculated using Garson’s Algorithm (Garson 1991). First, absolute CWA_i,jvalues are calculated across the network by multiplying each neuron input weight, w_i,j, to the corresponding output weight,c_jand converting to an absolute value.

$$\left| {CWA_{i,j} } \right| = \left| {w_{i,j} \times c_{j} } \right|$$

(11)

Second, as shown in Eq. (12), for each hidden neuron the total absolute connection weight value is calculated, where $M$ is the total number of metabolites.

$$\left| {CWA_{j} } \right| = \mathop \sum \limits_{i = 1}^{M} \left| {CWA_{i,j} } \right| { }$$

(12)

Then, the overall contribution for each input variable, GA_i, is calculated as shown in Eq. (13), where $K$ is the total number of hidden layer neurons.

$$GA_{i} = \mathop \sum \limits_{j = 1}^{K} \left( {\frac{{\left| {CWA_{i,j} } \right|}}{{\left| {CWA_{j} } \right|}}} \right)$$

(13)

Unlike VIP there is no general threshold of importance for Garson’s Algorithm, so we propose using the average GA score as a comparable equivalent to indicate metabolites of importance in the model.

2.3 Computational workflow

The standard workflow for the PLS visualisation and interpretation, and the proposed equivalent ANN visualisation and interpretation is described in Fig. 2. Both the PLS-DA and ANN workflows were implemented in the Python programming language using a package called ‘cimcb’ (https://github.com/CIMCB/cimcb) developed by the authors. This package contains tools for the analysis and visualisation of untargeted and targeted metabolomics data. The package is based on existing well curated open-source packages (including numpy (Kristensen and Vinter, 2010), scipy (Virtanen et al. 2019), bokeh (Bokeh Development Team 2018), keras (Chollet 2015), pandas (McKinney 2010), scikit-learn (Pedregosa et al. 2011), and Theano (Theano Development Team2016)). It utilises these packages through helper functions specifically designed to simplify the application to metabolomics data, following guidelines previously described (Mendez et al. 2019b).

Each step of the respective PLS-DA and ANN workflow is described in detail in the associated Jupyter Notebook file (included in supplementary material and https://cimcb.github.io/MetabProjectionViz/). The method of embedding explanatory text within functional code and visualisations follows previously published guidelines (Mendez et al. 2019b). The generic workflow is now briefly described.

2.3.1 Prepare data

For an adequate comparison of visualisation and interpretation methods, across PLS and ANN, it was important that identical data were used in both models. The X matrix of metabolite concentrations, and associated Y vector of classification labels (case = 1, control = 0) were extracted from the excel spreadsheet. Metabolites in X were included for modelling if they had a QC relative standard deviation (RSD_QC) < 20% and < 10% missing data (Broadhurst et al. 2018). The datasets were split using a ratio of 2:1 (2/3 training, 1/3 test) using stratified random selection. After splitting the data into training and test sets, the columns of X were natural log transformed, mean centred, and scaled to unit variance with missing values imputed using k-nearest neighbour prior to modelling following standard protocols for metabolomics (Broadhurst and Kell 2006). The means and standard deviations calculated from the training set were applied to scale the test set data.

2.3.2 Hyperparameter optimisation

For both PLS-DA and ANN algorithms the optimal hyperparameter values were determined using 5-fold cross-validation (CV) with 10 Monte Carlo repartitions (Broadhurst and Kell 2006; Hastie et al. 2009; Xia et al. 2013). For the PLS-DA workflow, a linear search was used to optimise the number of latent variables (1 to 6). For the ANN workflow, a grid search was used to optimise the number of neurons (2 to 6) and the learning rate (0.001 to 1). The optimal hyperparameter values were determined by evaluating plots of R² and Q² statistics. Two plots were generated: (i) a standard R²and Q² plot against hyperparameter values, and (ii) an alternative plot of $\left| {R^{2} - Q^{2} } \right| vs. Q^{2}$. Using the later plot, the optimal hyperparameter was selected at the point of inflection of the outer convex hull. The area under the receiver operating characteristic curve (AUC) is a recommended alternative non-parametric measure of classification performance (Szymańska et al. 2012), thus equivalent plots of AUC_Full and AUC_cv metrics are also generated for comparison.

2.3.3 Permutation test

Following hyperparameter optimisation, a permutation test was applied to the optimal model configuration. In a permutation test, the expected outcome label is randomised (permuted), and the model with fixed hyperparameter values is subsequently trained and evaluated (Lindgren et al. 1996). For both PLS-DA and ANN, this process was repeated (n = 100) using fivefold CV to construct a distribution of the permuted model statistics. While R²and Q² statistics are commonly used in permutation testing (Eriksson et al. 2013), AUC_Full and AUC_cv metrics were also included for ANNs, given its common usage as a measure of non-linear classification performance.

2.3.4 Model evaluation using test set

As previously described in Sect. 2.1.2, the measure of the predictive ability of the model using a test set is calculated by comparing the training score (${\mathbf{Y}}_{{{\mathbf{train}}}}^{*}$) to the expected outcome (Y_train) classification, and the test score (${\mathbf{Y}}_{{{\mathbf{test}}}}^{*}$) to the expected outcome (Y_test) classification. This is visualised using three plots:

1.
A violin plot that shows the distribution of the predicted score, by outcome, for the training and test set.
2.
A probability density plot that shows the distribution of the predicted score, by outcome, for the training and test set via overlapping probability density functions.
3.
A receiver operator characteristic (ROC) curve of the training and test sets.

2.3.5 Model evaluation using bootstrap resampling

Model evaluation using bootstrap resampling is described in Sect. 2.1.2. Following bootstrap resampling (n = 100), a measure of generalised prediction of each model is calculated and visualised using the protocol described in 2.3.4, except this time presenting the 95% confidence intervals of the 100 in-bag and out-of-bag predictions.

2.3.6 Model visualisation: scores plot & weights plot

Pairwise latent variable scores plots and associated weight vector plots are also provided. The scores plots are similar in construction to those generated during hyperparameter optimisation, except they are based on the in-bag and out-of-bag scores averaged across repeated prediction for each sample (aggregate score). 95% confidence intervals for each class are calculated using standard parametric methods. The 95% confidence intervals for each weight vector plots were constructed using the distribution of each weight variable across the 100 bootstrap resampled models. Any metabolite weight with a confidence interval crossing the zero line (coloured blue) are considered non-significant to the latent variable (or neuron).

2.3.7 Variable contribution plots

The B_PLS coefficients and VIP scores for the PLS models were calculated using the methods described in Sect. 2.1.4. The CWA and Garson scores were calculated for the ANNs using the methods described in Sect. 2.2.4. There metrics were also applied to all 100 models of each type generated during the bootstrap resampling. Variable contribution plots were constructed. The 95% confidence intervals for each vector plots were calculated using the distribution of each variable’s metric across the 100 bootstrap resampled models. Any metabolite weight with a confidence interval crossing the zero line are considered non-significant to the latent variable (or neuron).

The variable contribution metrics for each model type was compared and contrasted through visual inspection of a scatter plots of B_PLSvs. CWA_ANN and of VIP_PLS vs.Garson_ANN scores, and by calculating the associated Pearson’s correlation coefficient.

3 Results

3.1 Datasets

In this study, a previously published dataset by Chan et al. (2016) was used to illustrate the standardised PLS workflow and the proposed equivalent ANN workflow. This urine nuclear magnetic resonance (NMR) dataset, comprised of 149 metabolites, is publicly available on Metabolomics Workbench (Study ID: ST0001047). For the work described herein a binary classification was performed: gastric cancer (n = 43) vs. healthy controls (n = 40).

The computational libraries developed for this study require data to be converted to a standardised format using the tidy data framework (Wickham, 2014). This standardised format has been previously described (Mendez et al. 2019b, 2019c), and allows for the efficient reuse of these workflows for other studies. To demonstrate this, we include the application of the identical workflows and visualisation techniques to a second previously published dataset (Ganna et al. 2016) as a supplementary document. This plasma liquid chromatography-mass spectrometry (LC–MS) dataset, comprised of 189 named metabolites, is publicly available on MetaboLights (Study ID: MTBLS90), and for this study, samples were split into two classes by sex: males (n = 485) and females (n = 483). This dataset did not report QC measurements and therefore the data cleaning step was unable to be performed.

Following data cleaning, for the urine NMR gastric cancer data set 52 metabolites were included in data modelling (case = 43 vs. control = 40). Figures 3, 4, 5 and 6 (and Supplementary Figs. S1-2) show the optimisation, visualisation, evaluation and statistical inference for the PLS-DA compared to the ANN algorithms. Similar plots are provided in supplementary documentation for the plasma LC–MS data set (males = 485 vs. females = 483). All 4 workflows are also available as interactive Jupyter notebooks (https://cimcb.github.io/MetabProjectionViz/), either to be downloaded or to be run in the cloud through mybinder.org. See Mendez et al. (2019b) for guidance.

3.2 Model optimisation

Using the = $\left| {R^{2} - Q^{2} } \right| vs. Q^{2}$ plot, both the number of latent variables (LV = 2; Fig. 3b) and ANN hyperparameters (learning rate = 0.03 & hidden neurons = 2; Fig. 3d) were clearly interpretable. These findings were verified using permutation testing (Supplementary Fig. 1).

3.3 Model evaluation and visualisation

Strategies for model evaluation and visualisation were successfully transferred from PLS-DA to ANNs. For both example data sets the ANN model performed slightly better than the PLS-DA for both the training and test data sets (Fig. 4). Both models somewhat overtrained despite rigorous cross-validation. For the PLS-DA model the AUC_Train = 0.97 and the AUC_Test = 0.89. For the ANN model the AUC_Train = 1.00 and AUC_Test = 0.90. Bootstrap remodelling also showed similar results. The PLS-DA model had an in-bag area under the ROC curve (AUC) with 95% CI of 0.92–0.99. Similarly, the ANN produced an in-bag AUC with 95% CI of 0.95–0.99. The out-of-bag predictions showed that both models overtrained with out-of-bag AUC 95% CI of 0.72–0.98 (PLS-DA) and 0.77–1.00 (ANN). The bootstrap projections confirmed these findings and illustrated that the models were still able to project significant mean differences between classes, for both the in-bag and out-bag projections (Fig. 5).

3.4 Model inference

Feature contribution was determined by calculating bootstrap confidence intervals for the model coefficients B_PLS (or equivalent CWA_ANN) and of the VIP_PLS (or equivalent Garson_ANN). Across the two models, B_PLS and CWA_ANN showed a high degree of correlation (Fig. 6a; Pearson’s r = 0.85, p = 2.8 × 10⁻¹⁵). Twenty-three metabolites significantly contributed to the PLS-DA model and 25 metabolites significantly contributed to the ANN model, with an overlap of 17 metabolites being significant in both models (Fig. 6a). The VIP_PLS and Garson_ANN values showed a reduced, but still significant, degree of correlation with each other (Fig. 6b; Pearson’s r = 0.75, p = 1.33 × 10⁻¹⁰). Based on median values alone (Fig. 6b), 12 metabolites were deemed as “important” across both models and an additional 12 metabolites were “important” in one, but not both models. When taking into consideration bootstrapped confidence intervals (Fig. 6d) VIP_PLS and Garson_ANN yielded 7 and 8 “important” metabolites, respectively. Six metabolites deemed “important” by Garson_ANN were also deemed important by VIP_PLS. Although mathematical calculations for variable contribution were different for the two models, Fig. 6 shows that the overall visualisation strategy was transferrable.

4 Discussion

The migration of the PLS-DA optimisation, evaluation, and interpretation workflow to a single hidden layer ANN was successful. The strategy for visualising hyperparameter optimisation was adapted to the $\left| {R^{2} - Q^{2} } \right| vs. Q^{2}$ plot (Fig. 3c–d) and readily employable to both model types. Not only did it allow for simultaneous interpretation of 2 hyperparameters (ANNs), but it provides an alternate interpretation strategy for PLS-DA optimisation if the standard R² and Q² vs hyperparameter value plot is ambiguous. Model evaluation and projection (scores) plots were directly transferrable from PLS-DA to ANNs. Projecting the neuron weights (in place of latent variables) before the transfer function allows for a comparative and clear visual disruption of sample similarity. The bootstrap resampling/remodelling enabled both the PLS-DA and ANN models’ predictions to be interpreted with statistical rigor. Both models had similar performance, but as described (and expected) in the bootstrap projections (Fig. 5) and loadings (Supplementary Fig. S2).

CWA and Garson provided suitable variable contribution metrics for the ANN model. The surprising similarity between B_PLS and CWA_ANN, and VIP_PLS and Garson_ANN indicates the validity of both CWA_ANN and Garson_ANN as methods of determining feature importance. These findings are validated by the second study (supplementary documentation). It is important to note that no one ML method will be superior for identifying the most biological plausible metabolites. The high level of overlap between comparable variable contribution methods, in these results, suggest that deviations are likely random false discoveries due to lack of power (as reflected in the 95% CIs are how close they are to the zero line). As the cut-off for both VIP and Garson_ANN are not statistically justified limits (Tran et al. 2014), we recommend opting for B_PLS for PLS and CWA_ANNfor ANN, and using the 95% CI from bootstrap resampling to determine statistically significant metabolites.

As a side note, it is worth discussing two additional points. First, there is an advantage of using bootstrap resampled predictions and projections once the optimal hyperparameters are fixed. This is particularly important if the sample size is small and there may be large differences in results depending on how the samples are split into training and test sets. The out-of-bag predictions provide an unbiased estimate of model performance, and the averaged out-of-bag projections a more realistic estimate of generalised class-based cluster similarity. Bootstrapping can also aid in preventing false discoveries regarding metabolite significance, as the resulting 95% CIs will identify metabolites with unstable contributions to the model. Second, model outcomes and resulting interpretations can affected by the quality of the input data. We have previously shown that PLS and ANNs show similar predictive ability, when using the same input data, and that sample size is an important determinant of model stability (Mendez et al. 2019c). However, to our knowledge, an extensive comparison of different data cleaning (Broadhurst et al. 2018), pre-treatment (van den Berg et al. 2006), and imputation (Di Guida et al. 2016; Do et al. 2018) procedure options has not been performed for ANNs. As such, individual users should consider and test these effects prior to modelling their own data.

5 Conclusion and future perspectives

We have shown that for binary discrimination using metabolomics data it is possible to migrate the workflow from PLS-DA to a single hidden layer non-linear ANN. For the two presented examples the ANN does not perform any better than PLS-DA, and based on coefficient plots there is very similar feature contribution. However, these results show that ANNs can be evaluated alongside PLS-DA for any data set (using the provided Jupyter notebooks it is possible to evaluate any binary classification data set provided it is formatted appropriately before uploading). If a highly non-linear relation should arise, then ANN may be a better approach to PLS. This remains to be proven.

More importantly these results open the door to investigating more complex models. As discussed previously (Mendez et al. 2019a), an area of increasing interest to the metabolomics community is multi-block data integration (e.g. multi-omic or multi-instrument). Currently, methods employed are based on hierarchical application of multiple linear projection models. For example, OnPLS (Löfstedt and Trygg, 2011; Reinke et al. 2018) is a combinatorial amalgamation of multiple PLS models, and Mixomics (Rohart et al. 2017) is a stepwise integration of canonical correlation analysis and sparse PLS. The inherent flexibility of ANN architecture allows complex relationships to be combined into a single model. It may be possible to build an ANN to combine multiple data blocks into a single model without resorting to over-simplified data concatenation. For these types of models to be useful will be necessary to incorporate feature importance, and interpretable visualisation strategies. The work presented here is a first step to applying statistical rigor and interpretability to more complex ANN models.

Data availability

The metabolomics and metadata used in this paper were retrieved Metabolomics Workbench (https://www.metabolomicsworkbench.org/) Study ID: ST0001047, and from Metabolights (https://www.ebi.ac.uk/metabolights/) study identifier: MTBLS90. This data were converted from the original data format to a clean format compliant with the Tidy Data framework, this is available at the CIMCB GitHub project page: https://github.com/CIMCB/MetabProjectionViz.

Software availability

All software developed for this paper is available at the CIMCB GitHub project page: https://github.com/CIMCB.

References

Bishop, C. M. (1995). Neural networks for pattern recognition. New York, United States of America: Oxford University Press.
Google Scholar
Bokeh Development Team (2018). Bokeh: Python library for interactive visualization. https://bokeh.pydata.org/en/latest/
Breiman, L. (2001). Random forests. Machine Learning,45, 5–32.
Google Scholar
Broadhurst, D. I., & Kell, D. B. (2006). Statistical strategies for avoiding false discoveries in metabolomics and related experiments. Metabolomics,2, 171–196.
CAS Google Scholar
Broadhurst, D., Goodacre, R., Reinke, S. N., Kuligowski, J., Wilson, I. D., Lewis, M. R., et al. (2018). Guidelines and considerations for the use of system suitability and quality control samples in mass spectrometry assays applied in untargeted clinical metabolomic studies. Metabolomics,14, 72.
PubMed PubMed Central Google Scholar
Chan, A. W., Mercier, P., Schiller, D., Bailey, R., Robbins, S., Eurich, D. T., et al. (2016). (1)H-NMR urinary metabolomic profiling for diagnosis of gastric cancer. British Journal of Cancer,114, 59–62.
CAS PubMed Google Scholar
Chollet, F. (2015). Keras. https://keras.io/
de Jong, S. (1993). SIMPLS: An alternative approach to partial least squares regression. Chemometrics and Intelligent Laboratory Systems,18, 251–263.
Google Scholar
Di Guida, R., Engel, J., Allwood, J. W., Weber, R. J. M., Jones, M. R., Sommer, U., et al. (2016). Non-targeted UHPLC-MS metabolomic data processing methods: A comparative investigation of normalisation, missing value imputation, transformation and scaling. Metabolomics,12, 93.
PubMed PubMed Central Google Scholar
DiCiccio, T. J., & Efron, B. (1996). Bootstrap confidence intervals. Statistical Science,11, 189–212.
Google Scholar
Do, K. T., Wahl, S., Raffler, J., Molnos, S., Laimighofer, M., Adamski, J., et al. (2018). Characterization of missing values in untargeted MS-based metabolomics data and evaluation of missing data handling strategies. Metabolomics,14, 128.
PubMed PubMed Central Google Scholar
Dunn, W. B., Broadhurst, D. I., Atherton, H. J., Goodacre, R., & Griffin, J. L. (2011). Systems level studies of mammalian metabolomes: the roles of mass spectrometry and nuclear magnetic resonance spectroscopy. Chemical Society Reviews,40, 387–426.
CAS PubMed Google Scholar
Efron, B. (1981). Nonparametric estimates of standard error—the jackknife, the bootstrap and other methods. Biometrika,68, 589–599.
Google Scholar
Efron, B. (1987). Better bootstrap confidence intervals. Journal of the American Statistical Association,82, 171–185.
Google Scholar
Efron, B. (1988). Bootstrap confidence—intervals—good or bad. Psychological Bulletin,104, 293–296.
Google Scholar
Efron, B. (2000). The bootstrap and modern statistics. Journal of the American Statistical Association,95, 1293–1296.
Google Scholar
Eriksson, L., Byrne, T., Johansson, E., Trygg, J., & Vikström, C. (2013). Multi- and megavariate data analysis: basic principles and applications (3rd ed.). Malmö, Sweden: Umetrics Academy.
Google Scholar
Favilla, S., Durante, C., Vigni, M. L., & Cocchi, M. (2013). Assessing feature relevance in NPLS models by VIP. Chemometrics and Intelligent Laboratory Systems,129, 76–86.
CAS Google Scholar
Ganna, A., Fall, T., Salihovic, S., Lee, W., Broeckling, C. D., Kumar, J., et al. (2016). Large-scale non-targeted metabolomic profiling in three human population-based studies. Metabolomics,12, 4.
Google Scholar
Garson, G. D. (1991). Interpreting neural network connection weights. AI Expert,6, 47–51.
Google Scholar
Geladi, P., & Kowalski, B. R. (1986). Partial least-squares regression: a tutorial. Analytica Chimica Acta,185, 1–17.
CAS Google Scholar
Goodacre, R. (2003). Explanatory analysis of spectroscopic data using machine learning of simple, interpretable rules. Vibrational Spectroscopy,32, 33–45.
CAS Google Scholar
Goodacre, R., Kell, D. B., & Bianchi, G. (1992). Neural networks and olive oil. Nature,359, 594–594.
Google Scholar
Gromski, P. S., Muhamadali, H., Ellis, D. I., Xu, Y., Correa, E., Turner, M. L., et al. (2015). A tutorial review: Metabolomics and partial least squares-discriminant analysis–a marriage of convenience or a shotgun wedding. Analytica Chimica Acta,879, 10–23.
CAS PubMed Google Scholar
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The Elements of Statistical Learning (2nd ed.). New York, United States of America: Springer.
Google Scholar
Kristensen, M.R.B. and Vinter, B. (2010) Numerical Python for scalable architectures, Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, Association for Computing Machinery, pp. 1–9.
Lindgren, F., Hansen, B., Karcher, W., Sjöström, M., & Eriksson, L. (1996). Model validation by permutation tests: Applications to variable selection. Journal of Chemometrics,10, 521–532.
CAS Google Scholar
Löfstedt, T., & Trygg, J. (2011). OnPLS—a novel multiblock method for the modelling of predictive and orthogonal variation. Journal of Chemometrics,25, 441–455.
Google Scholar
McKinney, W. (2010) Data Structures for Statistical Computing in Python. Proceedings of the 9th Python in Science Conference, 445, 51–56.
Mendez, K. M., Broadhurst, D. I., & Reinke, S. N. (2019a). The application of artificial neural networks in metabolomics: A historical perspective. Metabolomics,15, 142.
PubMed Google Scholar
Mendez, K. M., Pritchard, L., Reinke, S. N., & Broadhurst, D. I. (2019b). Toward collaborative open data science in metabolomics using Jupyter Notebooks and cloud computing. Metabolomics,15, 125.
PubMed PubMed Central Google Scholar
Mendez, K. M., Reinke, S. N., & Broadhurst, D. I. (2019c). A comparative evaluation of the generalised predictive ability of eight machine learning algorithms across ten clinical metabolomics data sets for binary classification. Metabolomics,15, 150.
PubMed PubMed Central Google Scholar
Olden, J. D., & Jackson, D. A. (2002). Illuminating the “black box”: a randomization approach for understanding variable contributions in artificial neural networks. Ecological Modelling,154, 135–150.
Google Scholar
Olden, J. D., Joy, M. K., & Death, R. G. (2004). An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data. Ecological Modelling,178, 389–397.
Google Scholar
Pedregosa, F., Varoquaux, l., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E., (2011). Scikit-learn: machine learning in Python. The Journal of Machine Learning Research,12, 2825–2830.
Google Scholar
Reinke, S. N., Galindo-Prieto, B., Skotare, T., Broadhurst, D. I., Singhania, A., Horowitz, D., et al. (2018). OnPLS-based multi-block data integration: A multivariate approach to interrogating biological interactions in asthma. Analytical Chemistry,90, 13400–13408.
CAS PubMed PubMed Central Google Scholar
Rohart, F., Gautier, B., Singh, A., & Lê Cao, K.-A. (2017). mixOmics: An R package for ‘omics feature selection and multiple data integration. PLOS Computational Biology,13, e1005752.
PubMed PubMed Central Google Scholar
Steinwart, I., & Christmann, A. (2008). Support Vector Machines. New York, United States of America: Springer.
Google Scholar
Szymańska, E., Saccenti, E., Smilde, A. K., & Westerhuis, J. A. (2012). Double-check: Validation of diagnostic statistics for PLS-DA models in metabolomics studies. Metabolomics,8, 3–16.
PubMed Google Scholar
Theano Development Team (2016) Theano: A Python framework for fast computation of mathematical expressions. arXiv:1605.02688.
Tran, T. N., Afanador, N. L., Buydens, L. M. C., & Blanchet, L. (2014). Interpretation of variable importance in partial least squares with significance multivariate correlation (sMC). Chemometrics and Intelligent Laboratory Systems,138, 153–160.
CAS Google Scholar
van den Berg, R. A., Hoefsloot, H. C. J., Westerhuis, J. A., Smilde, A. K., & van der Werf, M. J. (2006). Centering, scaling, and transformations: improving the biological information content of metabolomics data. BMC Genomics,7, 142.
PubMed PubMed Central Google Scholar
Virtanen, P., Gommers, R., Oliphant, T., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., Walt, S., Brett, M., Wilson, J., Millman, K., Mayorov, N., Nelson, A., Jones, E., Kern, R., Larson, E. and SciPy 1.0 Contributors (2019) SciPy 1.0—Fundamental algorithms for scientific computing in Python. arXiv:1907.10121.
Westerhuis, J. A., Hoefsloot, H. C. J., Smit, S., Vis, D. J., Smilde, A. K., van Velzen, E. J. J., et al. (2008). Assessment of PLSDA cross validation. Metabolomics,4, 81–89.
CAS Google Scholar
Wickham, H. (2014). Tidy data. Journal of Statistical Software,59, 1–23.
Google Scholar
Wilkins, M. F., Morris, C. W., & Boddy, L. (1994). A comparison of Radial Basis Function and backpropagation neural networks for identification of marine phytoplankton from multivariate flow cytometry data. Computer Applications in the Biosciences,10, 285–294.
CAS PubMed Google Scholar
Wold, H. (1975). Path models with latent variables: The NIPALS approach (pp. 307–357). Quantitative sociology: Elsevier.
Google Scholar
Wold, S., Johansson, E., & Cocchi, M. (1993). PLS: Partial least squares projections to latent structures, 3D QSAR in drug design: Theory. Kluwer/Escom, Dordrecht, The Netherlands: Methods and Applications.
Google Scholar
Wold, S., Sjöström, M., & Eriksson, L. (2001). PLS-regression: A basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems,58, 109–130.
CAS Google Scholar
Xia, J., Broadhurst, D. I., Wilson, M., & Wishart, D. S. (2013). Translational biomarker discovery in clinical metabolomics: An introductory tutorial. Metabolomics,9, 280–299.
CAS PubMed Google Scholar

Download references

Acknowledgements

This work was partly funded through an Australian Research Council funded LIEF grant (LE170100021).

Author information

Authors and Affiliations

Centre for Integrative Metabolomics & Computational Biology, School of Science, Edith Cowan University, Joondalup, 6027, Australia
Kevin M. Mendez, David I. Broadhurst & Stacey N. Reinke

Authors

Kevin M. Mendez
View author publications
You can also search for this author in PubMed Google Scholar
David I. Broadhurst
View author publications
You can also search for this author in PubMed Google Scholar
Stacey N. Reinke
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors conceived of the idea. KMM and DIB developed the software. KMM wrote the manuscript. DIB and SNR edited the manuscript.

Corresponding authors

Correspondence to David I. Broadhurst or Stacey N. Reinke.

Ethics declarations

Conflicts of interest

The authors have no disclosures of potential conflicts of interest related to the presented work.

Human and animal rights

No research involving human or animal participants was performed in the construction of this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 11914 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Mendez, K.M., Broadhurst, D.I. & Reinke, S.N. Migrating from partial least squares discriminant analysis to artificial neural networks: a comparison of functionally equivalent visualisation and feature contribution tools using jupyter notebooks. Metabolomics 16, 17 (2020). https://doi.org/10.1007/s11306-020-1640-0

Download citation

Received: 30 November 2019
Accepted: 13 January 2020
Published: 21 January 2020
DOI: https://doi.org/10.1007/s11306-020-1640-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Migrating from partial least squares discriminant analysis to artificial neural networks: a comparison of functionally equivalent visualisation and feature contribution tools using jupyter notebooks

Abstract

Introduction

Objectives

Methods

Results

Conclusion

Similar content being viewed by others

The application of artificial neural networks in metabolomics: a historical perspective

Informative metabolites identification by variable importance analysis based on random variable combination

Feature selection using distributions of orthogonal PLS regression vectors in spectral data

1 Introduction

2 Methods

2.1 Partial least squares discriminant analysis (PLS-DA)

2.1.1 PLS-DA optimisation

2.1.2 PLS-DA evaluation

2.1.3 PLS-DA visualisation

2.1.4 PLS-DA variable contribution

2.2 Artificial neural network (ANN)

2.2.1 ANN optimisation

2.2.2 ANN evaluation

2.2.3 ANN visualisation

2.2.4 ANN variable contribution

2.3 Computational workflow

2.3.1 Prepare data

2.3.2 Hyperparameter optimisation

2.3.3 Permutation test

2.3.4 Model evaluation using test set

2.3.5 Model evaluation using bootstrap resampling

2.3.6 Model visualisation: scores plot & weights plot

2.3.7 Variable contribution plots

3 Results

3.1 Datasets

3.2 Model optimisation

3.3 Model evaluation and visualisation

3.4 Model inference

4 Discussion

5 Conclusion and future perspectives

Data availability

Software availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflicts of interest

Human and animal rights

Additional information

Publisher's Note

Electronic supplementary material

Supplementary file1 (DOCX 11914 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation