Accurate and fast identification of minimally prepared bacteria phenotypes using Raman spectroscopy assisted by machine learning

Thomsen, Benjamin Lundquist; Christensen, Jesper B.; Rodenko, Olga; Usenov, Iskander; Grønnemose, Rasmus Birkholm; Andersen, Thomas Emil; Lassen, Mikael

doi:10.1038/s41598-022-20850-z

Accurate and fast identification of minimally prepared bacteria phenotypes using Raman spectroscopy assisted by machine learning

Article
Open access
Published: 30 September 2022

Volume 12, article number 16436, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Accurate and fast identification of minimally prepared bacteria phenotypes using Raman spectroscopy assisted by machine learning

Download PDF

Benjamin Lundquist Thomsen¹,
Jesper B. Christensen¹,
Olga Rodenko¹,
Iskander Usenov^2,3,
Rasmus Birkholm Grønnemose⁴,
Thomas Emil Andersen⁴ &
…
Mikael Lassen¹

4127 Accesses
15 Citations
4 Altmetric
Explore all metrics

Abstract

The worldwide increase of antimicrobial resistance (AMR) is a serious threat to human health. To avert the spread of AMR, fast reliable diagnostics tools that facilitate optimal antibiotic stewardship are an unmet need. In this regard, Raman spectroscopy promises rapid label- and culture-free identification and antimicrobial susceptibility testing (AST) in a single step. However, even though many Raman-based bacteria-identification and AST studies have demonstrated impressive results, some shortcomings must be addressed. To bridge the gap between proof-of-concept studies and clinical application, we have developed machine learning techniques in combination with a novel data-augmentation algorithm, for fast identification of minimally prepared bacteria phenotypes and the distinctions of methicillin-resistant (MR) from methicillin-susceptible (MS) bacteria. For this we have implemented a spectral transformer model for hyper-spectral Raman images of bacteria. We show that our model outperforms the standard convolutional neural network models on a multitude of classification problems, both in terms of accuracy and in terms of training time. We attain more than 96% classification accuracy on a dataset consisting of 15 different classes and 95.6% classification accuracy for six MR–MS bacteria species. More importantly, our results are obtained using only fast and easy-to-produce training and test data.

Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning

Article Open access 30 October 2019

A New and Fast Approach for Antimicrobial Resistance Detection: Combination of Artificial Intelligence and Surface-Enhanced Raman Spectra

Drug-resistant Staphylococcus aureus bacteria detection by combining surface-enhanced Raman spectroscopy (SERS) and deep learning techniques

Article Open access 16 September 2021

Introduction

While some health crises, such as the corona pandemic, are unforeseeable and require immediate measures, others are slow to develop, intractable in nature, but may in time become a larger threat to human health^1,2. An example of the latter is antimicrobial resistance (AMR)^3,4,5,6. AMR occurs when microbes, such as bacteria and fungi survive exposure to compounds that would normally inhibit their growth or kill them. This drives a process of selection, allowing strains with resilience to grow and spread. Although AMR is a naturally occurring process, it is dramatically accelerated by selective pressures such as overuse of antimicrobials^7,8,9,10,11. Conventional techniques used for identifying AMR in bacteria are disk diffusion test, epsilometer test, and microdilution, which require culturing and can take days^12,13. The long processing time of these techniques can be life threatening to the infected patient, but is also problematic, since the pathogenic bacteria might spread and infect more people. Therefore it is common practice to prescribe broad spectrum antibiotics to patients, which leads to unnecessary treatment¹⁴. Thus, the already widespread and increasing inadequacy of antimicrobial therapy, is attributed to the overuse of antimicrobials in healthcare and agriculture^5,8,15. In 2019, the World health organisation (WHO) declared AMR as “one of the 10 biggest global public health threats facing humanity” and according to a report released by the UN ad hoc Interagency Coordinating Group on Antimicrobial Resistance (IACG), if no action is taken, antimicrobial resistant pathogens could annually cause 10 million deaths by 2050².

To mitigate the potential disaster of a post antibiotic era, organisations such as WHO and IACG, are calling for the development of fast point-of-care diagnostic that will facilitate treatment with targeted antimicrobials^1,5. To achieve this many different technologies have been studied^{12,16,17,18,19}. One very promising technology is Raman spectroscopy (RS). RS is a technique based on inelastic scattering that occurs when photons collide with molecules and allows for unique signal decomposition for a wide range of molecules²⁰. Importantly, RS has the advantage of being fast, low-cost, label-free, and does not necessarily require pre-analytical cultivation. Several studies have shown that the capabilities of RS can be significantly strengthened when assisted by chemometric tools and machine learning (ML)^{19,21,22,23,24,25,26,27,28,29,30,31}. Yet, some shortcomings must be addressed before it will be a viable platform for reliable bacteria identification and point-of-care diagnostics applications. Foremost, RS is sensitive to factors such as the growth stage of the analysed cells, changes in measurement environment and inconsistency in sample preparation²³. Therefore, it is convenient to prepare samples in a way that reduce the difficulty of classification. Approaches such as preparing single bacteria or monolayer mats of bacteria are unfortunately complex and require expertise, custom equipment, and can take hours^25,32,33. Moreover, inconsistencies in sample preparation may cause changes to Raman spectra, necessitating more data for ML models to capture the breadth of variations needed to reach clinically relevant accuracies¹⁹. Furthermore, RS bacteria studies dealing with patient samples are rare and it cannot be assumed that using data from laboratory cultured samples will allow for accurate identification of genuine patient samples. Additionally, there is little to no endorsement of standards for Raman measurement parameters and sample preparation methods- and parameters^22,23. This lack, stupendously impedes consolidation of databases, slowing the aggregation of big-data that could be used for clinical applications. To reach clinically relevant accuracies using RS, these issues must be addressed and solving them all will require collective effort.

In this work, we focus on addressing the issues of simple sample preparation and changes in measurement environment³⁴. We reduce sample preparation to merely transferring the bacteria to the measurement environment (as shown in Fig. 1a), minimising the issue of sample inconsistency. This procedure comes with the additional benefit of removing sample preparation as an inhibiting parameter for data consolidation. Moreover, to alleviate the situation of limited data availability for ML model training, we have developed a novel spectral transformer (ST) ML model that is efficient after training on both small- and large RS bacteria datasets. To feed the ST with good representative training data we have developed a novel data augmentation algorithm, henceforth known as NoiseMix. We demonstrate that our ST model in conjunction with NoiseMix allows for accurate classification of both single bacteria and multilayer mats of bacteria all in one go, while importantly only relying on fast- and easy-to-produce training data acquired on thick multilayer mats of bacteria. To our knowledge, this is a completely new approach for acquiring training data and subsequently classification of bacteria using RS assisted by ML. Explicitly, we demonstrate the capabilities of our developed ST ML model and NoiseMix on a dataset consisting of 12 classes of bacteria from minimally prepared bacteria samples and 3 non-bacteria classes. We find that NoiseMix improves the average classification accuracy by 12.9% for the four different tests compared to only utilising class balancing and slope removal. Further, we demonstrate that the ST model can distinguish between antibiotic resistant- and susceptible phenotypes, i.e. MR S. epidermidis (MRSE), MS S. epidermidis (MSSE), 2 types of MR S. aureus (MRSA), and two types of MS S. aureus (MSSA). We obtain identification accuracies of 97.7$\%$ and 94.6$\%$ between MRSE-MSSE and MRSA-MSSA isolates, respectively. In addition to identifying minimally prepared samples, we perform detailed benchmark tests of the ST by comparing it with a convolutional neural network (CNN) developed in the work by Ho. et al. on multiple RS bacteria datasets²⁵. We find that our ST model significantly outperforms the CNN model in terms of computation time, which is improved by one order of magnitude, and that it generally outperforms the CNN model in terms of classification accuracy, for which we achieve an improvement of 7.5 $\%$ compared to the reference CNN model²⁵.

Results

A home-built Raman microscope is used to acquire training and validation datasets of minimally prepared bacteria samples. The schematics of our Raman microscope for acquiring Raman hyper-spectral maps is shown in Fig. 1b. The reason for using a home-built system is that it gives us the possibility to optimize the Raman microscopes signal-to-noise ratio (SNR) and tailor the system to the task of detecting bacteria. Hereby we can acquired Raman spectra using very short measurement times down to 0.1 second and also have a relatively cheap system compared to commercial Raman microscopes. For more details about the microscope and spectrometer, see the Methods section.

Fast- and simple training data acquisition

Successful classification of bacteria using RS and ML relies heavily on having a large training database to be used in the model training- and validation steps. The collection of data therefore often becomes as important as ML algorithms themselves, since over- or underrepresented data will lead to biased predictions. If RS is to be considered for fast in-situ diagnostic applications, the complexity and time cost of sample preparation must be significantly reduced^34,35,36. To explore how much we can simplify and reduce sample preparation time- and complexity, we experimented with simply transferring bacteria samples from a bacterial monoculture directly to a CaF$_2$ objective slide followed by Raman raster-scan measurements. This approach causes the depth of the bacterial samples to naturally vary from mono- to multilayer-deep mats, causing large variations in the intra-sample SNR³². Training data maps produced in this manner necessitates manual segmentation, as the maps may contain areas without bacteria (background). To avoid the need for manual segmentation, we instead produce training data exclusively from measurements of multilayer bacterial mats. However, data originating from measurements of multilayer bacterial mats has a limited SNR distribution compared to data acquired from bacterial mono- to multilayers. With the purpose of synthetically recreating the natural variances that may appear in test data, we produce training data by varying the spectroscope integration time from 0.1 to 1 seconds (10 averages for each acquisition). With this process, and an automated Raman spectroscopy setup (see Methods), we acquire several thousands of training spectra a day. Our final reference bacteria database contains in excess of 5200 raw Raman spectra for each of the 12 bacterial species and 3 non-bacteria species. All raw data is linearly pre-processed by a simple procedure (see Methods) before being used for either data augmentation, model training or model prediction.

Data augmentation using NoiseMix

With inspiration from computer vision wherein “extra” training data is often augmented for example by rotating, flipping, blurring, or adding white noise to images, we have developed a data augmentation algorithm (NoiseMix) that allows us to synthetically create additional training data and thereby enhance model generalization and performance. The NoiseMix augmentation algorithm (see Supplementary Material for technical details), works by taking fast and easy to produce Raman spectra from multilayer bacterial mats, and then mixing in data with even more “noise” of both the measurement surface/environment, and noise data from measurements in the environment. In addition to increasing the quantity of training-data examples, NoiseMix, as implemented here, brings two further advantages. Firstly, it allows a synthetic extension of the RS dataset towards the region of lower SNR distributions. In this sense, training data with an arbitrarily low SNR can in principle be realized, although the SNR is in practice kept above a certain minimum value to avoid inclusion of training examples consisting of pure noise. Remarkably, we find that the NoiseMix augmentation algorithm allows high-accuracy identification of single bacteria although the original training examples are exclusively gathered from multilayer bacterial mats. Secondly, the NoiseMix algorithm provides a means to leverage all data of class-imbalanced datasets by ensuring that all classes are represented by the same amount of data in each training epoch.

Raman identification of bacteria with the spectral transformer

Bacteria identification using RS has in recent years experienced a significant performance gain as deep-learning techniques such residual connections and CNNs have proved more capable than more classical supervised learning methods such as logistic regression and support vector machines^25,37,38. To improve even further upon this, we have developed an attention-based deep-learning model inspired by current state-of-the-art in computer vision and natural language programming. The ST model (sketched in Fig. 1c and explained in more detail in Methods) is a compact version of the standard transformer encoder³⁹, but differs by using sequence pooling to map the sequential outputs of the transformer to a singular class.

Our ST model architecture is initially parameterized by three arguments ST(-pe)(i, j, k), where i is the depth of the transformer encoder, j is the number of heads in the multi-head attention layer, k is the multilayer perceptron ratio, and the inclusion of -pe signifies an optional positional embedding. The three arguments were treated as additional hyperparameters of our model and were selected using a Tree-structured Parzen Estimator, using one training and validation split on the isolate classification task²⁵, i.e. we did not use our own RS data to fit our model architecture to the task at hand.

Our main results are summarized in Fig. 2, with a displaying the confusion matrix of the 15-class (12 bacteria and 3 non-bacteria) classification task. An overall accuracy in excess of 96$\%$ is achieved over the 12 bacteria classes using an ST-pe(1,10,3) model trained using the AdamW optimizer and applying NoiseMix. Figure 2b breaks down a comparison in accuracy between multiple different ML models, with and without applying NoiseMix, on the same 15-class classification task. We observe that augmenting training data with NoiseMix significantly improves model performance in the test phase for both the three ST models and the reference CNN model, and we find that both ST model architectures outperform the reference CNN model on our 15-class dataset.

In addition to model accuracy (given as ratio of correct bacteria classifications to total number of bacteria classifications) we also report a density metric in Fig. 2b. The density (or bacteria coverage) is defined as the ratio of bacteria classifications to the total amount of classifications made in each test. This metric is included in our case because part of our test data for some bacteria consists of data from background (see e.g. Fig. 3 below), and hence not every measurement should be affiliated with a bacteria type. Notably, the density metric is significantly increased by applying NoiseMix which is attributed to the algorithms’ ability to improve classification for low-SNR signals.

Figure 2c compares model classification performance on three different bacteria datasets (for an overview of the datasets and the applied training process, see Supplementary Material). Datasets “Bacteria ID 1”, and “Bacteria ID 2” originate from the work of Ho et. al.²⁵. For these datasets we observe only a marginal improvement, on average, by using either of the two tested ST models. The final dataset “E. coli binary” originates from our own RS database and contains Raman spectra from E. coli ATCC 25922 and E. coli ATCC 35218. For this dataset, the ST models again significantly outperforms the CNN models, suggesting that the ST architecture may perform well on a broader task of spectroscopy-based classification problems.

As a final performance benchmark, we compared the computation time of the ST model with that of the reference CNN model²⁵ (see Supplementary Material). We generally observe an approximate speed-up of one order of magnitude in favor of the developed ST model. However, it should be noted that a small amount of this speed-up may be caused by differences in model hyperparameters, such as weight decay, parameter amount, and learning rate, and that the difference therefore cannot solely be attributed to the model architectures.

Visualization and differentiation of E. coli isolates

For better understanding of the capability and performance of our developed ST model and NoiseMix we visualize the analysis by showing the Raman maps and ST prediction maps. We conduct tests both on monocultures and mixes of monocultures as seen in Fig. 3. Figure 3a,b show visual images of the test area for two monocultures of E. coli ATCC 25922 and E. coli ATCC 35218, respectively. The Raman maps are acquired with a step size of 1 $\upmu $m over an area 50$\upmu $m x 50$\upmu $m and are plotted for the ringbreathing mode vibrations of l-Phe (Raman shift 1004 cm$^{-1}$). Each Raman map consist of 2601 points and each point (Raman spectrum, 700-1600 cm$^{-1}$) is acquired from 10 averages with an integration time of 0.5 seconds, with a complete measurement time of 217 minutes. Comparing the visual images, Raman intensity maps, and the prediction maps in Fig. 3a,b, we find an excellent agreement between the different forms of visualization. From the Raman intensity contour maps depicted in Fig. 3, it is evident that the Raman intensity decreases in the demarcation zone between CaF$_2$ and bacteria. This is in part due to a thinner bacteria layer (monolayer) and in part due to the smaller laser-bacteria overlap. Without the NoiseMix method the ST prediction maps would underestimate the region with bacteria coverage and make significant more misclassifications in the demarcation zone between the CaF$_2$ and bacteria. Thus, the resulting decrease in SNR of the Raman signals has the consequence that the ML models, which is exclusively trained on multilayer bacterial mats underestimates the region covered with bacteria, and make a large number of misclassifications in the demarcation zone. However, by applying NoiseMix in the training phase, the ST model becomes extremely efficient even at detecting and identifying low concentrations of bacteria (monolayers) even though the original training data only contains measurements of multilayer bacterial mats. Which is attributed to the NoiseMix algorithms’ ability to improve classification for low-SNR Raman signals. We define a accuracy for a class as: correct/(crosses + correct), where crosses are all wrong predictions with values above >0.5 and excluding prediction of background (CaF$_2$). This gives a accuracy of 87.3% and 87.9% for Fig. 3a,b, respectively. Comparing the accuracies with the surface coverage we find that our ST classifier for this specific case is undetermined in approximately 10% of the time, where the prediction rate is lower than 0.5. The 15-class ST classifier makes primarily the misclassifications in the demarcation zone. Note that by increasing the integration time to 2 seconds, or more, this would decrease the occurrence of misclassifications, but has the consequence that the completely measurement time of one Raman map with 2601 Raman spectra would take more than 14 hours.

Figure 3c shows a random mix of E. coli ATCC 25922 and E. coli ATCC 35218 cultures. The two monoculture samples are transferred directly to the CaF$_2$ objective slide, where they are mixed and subsequently measured. From the visual image and the Raman map, no information about the mixture of E. coli ATCC 35218 and E. coli ATCC 25922 can be obtained. The only information that is deduced is that the layer is slightly thicker in the left side, which can be seen from the 10 pixels projection of the contour plot onto the x- and y-axis. However, from the ST prediction map we clearly see the mixture of the two E. coli bacteria. We find that the ST model, with NoiseMix applied in the model training phase, did not make any misclassifications and predicted only correct species, namely the E. coli, with an estimated density ratio of 48.8% of E. coli ATCC 25922 and 51.2% of E. coli ATCC 35218. The reason for this impressive classification result, where only E. coli is predicted is due to the thick layer of bacteria distribution of 4–6 $\upmu $m, Thus the Raman signal SNR is always relatively high. Further, we find an overall accuracy of 98.1% for E. coli ATCC 25922 and E. coli ATCC 35218, where the last 1.9% are undetermined data points with an equal prediction rate of 0.5, which sums up to approximately 49 points in the Raman map.

Testing for antibiotic resistance and -susceptibility

Figure 4 shows measurements and test for the differentiation of antibiotic resistant bacteria. For this proof-of-concept AST we collect Raman maps from clinical isolates of MR S. epidermidis ATCC 35984 (MRSE), MR S. aureus ATCC 252 (MRSA 252), MR S. aureus ATTCC 4951 (MRSA4951) and on MS S. epidermidis ATCC 14990 (MSSE), MS $\textit{S. aureus}$ ATTCC 4699 (MSSA 4699), and MS $\textit{S. aureus}$ ATCC 2752 (MSSA 2752). The overall model performance of the 15-class classifier on the MR-MS classification task can be seen in the confusion matrix in Fig. 2. The ST classifier also contains S. lugdunensis, S. haemolyticus and S. pettenkoferi these strains were chosen to represent biological variation, potential cross-interference for making a more difficult classification task for the ST, and to create a realistic view on the possibilities of our technique. Notably, we find that the ST distinguishes between MRSE and MSSE isolates of S. epidermidis with a prediction accuracy in excess of 99.5$\%$. Examples of predictions maps for the MRSE, MSSE, MRSA 252, and MSSA 2752 and reference bacteria can be seen in Fig. 4. In Fig. 4c,d the measurement of MRSA and MSSA is shown for two monocultures of MRSA 252 and MSSA 2752 reference bacteria, respectively. Figure 4c shows that the ST estimate the prediction rates (density surface coverage) to be 40.5$\%$ for CaF$_2$ background, 56$\%$ for MRSA 252, 0.4$\%$ for MSSA 2752 and 3.1$\%$ for E. coli ATCC 25922. Again, it is evident that in the demarcation zone between CaF$_2$ and the MRSA bacteria the misclassification rates are higher, due the decrease in SNR. For this measurement the ST does indeed make a 69 misclassifications, which can be seen from Fig. 4c, where predictions rate between 0.5 and up to 0.99 for E. coli ATCC 25922 is found. However, this could also be related to contamination of the test sample. In Fig. 4b measurements of MSSA 2752 are shown. We find that the prediction rates (surface coverage) are 41.6$\%$ for CaF$_2$ background, 55.4$\%$ for MSSA 2752 and 3$\%$ for MSSA 4699. The ST has a few misclassification, where the ST is predicting the bacteria to be MSSA 4699, as seen in Fig. 4b, again these are mostly found in the demarcation zone and is therefore related to the low SNR found here. By increasing the integration time, to 2 seconds or more, would have circumvented these misclassifications, however the since the map consisting of 2601 individual spectra, the acquisition time would take more than 14 hours. From the confusion matrix we find that the overall performance of the 15 class ST classifier has a prediction accuracy of 94.6$\%$, for the sub-matrix of the two MRSA and two MSSA isolates. If we compare our results with a binary classifier used in Ref.²⁵, where they distinguish between MRSA and MSSA with an achieving 89.1$\%$ accuracy, we find that our ST model clearly outperforms the CNN model. Note that if measurements only are conducted on thick layers of monocultures of bacteria we find that the ST has a very high accuracy. Not shown visually, but we find as an example for MSSA 2752 and MRSA 4951 accuracies of 99.7% and 99.9%, respectively. Which might not be surprising since the training validation datasets are very similar.

In addition to distinguishing between antibiotic resistant and antibiotic sensitive isolates, we also test our developed ST and NoiseMix method on single bacteria (few bacteria), as can be seen in Fig. 4a,b. The maps are acquired with 10 second integration-time, however without NoiseMix we found that the ST model could not identify any bacteria, thus demonstrating how NoiseMix improves the sensitivity of the ML models. The prediction rates (density surface coverage) for Fig. 4a are 96.8$\%$ CaF$_2$ background, 2.9$\%$ MRSE and 0.3$\%$ MSSE. The highest prediction peak for MSSE is only 0.15. Thus, the ST does not make any misclassification bewteen MRSE and MSSE or any other bacteria class. For Fig. 4b we find that the prediction rates are 93$\%$ CaF$_2$ background, 0.2$\%$ E. coli ATCC 35218, 1.3$\%$ MRSE, and 5.5$\%$ of MSSE. Again, the ST does not make any misclassification between MRSE and MSSE, since the highest prediction peak found for MSSE are 0.45. We it remarkably that our ST together with NoiseMix also allows high-accuracy identification of single bacteria although the original training examples are exclusively gathered from multilayer bacterial mats.

Identification of clinical E. coli isolates

In Fig. 3 we investigated the performance of the ST and NoiseMix on E. coli reference bacteria from the same clinical monoculture isolates. However, in order to demonstrate that our ST also potentially works for clinical patient isolates we conducted tests on three new E. coli clinical patient isolates obtained from the Department of Clinical Microbiology at Odense University Hospital. The E. coli isolates P1, P2, and P3 (shown in Fig. 5) are isolated from urine and were species identified from indole spot test (positive) and from plating on CHROMID®CPS ELITE agar plates (Biomérieux, USA). Note that the ST has never seen these Raman spectra before. Thus, the patient samples have or might have a slightly different phenotype, then the E. coli reference bacteria used for the training of the ST. We therefore would expect that the ST will return predictions for a mix of the two E. coli reference bacteria. The visual images and prediction maps for the 3 E. coli patient isolates are shown in Fig. 5. From the ST prediction maps we can estimate the overlap (prediction rates) with E. coli ATCC 25922 and E. coli ATCC 35218. We find that the average misclassification for the 3 patient samples are 1.4% and is partly due to the fact that the ST has not seen any training data for the 3 patient samples before. We again see that the misclassification is mostly found in the demarcation zone between the CaF$_2$ background and the bacteria mat and is therefore also related to the low Raman SNR. Antibiotic resistance profiles for the three clinical isolates and for the two E. coli ATCC strains were also performed using the disk diffusion test. From these data (see Supplementary Materials), it could suggest that P1 has indeed the highest similarity with E. coli ATCC 25922 in regards to antibiotic resistance profile, while P2 and P3 show a similar resistance pattern as E. coli ATCC 35218. As is evident in Fig. 5, the ST classification also prefers classifying the P1 isolate as E. coli ATCC 25922, while P2 and P3 are more often classified as being E. coli ATCC 35218, which indicates a tendency for the resistance profile of the isolates to guide the Raman measurements. However, more samples and measurements need to be conducted in order to verify this and make a conclusion. However, we can conclude that our ST indeed can distinguish within a few seconds/minutes microbial phenotypes of E. coli with an average classification accuracy of 98.6% for the three patient samples.

Discussion

For rapid identification of bacteria and to combat the spread of AMR we have conducted a proof-of-concept experiment using RS assisted by ML. We have demonstrated that RS is a promising technology for microbiology studies. For this we have developed an attention-based ML model and a novel data augumentation algorithm (NoiseMix) to obtain state-of-the-art results within bacteria identification. The ST model architecture used in this work, is inspired by the success of the visual transformer (VIT)⁴⁰ and compact convolutional transformers (CCT) for their ability to generalize well, when trained on small datasets⁴¹. In contrast to VIT’s and CCT’s, we found that when dealing with RS data, both splitting the Raman spectra into patches and implementing convolutions to induce an inductive bias, is detrimental to model performance. Furthermore, we have found that limiting model depth increases the efficacy of the model substantially, at least on problems with limited availability of data. We suspect this is due to the capacity of deep transformer models to overfit, which becomes a limiting factor, when intra-sample variance is high, as we observe for our datasets. Which also would be the case for practical implementations of RS for in-situ measurements in clinics and hospitals. We firmly believe that our method of novel data augumentation and RS assisted by our developed ST may close the gap between basic research and practical application in clinical laboratories⁴². We explicit demonstrated that our ST outperforms a state-of-the-art domain-specific residual CNN both in terms of accuracy, and computation time²⁵. The significant reduction in computation time importantly reduces both diagnosis time, and the cost of the diagnosis apparatus as the inference time of the ST is fast, even on low cost hardware. The ST models used in this work could also be applied to other spectroscopy-based classification problems such as cancer detection or mineral identification. Our Raman system assisted by the ST model distinguishes between 15 different classes with more than 96$\%$ overall classification accuracy, while the CNN has slightly lower overall classification accuracy 88.6%. Since this was a proof-of-concept our dataset only contains 15 classes, however, the database can easily be expanded to contain any number of bacteria and non-bacteria.

Comparing our method to methods which are currently being used at hospitals, namely labor and time demanding testing in laboratories, RS assisted by ML is an improvement with respect to speed, coverage, price and handling. Other technologies such as flow cytrometry, polymerase chain reaction and MALDI-TOF mass spectrometry, are also intensively studied for their potential as fast and reliable diagnostic technologies^12,16,17,18. The disadvantage of these technologies is that they require large expensive equipment, need special trained personnel and they cannot be used locally as a point-of-care diagnostic/screening tool. Importantly, mass spectrometers require cultivation, has difficulties to discriminate closely related bacterial species and to differentiate some antibiotic resistance phenotypes, such as MRSA and MSSA¹⁹. In contrast, we demonstrate that our RS assisted by the ST and NoiseMix approach enable accurate classification of different bacteria phenotypes, namely E. coli, S. Epidermidis, and S. Aureus. Importantly, our result is obtained with easy to produce Raman training data that was gathered from deep monoculture mats of bacteria. With this simple preparation approach for acquiring training data, we consistently achieve diagnosis times of less than a few minutes, if culturing is disregarded. The significance of our data collection method is paramount, as our approach facilitates easy, fast and cheap development of big datasets, which is crucial for clinical application. Consequently, it is possible to simply create training data from cultured bacteria, and then embed background and contaminant noise with NoiseMix into the fast-and easy-to-produce training data. This would allow for both fast data production, fast sample preparation, and would not need any form of filtering or culturing of the bacteria. It is therefore reasonable to assume that our approach can readily be adopted for direct diagnosis of sepsis from genuine patient samples, without any prepossessing. Assuming this, accurate diagnosis and therefore treatment with targeted antimicrobial can be achieved within few minutes.

Methods

Sample preparation

The bacteria come from bacterial isolates which were cultured overnight on agar plates and were sealed with parafilm and stored at 5 °C until sample preparation. Storage time varied, but was not found to result in spectral changes to strain or phenotype characteristics. All other sample preparation conditions were kept consistent between samples. Test samples were prepared separately from samples used for training, to ensure classification was not influenced by differences in sample preparation. To prepare samples for Raman measurement, a sample was simply transferred from a single colony directly to a sterilized CaF$_2$ Raman-grade objective slide.

Datasets and training details

Bacteria-surface + NoiseMix and Bacteria-surface: The Bacteria-surface training dataset is made up of three integration times for each class. The dataset consists of 12 classes of bacteria (E. coli ATCC 35218, E. coli ATCC 25922, methicillin-resistant S. epidermidis ATCC 35984 (MRSE), methicillin-sensitive S. epidermidis ATCC 14990 (MSSE), Micrococcus luteus, S. lugdunensis, S. haemolyticus, S. pettenkoferi, methicillin-resistant S. aureus ATCC 252, methicillin-resistant S. aureus ATTCC4951, methicillin-sensitive S. aureus ATTCC4699, methicillin-sensitive S. aureus ATCC 2752, and 3 non-bacteria classes, calcium fluoride, (CaF$_2$), agar, and polystyrene beads. The data of the bacterial classes in the Bacteria-surface training dataset were acquired by measuring over CaF$_2$ slides, which were completely covered by multilayer bacterial mats. The data of the CaF$_2$ background class in the Bacteria-surface training dataset, was acquired by measuring clean CaF$_2$ slides. The data of the agar class in the Bacteria-surface training dataset, was acquired by measuring over CaF$_2$ slides covered by a deep layer of agar. The data of the polystyrene class in the Bacteria-surface training dataset, was acquired by measuring over CaF$_2$ slides, which were completely covered by polystyrene beads. For tests using NoiseMix e.g. in Figs. 2,3,4, the CaF$_2$ and agar Bacteria-surface training data, is used as mixing inputs for the algorithm. The Bacteria-surface test dataset used in Fig. 2, consists of 12 classes of bacteria and 3 non-bacteria classes. Each class in the Bacteria-surface test dataset is represented by one measurement over a partially covered CaF$_2$ surface. The bacteria classes in the Bacteria-surface test dataset, is therefore not represented by the same number of bacteria Raman spectra. The Bacteria-surface validation dataset is produced in the same way as the Bacteria-surface test dataset but does not contain all 15 classes. The measurements shown in Figs. 3,4,5 are acquired following the same procedure used to produce the Bacteria-surface test dataset. Pre-processing of the Bacteria-surface training dataset data, consists of normalising each spectrum between 0-1. Pre-processing of data shown in Figs. 3,4,5 of the Bacteria-surface test and validation data, consists of two steps. (i) the slope of the spectra’s are removed by subtracting the linear function between the start and end values of the spectra’s, and (ii) a normalisation step in which each Raman spectra is normalised between 0 and 1. For the results shown in Figs. 3,4,5, we use 100$\%$ of data from the Bacteria-surface training dataset for training and then use the held-out Bacteria-surface validation dataset for model selection. As the validation set is produced with the same procedure as the actual test dataset it is a better indicator of model classification efficacy.

Bacteria ID 1: The models are trained on the reference dataset from Stanford²⁵, which consists of 30 bacterial and yeast isolates with 2000 spectra for each of the 30 isolates. The models were then fine-tuned on the reference fine-tuning dataset which consists of 30 bacterial and yeast isolates with 100 spectra for each of the 30 isolates²⁵. The models are subsequently tested on the reference test dataset, consisting of 30 bacterial and yeast isolates with 100 spectra for each of the 30 isolates²⁵.

Bacteria ID 2: The models were trained only on the reference fine-tuning dataset, and subsequently tested on the reference test dataset²⁵.

E. coli binary: The models were trained and tested on binary datasets consisting of E. coli ATCC 35218 and E. coli ATCC 25922. The data of the E. coli binary datasets were acquired by measuring over CaF$_2$ slides, which were covered by multilayer bacterial mats. The E. coli binary training dataset has 5180 spectra for each class, and each class is made up of two different integration times, each containing 2590 spectra. The E. coli binary test dataset has 2590 spectra for each class, and the integration times are different from those of the training set. Pre-processing for the E. coli binary datasets, consists of two steps performed automatically without user intervention: (i) a baseline-correction step using Zhangfit⁴³, and (ii) a normalisation step in which each Raman spectra is normalised between 0 and 1.

Raman microscope

The Raman microscope for acquiring Raman data is shown in Fig. 1b. The Raman microscope uses a 785-nm excitation laser (TA pro, Toptica, Germany) with 60 mW of power. The pump beam is spatial cleaned with a 1 meter long single-mode (SM) fiber (PANDA PM FC/PC to FC/APC Patch Cable) with 5.3 $\upmu $m mode-field diameter. A long-working-distance $100\times $ microscope objective (MO) (LMPLN-IR/LCPLN-IR, numerical aperture NA = 0.85) from Olympus is used both for imaging, focusing the excitation laser and collecting the backscattered light. The bacteria samples are placed on Raman graded calcium fluoride (CaF$_2$) objective slides and the position is controlled with an automated XYZ scanning stage. A dichroic mirror (DM) (high pass 750 nm, Semrock) is used to couple the visible illumination light to a charge-coupled device (CCD) for imaging. A second DM (high pass 800 nm) is used to separate the Raman signal from the pump. Additional filters (high pass, 800 nm, Semrock) and (band pass, 785 nm ± 10 nm, Semrock) are used for filtering of the 785-nm pump. A 5 m long multi-mode (MM) fiber (ø200 m, 0.39 NA, FC/PC to FC/PC Patch Cables) collects the Raman signal and directs it to the spectrometer. For acquiring Raman spectra, we use a HR320 Horiba spectrometer. All measurements were performed with a slit size of 300 $\upmu $m and the grating used has a line density of 950 L/mm. A thermoelectrically cooled charge-coupled device (CCD) is used for detection (Synapse, 1024 256 with each pixel size of 26 $\upmu $m). The CCD pixels are binned in clusters of 2x20 pixels to reduce noise and hereby increase SNR. With each acquired Raman spectra consisting of 480 points in the range from 700-1600 cm$^{-1}$, the spectral resolution of the spectrometer is approximately 10 cm$^{-1}$.

Raman maps

To control the position and change the sampling point for RS, we use an XYZ scanning stage from Applied Scientific Instrumentation (ASI). The ASI stepper motors provide precise control through the use of closed-loop DC servomotors employing high-resolution encoders for positioning and feedback. The XY stage has a range of travel of 100 mm $\times $ 100 mm and a positional accuracy of approximately 200 nm. Custom made Python software was developed for the automation of the complete Raman microscope to asynchronously control the scanning stage and Horiba spectrometer for acquiring hyperspectral Raman maps of the bacteria samples.

Spectroscopic calibration

For spectral calibration (and optimization) of the Raman microscope and and calibration of the translation stage, we use polystyrene beads ranging in size from 1–5 $\upmu $m. The polystyrene beads are comparable in size to bacteria and constitute multiple Raman peaks in the same Raman shift region as the bacteria. From the measurements and ST prediction maps we estimate that the spatial resolution of the Raman maps are $\approx $ 2 $\upmu $m $\pm 500$ nm) and for the ST prediction map are $\approx $ 3 $\upmu $m $\pm 500$ nm).

Pre-processing of data

The raw Raman spectra were initially cleaned from cosmetic spikes. Subsequently the linear function between the start and end values of each spectrum is identified and subtracted. As a final pre-processing step, the spectra were individually normalized to the range between zero and one. Notably, we have also investigated baseline-correction methods using Zhangfit [36], however we found that any kind of non-linear baseline removal was detrimental to model performance, especially when used in conjunction with NoiseMix.

NoiseMix and the data-augmentation process

To improve model performance in the test phase, we apply data augmentation in the model traning phase. The algorithm NoiseMix works by randomly selecting and subsequently mixing bacteria spectra $S_{bacteria}(\nu )$ and background spectra $S_{bg}(\nu )$. An augmented Raman spectra $S_{bacteria}^{(aug)}(\nu )$ is then given by

$$\begin{aligned} S_{bacteria}^{(aug)}(\nu ) = (1 - \alpha ) S_{bacteria}(\nu ) +\alpha S_{bg}(\nu ) \end{aligned}$$

where $\alpha $ is chosen randomly from a uniform distribution in the range $[0, \alpha _{max}]$, and $\alpha _{max} <1$ is an upper bound for the contribution of background spectra.

Spectral-transformer architecture

The ST ML model developed here is a compact version of the standard transformer encoder³⁹, but differs in that it uses sequence pooling to map the sequential outputs to a singular class The structure of the ST model can be seen in Fig. 1c. It consists of an optional positional embedding layer (ST-pe), followed by a dropout layer. The next layer is a block that sequentially contains, layer norm, multihead attention (MHA), layer norm, and then a multilayer perceptron (MLP) with a GELU nonlinearity. This is followed by layer norm, and then a sequence pooling layer. Finally, the output layer is a fully connected linear layer. Our ST architecture is parameterised by three arguments ST(i,j,k), where i is the depth of the transformer encoder, j is the number of heads in the MHA layer, and k is the multilayer perceptron ratio. Hence, in the ST(1,2,7) version, the transformer encoder has a depth of 1, the MHA layer has 2 heads, and the hidden layer dimension of the MLP is 7 times larger than the MLP input dimension. These hyperparameters, as well as all hyperparameters used for training, were selected using a Tree-structured Parzen Estimator, using one training and validation split on the isolate classification task²⁵.

Accuracy and density

As we have included non-bacteria background classes in our model, we opted to use two performance metrics: accuracy and density. Accuracy is defined in the usual sense as the ratio of correct bacteria classifications to the total number of bacteria classifications. Density on the other hand is a measure of bacteria coverage, and is given as the number of bacteria classifications to the total number of classifications.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

World Health Organization. Global action plan on antimicrobial resistance (2015).
on Antimicrobial Resistance, I. C. G. Report to the secretary-general of the united nations (2019).
Stekel, D. First report of antimicrobial resistance pre-dates penicillin. Nature 562, 1 (2018).
Article Google Scholar
O’Neill, J. Tackling drug-resistant infections globally: final report and recommendations (2016).
Murray, C. J. et al. Global burden of bacterial antimicrobial resistance in 2019: A systematic analysis. The Lancet (2022).
bank group, W. Drug-resistant infections: A threat to our economic future (2017).
Tenover, F. C. Mechanisms of antimicrobial resistance in bacteria. Am. J. Med. 119, S3–S10 (2006).
Article CAS Google Scholar
Holmes, A. H. et al. Understanding the mechanisms and drivers of antimicrobial resistance. The Lancet 387, 176–187 (2016).
Article CAS Google Scholar
Dadgostar, P. Antimicrobial resistance: Implications and costs. Infect. Drug Resist. 12, 3903 (2019).
Article CAS Google Scholar
Nathan, C. Resisting antimicrobial resistance. Nat. Rev. Microbiol. 18, 259–260 (2020).
Article CAS Google Scholar
Aslam, B. et al. Antibiotic resistance: A rundown of a global crisis. Infect. Drug Resist. 11, 1645 (2018).
Article CAS Google Scholar
Khan, Z. A., Siddiqui, M. F. & Park, S. Current and emerging methods of antibiotic susceptibility testing. Diagnostics 9, 49 (2019).
Article CAS Google Scholar
Reverter, M. et al. Aquaculture at the crossroads of global warming and antimicrobial resistance. Nat. Commun. 11, 1870 (2020).
Article ADS CAS Google Scholar
Amann, S., Neef, K. & Kohl, S. Antimicrobial resistance (amr). Eur. J. Hosp. Pharm.: Sci. Pract. 26, 175–177 (2019).
Article Google Scholar
Abushaheen, M. A. et al. Antimicrobial resistance, mechanisms and its clinical significance. Dis. Mon. 66, 100971 (2020).
Article Google Scholar
Barghouthi, S. A. A universal method for the identification of bacteria based on general PCR primers. Indian J. Microbiol. 51, 430–444 (2011).
Article CAS Google Scholar
Florio, W., Tavanti, A., Barnini, S., Ghelardi, E. & Lupetti, A. Recent advances and ongoing challenges in the diagnosis of microbial infections by Maldi–Tof mass spectrometry. Front. Microbiol. 9, 1097 (2018).
Article Google Scholar
Hou, T.-Y., Chiang-Ni, C. & Teng, S.-H. Current status of Maldi–Tof mass spectrometry in clinical microbiology. J. Food Drug Anal. 27, 404–414 (2019).
Article CAS Google Scholar
Wang, L. et al. Applications of Raman spectroscopy in bacterial infections: principles, advantages, and shortcomings. Front. Microbiol. 12, 1 (2021).
Google Scholar
Jones, R. R., Hooper, D. C., Zhang, L., Wolverson, D. & Valev, V. K. Raman techniques: fundamentals and frontiers. Nanoscale Res. Lett. 14, 1–34 (2019).
Article Google Scholar
Ralbovsky, N. M. & Lednev, I. K. Towards development of a novel universal medical diagnostic method: Raman spectroscopy and machine learning. Chem. Soc. Rev. 49, 7428–7453 (2020).
Article CAS Google Scholar
Guo, S., Popp, J. & Bocklitz, T. Chemometric analysis in raman spectroscopy from experimental design to machine learning-based modeling. Nat. Protoc. 16, 5426–5459 (2021).
Article CAS Google Scholar
Lorenz, B., Wichmann, C., Stöckel, S., Rösch, P. & Popp, J. Cultivation-free raman spectroscopic investigations of bacteria. Trends Microbiol. 25, 413–424 (2017).
Article CAS Google Scholar
Novelli-Rousseau, A. et al. Culture-free antibiotic-susceptibility determination from single-bacterium raman spectra. Sci. Rep. 8, 1–12 (2018).
Article CAS Google Scholar
Ho, C., Jean, N. & Hogan, C. Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning. Nat. Commun. 10, 4927 (2019).
Article ADS Google Scholar
Ashton, L., Lau, K., Winder, C. L. & Goodacre, R. Raman spectroscopy: Lighting up the future of microbial identification. Fut. Microbiol. 6, 991–997 (2011).
Article CAS Google Scholar
Eberhardt, K., Stiebing, C., Matthäus, C., Schmitt, M. & Popp, J. Advantages and limitations of Raman spectroscopy for molecular diagnostics: An update. Expert Rev. Mol. Diagn. 15, 773–787 (2015).
Article CAS Google Scholar
Chang, K.-W. et al. Antibiotic susceptibility test with surface-enhanced raman scattering in a microfluidic system. Anal. Chem. 91, 10988–10995 (2019).
Article CAS Google Scholar
Strola, S. A. et al. Single bacteria identification by raman spectroscopy. J. Biomed. Opt. 19, 111610 (2014).
Article ADS Google Scholar
de Siqueira e Oliveira, F. S., da Silva, A. M., Pacheco, M. T. T., Giana, H. E. & Silveira, L. Biochemical characterization of pathogenic bacterial species using raman spectroscopy and discrimination model based on selected spectral features. Las. Med. Sci.36, 289–302 (2021).
Duraipandian, S., Petersen, J. & Lassen, M. Authenticity and concentration analysis of extra virgin olive oil using spontaneous Raman spectroscopy and multivariate data analysis. Appl. Sci. 9, 2433 (2019).
Article CAS Google Scholar
García-Timermans, C. et al. Label-free Raman characterization of bacteria calls for standardized procedures. J. Microbiol. Methods 151, 69–75 (2018).
Article Google Scholar
Rousseau, A. N. et al. Fast antibiotic susceptibility testing via raman microspectrometry on single bacteria: An mrsa case study. ACS Omega 6, 16273–16279 (2021).
Article CAS Google Scholar
Kloß, S., Rösch, P., Pfister, W., Kiehntopf, M. & Popp, J. Toward culture-free Raman spectroscopic identification of pathogens in ascitic fluid. Anal. Chem. 87, 937–943 (2015).
Article Google Scholar
Franco-Duarte, R. et al. Advances in chemical and biological methods to identify microorganisms-from past to present. microorganisms (2019).
Pahlow, S. et al. Isolation and identification of bacteria by means of Raman spectroscopy. Adv. Drug Deliv. Rev. 89, 105–120 (2015).
Article CAS Google Scholar
Lussier, F., Thibault, V., Charron, B., Wallace, G. Q. & Masson, J.-F. Deep learning and artificial intelligence methods for Raman and surface-enhanced Raman scattering. TrAC, Trends Anal. Chem. 124, 115796 (2020).
Article CAS Google Scholar
Lu, W., Chen, X., Wang, L., Li, H. & Fu, Y. V. Combination of an artificial intelligence approach and laser tweezers Raman spectroscopy for microbial identification. Anal. Chem. 92, 6288–6296 (2020).
Article CAS Google Scholar
Parmar, N. et al. Image transformer 4055–4064 (2018).
Shao, R., Shi, Z., Yi, J., Chen, P.-Y. & Hsieh, C.-J. On the adversarial robustness of visual transformers. arXiv e-prints arXiv–2103 (2021).
Hassani, A. et al. Escaping the big data paradigm with compact transformers. arXiv preprint arXiv:2104.05704 (2021). https://arxiv.org/pdf/2104.05704.pdf.
Kong, K., Kendall, C., Stone, N. & Notingher, I. Raman spectroscopy for medical diagnostics-from in-vitro biofluid assays to in-vivo cancer detection. Adv. Drug Deliv. Rev. 89, 121–134 (2015).
Article CAS Google Scholar
Zhang, Z. M., Chen, S. & Liang, Y. Z. Baseline correction using adaptive iteratively reweighted penalized least squares. Analyst 135, 1138–1146 (2010).
Article ADS CAS Google Scholar

Download references

Acknowledgements

We gratefully acknowledge fruitful conversations with Poul A. Jessen from BacAlert. RBG was affiliated with University of Southern Denmark at the time of the study and is currently affiliated with the Diagnostic Center, Department of Medicine, Odense University Hospital, Svendborg, Denmark. This research was funded by the Danish Agency for Institutions and Educational Grants and the Innovation Fund Denmark (IFD) under Eurostars project Bacsens (case No. 9046-00032A).

Author information

Authors and Affiliations

Danish Fundamental Metrology, Kogle Allé 5, 2970, Hørsholm, Denmark
Benjamin Lundquist Thomsen, Jesper B. Christensen, Olga Rodenko & Mikael Lassen
Institute of Optics and Atomic Physics, Technische Universität Berlin, Straße des 17. Juni 135, 10623, Berlin, Germany
Iskander Usenov
Art photonics GmbH, Rudower Ch 46, 12489, Berlin, Germany
Iskander Usenov
Research Unit of Clinical Microbiology, University of Southern Denmark and Odense University Hospital, J.B. Winsløws Vej 21.2, 5000, Odense, Denmark
Rasmus Birkholm Grønnemose & Thomas Emil Andersen

Authors

Benjamin Lundquist Thomsen
View author publications
You can also search for this author in PubMed Google Scholar
Jesper B. Christensen
View author publications
You can also search for this author in PubMed Google Scholar
Olga Rodenko
View author publications
You can also search for this author in PubMed Google Scholar
Iskander Usenov
View author publications
You can also search for this author in PubMed Google Scholar
Rasmus Birkholm Grønnemose
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Emil Andersen
View author publications
You can also search for this author in PubMed Google Scholar
Mikael Lassen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.B.C., I.U. and M.L. designed and built the Raman microscope. R.B.G. and T.E.A. prepared the bacteria for the experiments. J.B.C, O.R., and M.L obtained the main experimental Raman results. B.L.T. designed and developed the software for machine learning analysis. The paper was written by B.L.T., J.B.C. and M.L. with contributions from all authors. M.L. conceived and supervised the research.

Corresponding author

Correspondence to Mikael Lassen.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Thomsen, B.L., Christensen, J.B., Rodenko, O. et al. Accurate and fast identification of minimally prepared bacteria phenotypes using Raman spectroscopy assisted by machine learning. Sci Rep 12, 16436 (2022). https://doi.org/10.1038/s41598-022-20850-z

Download citation

Received: 26 June 2022
Accepted: 20 September 2022
Published: 30 September 2022
DOI: https://doi.org/10.1038/s41598-022-20850-z
Springer Nature Limited

This article is cited by

Optics miniaturization strategy for demanding Raman spectroscopy applications
- Oleksii Ilchenko
- Yurii Pilhun
- Anja Boisen
Nature Communications (2024)

Accurate and fast identification of minimally prepared bacteria phenotypes using Raman spectroscopy assisted by machine learning

Abstract

Similar content being viewed by others

Rapid identification of pathogenic bacteria using Raman spectroscopy and deep learning

A New and Fast Approach for Antimicrobial Resistance Detection: Combination of Artificial Intelligence and Surface-Enhanced Raman Spectra

Drug-resistant Staphylococcus aureus bacteria detection by combining surface-enhanced Raman spectroscopy (SERS) and deep learning techniques

Introduction

Results

Fast- and simple training data acquisition

Data augmentation using NoiseMix

Raman identification of bacteria with the spectral transformer

Visualization and differentiation of E. coli isolates

Testing for antibiotic resistance and -susceptibility

Identification of clinical E. coli isolates

Discussion

Methods

Sample preparation

Datasets and training details

Raman microscope

Raman maps

Spectroscopic calibration

Pre-processing of data

NoiseMix and the data-augmentation process

Spectral-transformer architecture

Accuracy and density

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Optics miniaturization strategy for demanding Raman spectroscopy applications

Search

Navigation