Introduction

Reconstruction of non-uniformly sampled (NUS) NMR spectra has become an important tool for obtaining ultra-high dimensional NMR spectra with high resolution in a short time. For example, being able to accurately reconstruct NUS NMR spectra allows for high-resolution four-dimensional methyl–methyl NOESY spectra for chemical shift assignments and characterisation of large proteins (Tugarinov et al. 2005; Vuister et al. 1993; Hyberts et al. 2012), five-dimensional spectra of intrinsically disordered proteins for chemical shift assignments (Krähenbühl et al. 2012; Kosiński et al. 2017), and fast characterisation of macromolecular dynamics (Linnet and Teilum 2016), amongst others. Various algorithms have been developed to reconstruct the full dataset from the sparsely sampled data (Hyberts et al. 2012; Ying et al. 2017; Coggins et al. 2012; Orekhov and Jaravine 2011; Balsgart and Vosegaard 2012; Holland et al. 2011; Kazimierczuk and Orekhov 2011), or otherwise to extract NMR parameters from the dataset (Eghbalnia et al. 2005; Murrali et al. 2018; Dutta et al. 2015; Pustovalova et al. 2018). Moreover, it has become clear that the accuracy with which information can be obtained from NUS NMR data is determined not only by the algorithm used to process the sparse data but also, to a substantial extent, by the sampling schedule used (Hyberts et al. 2012).

The development and application of deep neural networks (DNN) have seen impressive growth in the last decade, with many and highly diverse applications in all areas of science and technology, including spectroscopy. Traditional applications of DNN include image processing and speech recognition, whereas applications involving the analysis of EPR DEER spectra (Worswick et al. 2018) and reconstruction of MRI datasets (Han and Ye 2018; Hyun et al. 2018) are more recent. Briefly, a supervised DNN contains several layers that each transform an input tensor into an output vector or tensor. For example, a layer can be linear with an activation function; that is, the input tensor is first multiplied by a parameter-tensor and the result is then subjected to an elementwise mathematical operation, such as tanh(x). Training the neural network consists of determining the optimal parameters or parameter-tensors, and training is therefore a classical minimisation/optimisation problem. Although training a DNN can involve the optimisation of millions of parameters, highly efficient optimisers based on stochastic gradients (Kingma and Ba 2014) have been developed, and optimised neural networks now often outperform traditional tools used for analyses.

Reconstruction of NUS NMR data is well suited for supervised DNN, because of the simple architecture of the problem. The input data consists of the sparsely sampled data matrix and a sampling schedule, whereas the output data can consist of the fully sampled NMR spectrum in either the time domain or frequency domain. More importantly, a large training database can easily be generated since the general form of NMR spectra is well known.

It is shown below that a simple DNN based on long short-term memory (LSTM) networks can be trained to reconstruct sparsely sampled one-dimensional NMR spectra. Once the network is trained, the time required for reconstruction is similar to reconstruction times for traditional methods, such as iterative-soft-thresholding (IST) (Hyberts et al. 2012; Holland et al. 2011; Kazimierczuk and Orekhov 2011) and sparse multidimensional iterative lineshape-enhanced (SMILE) reconstruction (Ying et al. 2017). The network developed below is cross-validated using an experimental two-dimensional 15N–1H NMR spectrum of the 18 kDa T4 Lysozyme recorded with a large sweep width, so that arginine 15Nε–1Hε signals are also observed. In all cases, reconstruction with the DNN yields spectra with small RMSDs to the ‘true’ spectra and more accurate peak intensities in the frequency-domain spectrum compared to traditional algorithms. It is envisaged that with faster computer hardware, reconstruction and analysis of high-dimensional NMR spectra will be substantially improved in the future using deep learning and artificial intelligence.

Methods

Training the Deep Neural Network

The DNNs were trained on a standard desktop computer (Intel Core i7-6900K, 3.2 GHz, 64 GB RAM) equipped with an NVIDIA GeForce GTX 1080 Ti GPU graphics card. The tensorflow (Abadi et al. 2015) python package with the keras frontend was used to generate the network graphs, and the optimisation was performed within tensorflow using the stochastic ADAM (Kingma and Ba 2014) optimiser with standard parameters and a learning rate of 0.00012. The python nmrglue (Helmus and Jaroniec 2013) package was used to read and write NMR spectra in various formats.
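
As an illustration, the optimiser set-up described above could look roughly as follows in the tf.keras frontend (TensorFlow 2 argument names); the two-layer placeholder model merely stands in for the full DNN graph of Fig. 1, and all names and layer sizes are illustrative rather than taken from the actual training script.

```python
import tensorflow as tf

# Placeholder stand-in for the full DNN graph of Fig. 1 (illustrative only):
# input of length 2*sp = 64 (12.5% sampling) and output of length 2*np = 512.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(512, activation='tanh', input_shape=(64,)),
    tf.keras.layers.Dense(512)
])

# ADAM optimiser with standard parameters, except for the learning rate of 0.00012.
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.00012),
              loss='mse',       # mean-squared error cost function
              metrics=['mae'])  # mean-absolute error monitored during training
```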

Synthetic one-dimensional FIDs were generated using an in-house written c++ programme and stored in a binary 2D nmrPipe (Delaglio et al. 1995) format. The number of peaks in the synthetic FIDs was randomly chosen between 1 and (3/8)sp, where sp is the number of sampled complex points in the sampling schedule. Theoretically, two complex numbers, or equivalently four real numbers, {intensity, phase, transverse relaxation rate, frequency}, are required to define each signal in the one-dimensional spectrum. The ratio of 3/8, which is slightly smaller than 1/2, was chosen to avoid over-fitting. Thus, for sampling schedules with 32 sampled points a maximum of 12 signals was generated, and for sampling schedules with 48 sampled points a maximum of 18 signals was generated. For each signal in each of the synthetic FIDs, an intensity, I, a phase, φ, a transverse relaxation rate, R2, and a frequency, ν, were randomly chosen. The intensities were randomly chosen between 0 and 1, the R2 randomly chosen between 3 s⁻¹ and 100 s⁻¹, and the frequency randomly chosen over the entire sweep-width. Ideally, the phase, φ, should be predictable in NMR spectra. However, minor mis-calibrations and off-resonance effects can lead to small deviations from perfectly phased spectra. Therefore, for each signal in the synthetic FIDs, the phase was randomly chosen between −5° and +5°. In each run of optimisation, a series of 8 × 10⁶ synthetic FIDs were generated (the maximum amount that could be stored in memory during training). Prior to optimisation, the synthetic FIDs were normalised by the absolute value of the first point.
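
A minimal numpy sketch of this procedure, following the parameter ranges above, is shown below; the actual implementation was the in-house C++ programme, so the function and its defaults (256 complex points, a 5.1 kHz sweep width) are purely illustrative.

```python
import numpy as np

def synthetic_fid(n_points=256, sweep_width=5100.0, sp=32, rng=np.random):
    """Generate one synthetic, fully sampled complex FID (illustrative sketch)."""
    n_peaks = rng.randint(1, int(3 * sp / 8) + 1)             # 1 ... (3/8)*sp signals
    t = np.arange(n_points) / sweep_width                     # acquisition times (s)
    fid = np.zeros(n_points, dtype=complex)
    for _ in range(n_peaks):
        intensity = rng.uniform(0.0, 1.0)                     # I in [0, 1]
        phase = np.deg2rad(rng.uniform(-5.0, 5.0))            # small phase deviations
        r2 = rng.uniform(3.0, 100.0)                          # R2 in s^-1
        freq = rng.uniform(-sweep_width / 2, sweep_width / 2) # anywhere in the sweep width
        fid += intensity * np.exp(1j * phase) * np.exp(2j * np.pi * freq * t - r2 * t)
    return fid / abs(fid[0])                                  # normalise by the first point
```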

During each optimisation, 10% of the training data was not included in the optimisation but used purely for cross-validation. For each sampling schedule, the network was trained in runs consisting of 20 epochs until the average mean-squared error for the cross-validation set was less than 5 × 10⁻⁴ (1.5 × 10⁻⁴) and the mean-absolute error less than approximately 0.013 (0.008) for 12.5% (18.75%) sampled data. Typically 25–50 runs were needed, which took between 30 and 40 h depending on the specific sampling schedule used.
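
A sketch of such training runs with tf.keras is shown below, continuing from the compile sketch above; the dummy training arrays, batch size, and history key names are assumptions, and the thresholds are those quoted for 12.5% sampled data.

```python
import numpy as np

# x_train / y_train stand in for the synthetic sparse inputs and fully sampled
# target FIDs described above (dummy random arrays here, for illustration only).
x_train = np.random.randn(10000, 64).astype('float32')
y_train = np.random.randn(10000, 512).astype('float32')

mse_target, mae_target = 5e-4, 0.013            # thresholds for 12.5% sampled data
for run in range(50):                           # typically 25-50 runs were needed
    history = model.fit(x_train, y_train,
                        validation_split=0.1,   # 10% held out purely for cross-validation
                        epochs=20, batch_size=256, verbose=0)
    val_mse = history.history['val_loss'][-1]
    val_mae = history.history['val_mae'][-1]    # key name depends on the Keras version
    if val_mse < mse_target and val_mae < mae_target:
        break
    # in practice, a fresh set of 8 x 10^6 synthetic FIDs was generated for each run
```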

NMR Spectroscopy

A two-dimensional 15N–1H HSQC correlation spectrum was recorded on a uniformly 13C, 15N isotope-labelled sample of the L99A mutant of T4 Lysozyme (T4L L99A) that was prepared as described previously (Bouvignies et al. 2011; Werbeck et al. 2013). The NMR spectrum was recorded at 25 °C on a Bruker Avance III NMR spectrometer with a 1H operating frequency of 700 MHz and equipped with a helium-cooled TCI inverse cryoprobe. The fully sampled spectrum was acquired as a 1024 × 256 complex matrix with spectral widths of 12 kHz (1H) and 5.1 kHz (15N). An adiabatic 13C pulse was applied during the 15N chemical shift evolution, t1, to refocus the 15N–13Cα and 15N–13CO scalar couplings, and suppression of the H2O resonance was achieved using a flip-back pulse (Andersson et al. 1998) immediately after the first INEPT element. Four scans were collected for each t1 increment with a recycle delay of 1 s, resulting in a total experiment time of 34 min.

Reconstruction of Experimental NMR Spectra

Reconstruction with the iterative-soft-thresholding (IST) algorithm was carried out with the istHMS programme (Hyberts et al. 2012) using standard parameters, except that the number of iterations was increased to 1000. The standard parameters were: initial level multiplier, i_mult = 0.98, end level multiplier, e_mult = 0.98, and correction of first point, xc = 0.5. For reconstruction with the SMILE (Ying et al. 2017) algorithm, the function within nmrPipe (Version 2.0 beta Rev 2018.094.15.20 64-bit) was used with standard parameters, except that the noise factor for the signal cutoff (nSigma) was decreased to 3.0 from the default value of 5.0. Running the SMILE algorithm with the default nSigma of 5.0 resulted in poor reconstruction in our hands. The standard parameters used for reconstruction with SMILE were: xP0 = 0.0, xP1 = 0.0, zfCount = 2, xApod = SP, xQ1 = 0.50, xQ2 = 0.98, xQ3 = 1.0, minTDL = 0.25, maxTDL = 4.0, thresh = 0.80, fraction = 1.0. Prior to analysis, the spectra reconstructed with the SMILE algorithm were divided by the downscaling factor provided in the logfile, so that intensities were comparable with the fully sampled spectrum.
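
The final rescaling step can be performed with a few lines of nmrglue; in the sketch below, the file names and the numerical value of the downscaling factor are placeholders to be replaced by the values from the SMILE logfile.

```python
import nmrglue as ng

# Divide a SMILE-reconstructed spectrum by the downscaling factor reported in the
# SMILE logfile, so that intensities are comparable with the fully sampled spectrum.
scale = 8.0                                   # placeholder; take the value from the logfile
dic, data = ng.pipe.read('smile_reconstructed.ft2')
ng.pipe.write('smile_rescaled.ft2', dic, data / scale, overwrite=True)
```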

Reconstruction with the DNN algorithm was performed using a python script. First, the network graph and the parameters were loaded from the output of the training (see above). Subsequently, the reconstructed 1D spectra, one for each 1H frequency point, were generated using the predict function within tensorflow. Finally, the reconstructed spectrum was saved in nmrPipe format using functions within the nmrglue package.

All the reconstructed spectra were Fourier transformed along the 15N dimension using the same window function, a squared-sine window shifted by 0.42π.
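
A condensed sketch of this reconstruction and processing step is given below, assuming a trained Keras model saved during training. The file names, the hypothetical prepare_input() helper, the real/imaginary interleaving of the network output, and the window implementation are all assumptions made for illustration.

```python
import numpy as np
import nmrglue as ng
import tensorflow as tf

# Illustrative sketch: reconstruct the 15N dimension, one 1D spectrum per 1H frequency
# point, then apply the shifted squared-sine window and Fourier transform along 15N.
model = tf.keras.models.load_model('dnn_poisson_12p5.h5')    # placeholder file name
dic, sparse = ng.pipe.read('t4l_sparse_ft1.dat')             # 1H-transformed, sparsely sampled data

# prepare_input() is a hypothetical helper that reshapes the data and supplies the
# sampling-schedule input of Fig. 1a; its details are omitted here.
recon = model.predict(prepare_input(sparse))                 # one reconstructed FID per 1H point
recon = recon[:, 0::2] + 1j * recon[:, 1::2]                 # assumed real/imaginary interleaving

npts = recon.shape[1]
window = np.sin(np.pi * (0.42 + (1.0 - 0.42) * np.arange(npts) / npts)) ** 2
spec = np.fft.fftshift(np.fft.fft(recon * window, axis=1), axes=1)

# Header updates for the transformed 15N dimension are omitted in this sketch.
ng.pipe.write('t4l_dnn_recon.ft2', dic, spec.astype('complex64'), overwrite=True)
```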

Data Analysis

All NMR spectra were processed using nmrPipe (Delaglio et al. 1995) and subsequently analysed using NMRFAM-Sparky (Lee et al. 2015) or in-house written python scripts that utilise functions within the nmrglue, numpy and matplotlib packages. Peak heights were determined using NMRFAM-Sparky and the “Center peak; pc” interpolation function.

Results and Discussion

A DNN Architecture for NUS Reconstruction

Nuclear magnetic resonance spectra have traditionally been recorded on multi-dimensional regular Nyquist grids, with data recorded at equally spaced time points in each of the dimensions. A multi-dimensional discrete Fourier transform is subsequently used to generate the spectrum in the frequency domain for the identification of signals, their positions, intensities, line-widths and phases. However, as the demand for ever higher-dimensional NMR spectra has increased, the methodology of non-uniform, non-linear, and sparse sampling has improved substantially since some of its initial applications (Schmieder et al. 1993). Sparse sampling of NMR spectra is possible because the information contained in the spectrum is often far less than the actual number of data points sampled on the full Nyquist grid. For example, for a one-dimensional NMR spectrum each peak is fully characterised by four parameters: the frequency ν, the linewidth Δν = R2/π, the intensity I, and the phase φ. Therefore, in theory, for a one-dimensional spectrum with n Lorentzian-shaped peaks it is sufficient to sample 4n real time-points, or 2n complex time-points. As the number of dimensions increases, the sparseness of NMR spectra typically increases; that is, a smaller fraction of the full Nyquist grid is required to fully characterise all the cross-peaks present. Whereas data on the regular Nyquist grid are easily processed using the discrete Fourier transform, which is a linear transformation, processing of sparsely sampled NMR spectra is traditionally more demanding. The major challenge has been to extract the spectral parameters from sparse data, because the search space quickly becomes very large and simple minimisation procedures fail due to the highly non-linear nature of the oscillating NMR signals over the non-regular time-domain data matrix.
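
In this notation, a one-dimensional FID containing n Lorentzian signals sampled at times tk takes the standard form

$$s(t_{k}) = \sum_{j=1}^{n} I_{j}\, e^{i\varphi_{j}}\, e^{\left(2\pi i \nu_{j} - R_{2,j}\right) t_{k}}, \qquad k = 0, 1, 2, \ldots$$

so that each signal is defined by the four real parameters {Ij, φj, R2,j, νj}, or equivalently two complex numbers, consistent with the counting above.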

The architecture of the DNN developed here to reconstruct sparsely sampled NMR spectra is shown in Fig. 1. This architecture is designed with inspiration from LSTM networks (Hochreiter and Schmidhuber 1997) that have traditionally been used to analyse time series, for example in finance, handwriting, and speech recognition (Chen and Wang 2017; Graves et al. 2009). In a very recent preprint (Qu et al. 2019) an entirely different DNN architecture, based on densely connected convolutional neural networks (CNN), has been suggested. Whereas the architecture in Fig. 1 has its roots in the analysis of time series, dense CNNs are often used for image processing, noise reduction, and removal of artefacts.

Fig. 1
figure 1

The deep neural network (DNN) architecture and graph developed to reconstruct sparsely sampled NMR spectra. The dimensions of the tensors/vectors are annotated above the arrows. The number of sampled complex points is denoted sp, and the number of complex points of the reconstructed spectrum is denoted np. a The two inputs to the network, the sparse time-domain data and the sampling schedule, are converted to two vectors with dimension 2np. Moreover, ‘F’ denotes a flattening layer and ‘T’ denotes a linear layer with tanh(x) activation and bias (see text). b The modified LSTM cell, where ‘σ’ denotes a linear layer with sigmoidal activation and bias, ‘+’ denotes an elementwise addition layer, and ‘×’ denotes an elementwise multiplication layer. The modified LSTM cell is applied N times. c The final step of the graph, which takes the two outputs of the last modified LSTM cell as input and produces one output, which is the reconstructed time-domain FID. ‘R’ denotes a reshape layer and ‘L’ denotes a linear layer without activation. (See Supporting Material)

For the architecture presented in Fig. 1, the sparsely sampled FID is represented as a 2 × sp matrix, where sp is the number of sampled complex points, with one row each for the real and the imaginary data. Initially, the sparsely sampled FID and the sampling schedule are transformed into two vectors of length 2np, where np is the number of complex points in the fully reconstructed one-dimensional spectrum. This transformation is carried out using linear layers with tanh(x) activation and bias. Specifically, for a linear layer with the activation function a(x) and bias b, the output vector y is calculated from the input vector x as

$$\mathbf{y} = \left\{ y_{1}, y_{2}, \ldots, y_{n} \right\} = \left\{ a\left( z_{1} \right), a\left( z_{2} \right), \ldots, a\left( z_{n} \right) \right\}$$
(1)

where

$${\mathbf{z}} = {\mathbf{Ax}} + {\mathbf{b}}$$
(2)

and A is the parameter-tensor (kernel). Optimisation of the layer consists of optimising the parameters of the bias b and the parameter-tensor A (see Supporting Material).
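
For illustration, the input transformation of Fig. 1a, i.e. the flattening layer ‘F’ followed by the linear layer ‘T’ with tanh(x) activation and bias of Eqs. (1) and (2), could be written with the Keras functional API roughly as follows. The layer sizes correspond to a 12.5% sampled, 256-point spectrum, the encoding of the sampling-schedule input is an assumption, and the modified LSTM cell of Fig. 1b is omitted.

```python
import tensorflow as tf

sp, n_p = 32, 256   # sampled complex points and complex points of the reconstructed spectrum

# The two inputs of Fig. 1a: the sparse time-domain data (2 x sp) and the sampling
# schedule (assumed here to be a length-np indicator vector).
sparse_fid = tf.keras.Input(shape=(2, sp), name='sparse_time_domain')
schedule = tf.keras.Input(shape=(n_p,), name='sampling_schedule')

# 'F': flattening layer; 'T': linear layer with tanh(x) activation and bias (Eqs. 1-2).
fid_vec = tf.keras.layers.Dense(2 * n_p, activation='tanh', use_bias=True)(
    tf.keras.layers.Flatten()(sparse_fid))
sched_vec = tf.keras.layers.Dense(2 * n_p, activation='tanh', use_bias=True)(schedule)

# The two vectors of length 2*np then enter the first modified LSTM cell (Fig. 1b, not shown).
```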

Training the Deep Neural Network

The DNN was trained separately for each sampling schedule on synthetic data. In each run, 8 × 10⁶ fully sampled one-dimensional spectra were generated randomly (see “Methods” section). The input spectra, ‘Sparse time-domain’ in Fig. 1a, used for training the DNN were calculated from the fully sampled synthetic spectra by extracting the points corresponding to the sampling schedule. The cost-function used to optimise the parameters of the DNN was calculated as the average mean-squared deviation between the reconstructed spectra and the fully sampled synthetic spectra. For 12.5% sampled spectra (32/256) the optimisation led to average mean-squared errors of the cross-validation set of 5 × 10⁻⁴, showing that the highly non-linear behaviour of the NMR time-domain spectra is well captured by the deep neural network and the architecture in Fig. 1. Specifically, there is a clear indication that the DNN has indeed ‘learned’ the task of reconstructing the spectra rather than ‘memorising’ and interpolating the training set. For example, for 12 peaks, and only considering peak positions, there are 256¹² ≈ 8 × 10²⁸ possible spectra with a resolution of SW/np. Additionally, the 12 peaks have differential intensities and differential line-widths, and fewer than 12 peaks may be present. Thus, the training set is far from spanning the full set of possible spectra, even though up to 5 × 10⁸ spectra in total have been used for training.

Application to an Experimental NMR Spectrum

A two-dimensional 15N–1H HSQC correlation spectrum of the L99A mutant of the 164-residue protein T4 Lysozyme (T4L L99A) was used to evaluate the performance of the DNN algorithm and to compare it with currently leading algorithms for the reconstruction of sparsely sampled NMR spectra. A spectrum with a large 15N sweep-width (72 ppm) was recorded with 256 complex points in the 15N dimension, such that the backbone 15N–1H correlations as well as the side-chain 15N–1H correlations of arginine and the 15N–1H2 correlations of asparagine and glutamine are observed, Fig. 2a. Also, the T4L L99A protein is in chemical exchange (Mulder et al. 2001), which broadens some of the 15N–1H correlations and thereby leads to an even larger range of spectral parameters being present in the spectrum.

Fig. 2
figure 2

a The fully sampled 15N–1H HSQC spectrum of T4L L99A used to evaluate the performance of the DNN algorithm for reconstruction of sparsely sampled one-dimensional spectra. The peaks between 82 and 88 ppm originate from arginine side-chain 15Nε–1Hε correlations. The green vertical dashed lines show where the one-dimensional spectra in Fig. 3 are extracted from. b Overlays of the fully sampled spectrum (blue) and the three reconstructed spectra (red), DNN, SMILE and IST, created for a part of the spectrum (black dotted box in a). Reconstructed spectra were generated from a sparsely sampled spectrum based on a Poisson-gap sampling schedule with 12.5% of the points sampled (32 out of 256; Table S1). Main differences between the fully sampled spectrum and the reconstructed spectra are indicated with black arrows

The experimental 15N–1H HSQC spectrum was first Fourier transformed in the directly detected 1H dimension. Subsequently, for each point in the 1H frequency dimension, fH, the resulting fully sampled 15N time-domain spectrum, tN(fH), was extracted, and the sparsely sampled 15N time-domain spectrum, sN(fH), was obtained by extracting the points corresponding to the sampling schedule. The reconstructed time-domain spectra, t̃N(fH), were obtained using three different algorithms: (1) the DNN algorithm presented here, (2) the SMILE algorithm (Ying et al. 2017), and (3) the hmsIST algorithm (Hyberts et al. 2012). Finally, the reconstructed spectra, t̃N(fH), obtained for each of the three algorithms, as well as the fully sampled spectrum, tN(fH), were Fourier transformed along the 15N dimension to generate two-dimensional frequency-domain spectra.
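
A short numpy sketch of this extraction and comparison pipeline is given below; full_t1, the hypothetical reconstruct() stand-in for the three algorithms, and the schedule file name are assumptions for illustration only.

```python
import numpy as np

# full_t1  : fully sampled 15N time domain after the 1H Fourier transform,
#            shape (n_1H, 256), complex (e.g. read with nmrglue).
# schedule : indices of the sampled complex t1 points (placeholder file name).
# reconstruct() stands in for any of the three algorithms (DNN, SMILE, hmsIST).
schedule = np.loadtxt('poisson_12p5.sched', dtype=int)

sparse_t1 = full_t1[:, schedule]                                     # sN(fH) for every 1H point
recon_t1 = np.array([reconstruct(s, schedule) for s in sparse_t1])   # reconstructed tN(fH)

spectrum_full = np.fft.fftshift(np.fft.fft(full_t1, axis=1), axes=1)
spectrum_recon = np.fft.fftshift(np.fft.fft(recon_t1, axis=1), axes=1)
```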

A sparsely sampled spectrum was generated using a 12.5% (32 out of 256 complex points) Poisson-gap sampling schedule (Hyberts et al. 2012). Excerpts of the reconstructed spectra obtained using the three different algorithms, DNN, SMILE and IST, are compared with the fully sampled spectrum in Fig. 2b and Figure S1. Overall, all three algorithms provide a good reconstruction with most of the cross-peaks reconstructed with intensities that are similar to the fully sampled spectrum. The spectrum reconstructed using the SMILE algorithm had artefacts for 1H frequencies around 7.3 ppm and some cross-peaks were missing in the reconstructed spectrum.

To provide a more quantitative comparison, slices along the 15N frequency dimension were extracted from the reconstructed spectra and compared to the corresponding slices of the fully sampled spectrum; green vertical lines in Fig. 2a. The first slice, at a 1H frequency of 9.6 ppm (Fig. 3), was chosen because it should intuitively be easy to reconstruct, since only two very isolated and sharp peaks are present. The second slice, at a 1H frequency of 7.4 ppm, should intuitively be more difficult to reconstruct since it contains many peaks with differential line widths. A good reconstruction is obtained for the slice at 9.6 ppm, Fig. 3a, c, although reconstruction with the IST algorithm leads to visible artefacts when a random sampling scheme is used. For the more challenging slice at 7.4 ppm, all three algorithms lead to similar RMSDs between the reconstructed and the fully sampled spectrum, Fig. 3b, when Poisson-gap sampling is used. For the random sampling schedule, Fig. 3d, the DNN algorithm leads to significantly better reconstructions than both IST and SMILE. The fact that the DNN algorithm leads to a significantly better reconstruction for very sparse and random samples is already apparent from a simple visualisation of the reconstructed 2D spectra (Figure S2), where artefacts and extra peaks are observed, particularly in the spectra reconstructed with IST.

Fig. 3
figure 3

Representative one-dimensional 15N slices of reconstructed spectra compared with the corresponding fully sampled spectrum (vertical lines in Fig. 2a). a, c Reconstructed 1D spectra with a 1H frequency of 9.6 ppm and b, d reconstructed 1D spectra with a 1H frequency of 7.4 ppm. e and f show the corresponding fully sampled spectra. Spectra in a and b were reconstructed from a 12.5% Poisson-gap sampling, while spectra in c and d were reconstructed from a 12.5% random sampling (Table S1)

Subsequently, the ability of the different reconstruction algorithms to reproduce peak intensities was quantified. The discrete Fourier transform traditionally used to transform fully sampled spectra is a linear operator as well as an isomorphic transformation. Intensities are therefore represented well in the frequency-domain spectrum. Since the sparse data are sampled on a non-uniform grid, the reconstruction algorithms are inherently non-linear, and a quantification of how well peak intensities are reconstructed is therefore important. For a set of 134 isolated peaks, Figure S3, the peak heights were obtained by interpolation (see “Methods” section). Figure 4 shows a comparison of normalised peak intensities obtained from the fully sampled spectrum versus intensities obtained from spectra reconstructed with the three algorithms from a 12.5% Poisson-gap sampling. In this dataset the DNN algorithm gave a good overall reproduction of peak intensities, with a normalised RMSD of just over 1% and a Pearson correlation of R2 = 0.996 between peak intensities measured in the DNN-reconstructed spectrum and the fully sampled spectrum. This is similar to the Pearson correlation estimated using the densely connected convolutional network (Qu et al. 2019), R2 = 0.992. The SMILE algorithm also reproduced peak intensities well overall, although a handful of peaks were rather poorly reconstructed, leading to an overall RMSD of about 3%. The IST algorithm also gave a good reproduction of peak intensities, however, with substantially better reconstruction of the more intense peaks and slightly worse reconstruction of weaker peaks, which led to an overall RMSD of about 2%.
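
These intensity metrics can be summarised in a few lines of numpy, as sketched below; the peak-height arrays and the normalisation by the largest peak are assumptions about conventions that are not fully specified above.

```python
import numpy as np

def intensity_metrics(i_full, i_recon):
    """Normalised RMSD (in %) and Pearson R^2 between two sets of peak heights."""
    i_full = np.asarray(i_full) / np.max(i_full)     # assumed normalisation by the largest peak
    i_recon = np.asarray(i_recon) / np.max(i_recon)
    rmsd = 100.0 * np.sqrt(np.mean((i_recon - i_full) ** 2))
    r2 = np.corrcoef(i_full, i_recon)[0, 1] ** 2
    return rmsd, r2
```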

Fig. 4
figure 4

Comparison of normalised peak intensities obtained from the fully sampled spectrum (abscissa) with those obtained from reconstructed spectra (ordinate). The comparisons are shown for a reconstruction using the DNN algorithm, b reconstruction using the SMILE algorithm, and c reconstruction using the IST algorithm. The dashed black line corresponds to y = x and R2 is the Pearson coefficient of linear correlation. All reconstructions were carried out on a 12.5% Poisson-gap sampled spectrum (Table S1)

Three types of sampling schedules were evaluated: (1) a 12.5% (32/256) random sampling, (2) a 12.5% Poisson-gap sampling, and (3) an 18.75% (48/256) Poisson-gap sampling. Three individual sampling schedules were randomly generated for each type, leading to a total of nine sampling schedules, Table S1. For each of the three types of sampling schedules, the two metrics described above and illustrated in Figs. 3 and 4 were used to quantify the overall quality of the reconstruction. Firstly, the RMSD between the reconstructed spectra and the fully sampled spectrum was calculated for each of the 1H frequency points, as in Fig. 3, and the average RMSD over 1H frequencies between 6.5 and 9.6 ppm, 〈RMSD〉spec, was reported. Secondly, the normalised peak intensities from the reconstructed spectra were compared to those obtained from the fully sampled spectrum, as in Fig. 4.
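
A sketch of the first metric, 〈RMSD〉spec, is given below; the orientation of the 2D arrays, the ppm axis array, and the normalisation of the spectra are assumptions, and the conversion from ppm to array indices is omitted.

```python
import numpy as np

def rmsd_spec(full_spec, recon_spec, h_ppm, ppm_min=6.5, ppm_max=9.6):
    """Average RMSD between reconstructed and fully sampled 15N slices, <RMSD>_spec.

    full_spec, recon_spec : real 2D arrays, one 15N slice per 1H frequency point.
    h_ppm                 : 1H chemical shift (ppm) of each row.
    """
    mask = (h_ppm >= ppm_min) & (h_ppm <= ppm_max)       # select the 1H region of interest
    diff = recon_spec[mask] - full_spec[mask]
    return np.mean(np.sqrt(np.mean(diff ** 2, axis=1)))  # RMSD per slice, then average
```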

From the summary in Fig. 5 it is apparent that the DNN algorithm generally leads to better reconstructions for the T4L L99A 15N–1H spectrum, both in terms of the RMSD between the reconstructed spectrum and the fully sampled spectrum and in terms of reproducing peak intensities. This is particularly the case for the more sparse data (12.5%) and for random sampling schedules. Reconstruction using the IST algorithm improves substantially when a Poisson-gap schedule is used, as also pointed out previously (Hyberts et al. 2012).

Fig. 5
figure 5

Summary of reconstruction of sparsely sampled one-dimensional NMR spectra. a The average normalised RMSD between the fully sampled frequency-domain spectrum and the reconstructed spectrum, calculated for 1H frequencies between 6.5 and 9.6 ppm. b The RMSD between normalised peak intensities obtained from the fully sampled spectrum and the reconstructed spectra. The vertical error-bars show the standard deviation over three reconstructions

Conclusion

Reconstruction of sparse and non-uniformly sampled NMR spectra is becoming increasingly important as the demand for fast acquisition and ultra-high-dimensional spectra grows. A strategy to reconstruct sparsely sampled NMR spectra using deep neural networks was presented. The proposed strategy employs a new network architecture that is based on LSTM layers, which are frequently used in the analysis of time series. Optimisation of the neural network on a standard desktop computer allowed for excellent reconstruction of sparsely sampled one-dimensional experimental NMR spectra, at a level that was as good as, or slightly better than, current algorithms. The time required for reconstruction with the presented neural network is similar to reconstruction times for traditional methods (Hyberts et al. 2012; Ying et al. 2017), albeit longer than for an alternative strategy presented very recently (Qu et al. 2019). It is important to stress that in this study deep neural networks were used to reconstruct only one-dimensional spectra; however, the presented strategy shows an avenue for employing deep neural networks to more generally analyse and reconstruct sparsely sampled spectra. It is envisaged that with further exploration of deep network architectures and optimisations, accurate reconstructions of high-dimensional NMR spectra will become possible using deep learning and artificial intelligence.