Stretched non-negative matrix factorization

Gu, Ran; Rakita, Yevgeny; Lan, Ling; Thatcher, Zach; Kamm, Gabrielle E.; O’Nolan, Daniel; Mcbride, Brennan; Wustrow, Allison; Neilson, James R.; Chapman, Karena W.; Du, Qiang; Billinge, Simon J. L.

doi:10.1038/s41524-024-01377-5

Stretched non-negative matrix factorization

Article
Open access
Published: 27 August 2024

Volume 10, article number 193, (2024)
Cite this article

Download PDF

You have full access to this open access article

npj Computational Materials

Stretched non-negative matrix factorization

Download PDF

502 Accesses
1 Altmetric
Explore all metrics

Abstract

A novel algorithm, stretchedNMF, is introduced for non-negative matrix factorization (NMF), accounting for signal stretching along the independent variable’s axis. It addresses signal variability caused by stretching, proving beneficial for analyzing data such as powder diffraction at varying temperatures. This approach provides a more meaningful decomposition, particularly when the component signals resemble those from chemical components in the sample. The stretchedNMF model introduces a stretching factor to accommodate signal expansion, solved using discretization and Block Coordinate Descent algorithms. Initial experimental results indicate that the stretchedNMF model outperforms conventional NMF for datasets exhibiting such expansion. An enhanced version, sparse-stretchedNMF, optimized for powder diffraction data from crystalline materials, leverages signal sparsity for accurate extraction, especially with small stretches. Experimental results showcase its effectiveness in analyzing diffraction data, including success in real-time chemical reaction experiments.

Efficient and generalized processing of multidimensional NUS NMR data: the NESTA algorithm and comparison of regularization terms

Article 26 March 2015

Off-diagonal symmetric nonnegative matrix factorization

Article 04 February 2021

Deep data analysis via physically constrained linear unmixing: universal framework, domain examples, and a community-wide platform

Article Open access 30 April 2018

Introduction

Non-negative matrix factorization (NMF) is an unsupervised machine learning method used for decomposing compressed data. NMF extracts distinct components from related signal sets in various research fields, including signal processing¹, biomedical engineering², pattern recognition³, image engineering⁴, and so on. NMF differs from principle component analysis (PCA)⁵ by applying positivity constraints on the extracted components and their weights. It is then attractive for attempting to find components that resemble physical signals in the case where the positivity constraints are expected to hold. In crystallography, NMF has demonstrated significant potential in finding physically plausible structural signals from diffraction data collected from in situ chemical reactions^6,7,8. Recently, NMF has also been used for in situ time-dependent diffraction measurements^9,10 and spatially resolved electron diffraction maps¹¹, single-layer nanosheets¹², integrated multimodal analysis¹³, and metal-organic frameworks^14,15.

In many scientific fields, including chemistry and materials science, the need to analyze data exhibiting stretching phenomena is paramount. For instance, in temperature series experiments, the stretching of peak positions in diffraction patterns or atomic pair distribution function (PDF) data can provide crucial insights into structural changes induced by varying temperatures. Conventional NMF, constrained by the assumption of fixed components, struggles to effectively model such stretching behaviors, resulting in difficulties for researchers to use the extracted mathematical components for subsequent component analysis and structural identification.

To address this limitation, extended NMF models have been proposed. One such model is the Shifted NMF, which accounts for shifts in the onset of a frequency profile, which can be induced by the Doppler effect for spectrometry data¹⁶. However, Shifted NMF is not able to solve the temperature series data problem because the change in the component is a stretch, not a shift. Another approach is to incorporate stretching regression steps into the analysis workflow¹⁷. There is also a method based on statistical information to obtain stretching and align peak positions, combined with PCA to decompose X-ray diffraction (XRD) data¹⁸. Despite these improvements, the NMF algorithm that considers stretching, which is the most direct and effective method, still remains a gap.

In this paper, we propose a new extended NMF model called stretchedNMF to explore a more fundamental aspect of the algorithm itself. We introduce a stretching factor matrix to describe the stretching scales of each component and each component is allowed to have different entire stretching factors at different moments. stretchedNMF can be developed to account for a simple stretching of the measured signal and returns only components that explain variability beyond this stretching.

In this paper, we first develop the mathematical formulas of stretchedNMF in the form of functional optimization. We present the method of discretization and the optimization algorithm. An enhanced version, sparse-stretchedNMF, optimized for powder diffraction data from crystalline materials, leverages signal sparsity for accurate extraction, especially with small stretches. Then using both simulated and real data, we show that stretchedNMF and sparse-stretchedNMF significantly outperforms conventional NMF in the case of diffraction data with thermal expansion. Furthermore, we show that the algorithm may be used to extract different chemical components from the data if there are multiple components that have differential thermal expansivities. This gives an interesting possibility for extracting the components in a multi-phase sample from a temperature dependent measurement of that sample, even when those components are not changing chemically during the measurement. Although we focused on diffraction signals from temperature series data, the algorithm may be used for any case where part of the changes to the signal are exactly, or approximately, a stretch of its dependent variable.

Results

Diffraction use case

Here we test the approach using simulated and also real x-ray powder diffraction (PXRD) data^19,20, and atomic pair distribution function (PDF)²¹ data. PXRD and PDF patterns are continuous 1D signals that encode the 3D arrangement of atoms in a material. PDF data represents the probability distribution of atomic distances, while XRD patterns exist in reciprocal space, meaning that the positions and intensities of its diffraction peaks vary with diffraction angle. We assume a situation where the PDF and PXRD patterns have been measured for samples as a function of temperature and are undergoing thermal expansion, where the thermal expansion coefficient of each phase is different. The thermal expansion causes Bragg peaks in the PXRD, and peaks in the PDF, to change their positions. In principle, thermal expansion can be different along different directions of the crystal, but often it is quite isotropic and appears as a stretching of the pattern where the peak shifts increase with increasing distance along the independent variable axis as required for this algorithm to work. This makes it an interesting use-case for stretchedNMF, though we note that the stretchedNMF may be applied to any series of signals where one aspect of the variability is a continuous stretching on the axis of the independent variable.

An intuitive explanation regarding thermal expansion and the stretching of PDF data is that under thermal expansion, if the overall atomic structure of a material enlarges, then all inter-atomic distances become a multiple a of the original distances, which is reflected in the PDF data as multiplying the horizontal positions of all PDF peaks by a. XRD data functions in reciprocal space. Theoretically, there exists a Fourier transform between the PDF and XRD. Therefore, the stretching factor a in PDF data corresponds to the reciprocal of the stretching factor, 1/a, in XRD data for the same material.

The goal of our testing use-case is to see if we can use NMF in general, but stretchedNMF in particular, to separate the chemical components in a binary chemical mixture where the two components have different thermal expansion coefficients. For example, this could be used by a material scientist to discover the chemical components in a synthesis product by measuring the mixture as a function of temperature and running stretchedNMF on the mixture, where the algorithm returns mathematical components that resemble the PXRD or PDF signals of the actual chemical components. These mathematical components could then be given to algorithms such as the structureMining²² or spacegroupMining²³ algorithms that are implemented as a service on the PDFitc.org website²⁴. These algorithms, given an uploaded PDF, will return a rank-ordered list of candidate structures consistent with that PDF. Our test of the algorithm will, therefore, consist of taking either simulated or actual measured data over a wide temperature range from binary mixtures where the components have different thermal expansion coefficients. These signals will be fed to NMF and stretchedNMF to extract two components which will then be analyzed to see if they resemble the signals from the actual chemical components. In the case of the PDF, an interesting test of this is to take the extracted mathematical components and giving them to the structureMining algorithm to see if it correctly identifies the chemical component from the stretchedNMF and conventional NMF extraction.

Simulated data

To evaluate the performance of the stretchedNMF and sparse-stretchedNMF algorithms, we test them on the simulated datasets. The algorithms are designed to be applied to any signal undergoing stretching and we want to understand their performance compared to conventional NMF algorithms. We could choose any simulated signals to do this, but since a primary motivation for the development was to decompose temperature-dependent powder diffraction data we have simulated powder diffraction patterns and PDFs. However, since we are interested in testing the performance when stretches are large and when they are small we simulate data with stretches that are not realistic for real powder data, as well as some more realistic examples later.

We simulate signals by computing the PXRD and the PDF patterns of a weighted sum of a cubic perovskite BaTiO₃ and a cubic zinc-blende ZnSe phase. To ensure that both signals contribute comparably we chose an atomic concentration ratio of 0.61:1.00 when initial lattice parameters of BaTiO₃=4.18 Å and ZnSe=5.62 Å were used. The crystal structures used for BaTiO₃ and ZnSe were from structures reported in²⁵ and²⁶, respectively, and downloaded from the Springer Materials database (https://materials.springer.com/isp/crystallographic/docs/sd_0304044, https://materials.springer.com/isp/crystallographic/docs/sd_1929775).

In this example we considered that the component signals stretched at ratio of rates of α_BTO/α_ZS = 2 across the series of signals, with a further assumption that α does not vary with temperature. Although we are not initially using realistic thermal expansions of these materials, this is the approximate ratio of expansivities of these two materials^27,28. In detail, we simulated thermal expansions for α_BTO: α_ZS to be 20:10, 4:2, and 2:1.

We note that to test purely the effects of stretching, which is the basis of the current NMF modification, we fixed and did not vary the atomic displacement parameters (ADPs) that would result in changes in the attenuation of the PXRD Bragg peaks and broadening of peaks in the simulated PDF. Such effects are likely to be present in real data and may require a further modification to the NMF algorithm in the future but this is beyond the scope of the current paper. This set of simulations assumed no phase transition or chemical reaction to be occurring and the relative weights of the components were not varied in the computed dataset.

The PXRD patterns were simulated using Dans-Diffraction²⁹. Pseudo-Voight lineshapes were used. The PDFs were simulated using Diffpy-CMI³⁰. The code used to generate the PDFs can be found at https://github.com/yevgenyr/diffpysim. The static set of parameters used for the simulations is reproduced in Table 1. Representative PXRD and PDF patterns are shown in Fig. 1.

Table 1 The static set of parameters that were used for PXRD and PDF simulations

Full size table

**Fig. 1: Example simulated signals used in the tests.**

Results on Simulated PDF

First, we compare the performance of the conventional NMF and the stretchedNMF on simulated PDF data. The PDFs were generated by a combination of two components, namely simulated BaTiO₃ and ZnSe. The weight coefficients for each component were set as constants. We also assigned different linearly increasing rates for the thermal expansion of BaTiO₃ and ZnSe. Specifically, we used artificially generated rates such that BaTiO₃ and ZnSe linearly expands from the first PDF to the last with 20% and 10% expansions, respectively.

We then applied the conventional NMF and stretchedNMF methods to extract two components from the simulated PDF data. These could then be compared with the ground-truth PDFs. In principal, any of the ground-truth PDFs could be picked as we apply a stretching factor to the NMF component signal before the comparison. In this study, we selected the ground-truth PDF that resulted in the minimal residual when using only the scale factor variable. We further optimized the agreement between the NMF component and the selected ground-truth PDF by varying both the scale-factor and stretch factor variables.

We first evaluated the outcomes of the conventional NMF approach. These findings are illustrated in Fig. 2a–d and Table 2.

**Fig. 2: Comparison of NMF-extracted PDF signals (red) and ground truth PDFs (blue).**

Table 2 Results of the comparison between the NMF extracted components and the ground-truth PDFs on simulated PDF data test with 20% and 10% expansions

Full size table

Figure 2 (a–d) depicts the resulting PDFs in a matrix layout, with the NMF extracted components being represented as rows (in red) and the ground-truth PDFs as columns (in blue). The difference curves (ground-truth - NMF component) are plotted below in green. Large residuals and large R_w factors are evident between all the NMF components and the ground-truth curves and the NMF extraction has failed to produce components that resemble the actual signals. This is not surprising since the weights of the two components are not varying in the test.

The same test was applied using the stretchedNMF algorithm and the results are shown in Fig. 2e–h and Table 2. In this case we can see that the stretchedNMF extracted signal I is closely related to the ground truth component I and likewise for the component II. This is evident as a very flat difference curve in Fig. 2e, h and small R_w for these pairings in Table 2.

This shows that even in the absence of changes in component weights the stretchedNMF algorithm can extract components just from a differential stretching of the structure signal.

Results on simulated PXRD

We carry out the same comparison of NMF vs stretchedNMF for the case of powder diffraction signals. Similar to the simulated PDF case, the data comprise of a combination of simulated BaTiO₃ and ZnSe, where BaTiO₃ and ZnSe have 20% and 10% linearly varying expansions, respectively.

The results of the comparison are presented in Fig. 3 and Table 3.

**Fig. 3: NMF and stretchedNMF solutions on simulated PXRD data.**

Table 3 Comparison between the NMF extracted components and the ground-truth PXRDs on the simulated PXRD data set with 20% and 10% expansions

Full size table

As is evident in Fig. 3a–d, none of the extracted conventional NMF components resemble ground-truth curves. Again, this is not a surprise because the weights of the components are not changing. However, for the stretchedNMF extraction we see that the first extracted component (Comp I) corresponds well the BaTiO₃ pattern (Truth I), and the second extracted component (Comp II) corresponds well to the ZnSe diffraction pattern (Truth II) (Fig. 3e, h).

As with the simulated PDF data, the stretchedNMF algorithm can extract components resembling the physical signals from a phase mixture where the weights are not changing but there is a variable thermal expansion.

Results on simulated PDF and PXRD data with small expansion coefficients

The tests above show that even in the presence of large stretches of signals stretchedNMF can automatically extract signals that resemble real physical signals whereas conventional NMF cannot, at least in the case where the component weights are not changing.

We now would like to see how well stretchedNMF can perform for smaller stretching factors, for example, for magnitudes that might occur in physical systems due to thermal expansion. The simulated data is still taken as the combination of BaTiO₃ and ZnSe. The weights are set to constants as before. However, in this example we set the thermal expansion rates of BaTiO₃ and ZnSe to 4% and 2%, respectively. Both simulated PDF and PXRD are tested.

First, we compare the performance of the conventional NMF and the stretchedNMF on simulated PDF data. The results are presented in Table 4 and Fig. 4.

Table 4 Results of the comparison between the NMF extracted components and the ground-truth PDFs on simulated PDF data sets with 2% expansions on ZnSe

Full size table

**Fig. 4: NMF and STRETCHEDNMF solutions on simulated PDF data with 2% expansions on ZnSe.**

Unlike the previous figures we just plot the agreement of the extracted component and the ground-truth curve that shows the best agreement. The poor performance of the conventional NMF is evident in Fig. 4a, b, whereas again, even for this much smaller stretch, the stretchedNMF algorithm still gives a good extraction of the physical components (Fig. 4c, d).

We get the same overall result for the test on simulated PXRD data as for the PDF data. The results are shown in Fig. 5 and Table 5. Again, stretchedNMF gives a very good extraction of the physical components even for this small relative expansion coefficient (Fig. 5c, d) whereas conventional NMF does not (Fig. 5a, b)

**Fig. 5: NMF and STRETCHEDNMF solutions on simulated PXRD data with 2% expansions on ZnSe.**

Table 5 Results of the comparison between the NMF extracted components and the ground-truth PXRDs on simulated PXRD data test with 2% expansions on ZnSe

Full size table

The results are less ideal when the expansion rates are reduced further to BaTiO₃ and ZnSe changing linearly from 1 to 1.02 and 1.01, respectively. The results are summarized in Table 6 and Fig. 6.

Table 6 Results of the comparison between the NMF extracted components and the ground-truth PXRDs on simulated PXRD data test with a 1% differential expansion between the components

Full size table

**Fig. 6: Comparison of NMF-extracted solutions on simulated PXRD data with 1% expansions on ZnSe.**

At this level of expansion, even the stretchedNMF is not correctly extracting the physical components. For example, it incorrectly assigns peaks in the spectrum of its extracted components in red at around Q = 1.5, 2, and 2.5 Å⁻¹ (Fig. 6c, d). These same peaks are partially misassigned by the conventional NMF algorithm.

However, the sparse-stretchedNMF algorithm does a good job of extracting physical components from the powder PXRD simulations (Fig. 6e and f) even in this challenging case with a relatively small (1%) differential expansion. The components of sparse-stretchedNMF are close to ground truths. This indicates that sparse-stretchedNMF can enhance the performance of stretchedNMF.

These tests show that the stretchedNMF algorithm is able to extract physically meaningful PDF and PXRD signals from sets of data where the signals are unchanged except for a different relative stretch between the two curves. If there is a large differential change in lattice parameter across the dataset stretchedNMF can still extract ground-truth PDF and PXRD signals. For relative stretches of a few percent, comparable to what might be expected for a mixture of compounds with a differential thermal expansion, this is also true for both PDF and PXRD data. When the differential thermal expansion gets to around 1%, stretchedNMF starts to struggle to extract physical components. However, for PXRD data the sparse-stretchedNMF algorithm still performs well. We note that the PDF data is not sparse, and therefore sparse-stretchedNMF algorithm is applied only to PXRD data.

We should note that in these ground-truth tests on simulated data we wanted to test how well stretchedNMF can handle datasets that contain stretches, for example, as might come from thermal expansion. We, therefore, did not include in the simulation other effects of temperature changes, such as increases in atomic displacement factors (ADPs). In principle, we would like to develop a new algorithm that can eliminate changes in ADP in the same way as stretchedNMF eliminates stretches. This problem will be left for a future paper. Preliminary tests on simulated data with combined stretching and increased-ADP effects indicate that stretchedNMF and sparse-stretchedNMF still perform reasonably well and clearly outperform the conventional NMF algorithm, but with larger errors than in the constant-ADP tests reported here. Despite this known shortcoming, we would still like to see whether stretchedNMF and sparse-stretchedNMF can perform well on experimental data from a variable temperature experiment, and this is discussed in the following section.

Experimental PXRD - thermal expansion

To test the stretchedNMF and sparse-stretchedNMF algorithms on real data, we use part of an in situ solid-state synthesis reaction dataset where no phase-transition or chemical reaction occurred but which spanned a rather broad temperature range. This allows us to evaluate how the algorithms perform for the effect of thermal expansion of a phase mixture from real data. The PXRD experiment was done at the 28-ID-2 beamline (XPD instrument) at the NSLS-II facility at Brookhaven National Laboratory. A large area 2D Perkin Elmer detector was used to acquire the data. To gain high spectral resolution in the PXRD, the distance between the sample and the detector was set to 144 cm. The beam wavelength was 0.1949 Å.

A stoichiometric mixture of 2:1 YOCl (>98% tetragonal phase) and MgMn₂O₄ (spinel phase) was uniformly mixed and sealed in a quartz capillary. It was then heated in a gradient furnance, meaning that each location on the quartz tube had a different temperature³¹. The absolute temperatures at each point along the sample were calibrated from the lattice expansion of a known calibration material, Ni. The data went from a low temperature of 368^∘C to a highest temperature of 668^∘C with a total of 20 individual temperature points. Using ‘pyFAI’³², the collected 2D diffraction patterns were then cleaned by masking the beam-stop and over-bright/dead pixels, followed by an azimuthal integration to gain 1D PXRD patterns. The 1D PXRD data was then used as inputs to the different NMF algorithms.

Multi-component Rietveld refinements were carried out and indicate that the chemical components in this reaction are MgMn₂O₄, orthorhombic YMnO₃, and rhombohedral and tetragonal YOCl (rYOCl and tYOCl, respectively) where MgMn₂O₄ and tYOCl are the dominant phases. The results of the Rietveld refinements for the two majority phases were used as ground truth against which to compare the performance of the NMF algorithms.

The results are shown in Fig. 7 and the resulting R_w and PC are listed in Table 7.

**Fig. 7: Solutions on real PXRD data using conventional NMF, stretchedNMF, and sparse-stretchedNMF.**

Table 7 Results of the comparison between the NMF extracted components and the ground truth from Rietveld refinement on real PXRD data test

Full size table

In Fig. 7, the blue curves in the top row (a, b, c) are from the diffraction pattern of MgMn₂O₄ and the blue curves in the bottom row (d, e, f) are from tYOCl. The red curves in each panel show the relevant extracted component from the NMF algorithm used. The columns are sorted by the NMF algorithm used. The first column (a, d) used regular NMF, the second (b, e) used the stretchedNMF algorithm, and the third column (c, f) used the sparse-stretchedNMF algorithm.

All NMF solvers give reasonable results for the tYOCl chemical component. The peak positions are consistent with the ground truth, and the inconsistency of intensity is acceptable. But for MgMn₂O₄, the NMF and stretchedNMF derived components are poor. They are much better using the sparse-stretchedNMF algorithm, which gives better agreement both visually and in terms of the R_w between the ground-truth and the extracted components. For this case, from the perspective of separation ability, sparse-stretchedNMF is superior to stretchedNMF which is superior to the conventional NMF in this test.

The scaled weights from all NMF solvers are compared to the weights from Rietveld refinement which can be considered as ground-truth. The results are shown in Fig. 8.

**Fig. 8: Weights comparison on real PXRD data.**

The weights of the chemical components do no show stark changes, but only fluctuate about 5% around a constant average and so we would expect the weights to be largely independent of temperature. The conventional NMF clearly does not return constant weights and is getting confused by the thermal expansion in the data. The stretchedNMF and sparse-stretchedNMF methods do yield almost constant weights. Rietveld refinements were carried out on these data-sets and can be treated as a ground-truth. The results of the Rietveld refinement are shown as the dashed curve. stretchedNMF is doing quite well, but sparse-stretchedNMF is doing very well in reproducing the results of the Rietveld refinement.

Experimental PXRD - thermal expansion and reaction

We also tested the NMF algorithms on another PXRD dataset, but this time, where a solid-state chemical reaction happened together with the thermal expansion so that the weights of the components as well as the thermal expansion were varying during the experiment. The data were measured as the temperature changed from 28 C to 370 C in 215 steps during the reaction of

$${{\rm{CuCl}}}_{2}+{{\rm{Na}}}_{2}{{\rm{Se}}}_{2}\longrightarrow {{\rm{CuSe}}}_{2}+2{\rm{NaCl}}.$$

Here the components involved in the reaction are NaCl, CuSe, Cu₂Se, Se, pyrite, and marcasite, as determined by a multi-phase Rietveld refinement³³ on the full dataset carried out previously, where Rietveld refinement is a process of local optimization of a structural model to give the best agreement of between calculated and measured PXRD patterns. The full details of experiment are published in ref. ³⁴.

The top panel in Fig. 9 depicts the measured PXRD data obtained during the in situ reaction experiment. At a position of 2.2 Å⁻¹, a peak exhibiting an expansion coefficient of approximately 2% is observed. This expansion coefficient closely matches the 2% expansion coefficient we set in our tested simulations in section “Results on simulated PDF and PXRD data with small expansion coefficients”.

**Fig. 9: The upper subplot shows the 215 experimental raw data PXRD curves, offset for clarity.**

The curves obtained by a multi-phase Rietveld refinement fit³⁴ are shown in blue in the panels below. The Rietveld refined phase weights are shown in blue in the right hand column below³⁴. The components extracted from a sparse-stretchedNMF decomposition are shown in red, plotted on top of the ground-truth components, and the extracted weights are shown in red on top of the Rietveld extracted weights in the right hand column. The results are very good and indicate that, except for Se, the components obtained from sparse-stretchedNMF matched well with the ground truth in terms of peak positions, as do the extracted weights. The majority of the extracted weights exhibit the same increasing or decreasing trends as the weights obtained from Rietveld refinements, where except for NaCl and CuSe that show some stronger deviations, four of the components show almost a perfect match between the SNMF weights and those obtained by Rietveld refinements.

This shows that the sparse-stretchedNMF algorithm can be used as a rapid way to extract reliable components and weights from data collected at different temperatures. This approach can be very helpful looking at large amounts of data very rapidly as it is being collected to look for known phases and unknown phases without having to carry out a complex multicomponent Rietveld campaign in real time.

Discussion

This paper presents a novel functional optimization model called stretchedNMF, which is an extension to the traditional NMF model. The initial experimental results indicate that for data where stretches in the signal are observed, such as diffraction data where thermal expansion has taken place, the proposed stretchedNMF model outperforms the conventional NMF. This is true even for PXRD and PDF data with small stretching degrees corresponding to realistic thermal expansivities. However, a further enhancement to stretchedNMF, which makes use of the sparsity of powder diffraction patterns, called sparse-stretchedNMF allows correct extractions even for very small stretches where stretchedNMF struggled.

Assumptions of the algorithm are that signal stretches are uniform on all features in the signal, as would be the case for an isotropic thermal expansion. Strictly speaking, the algorithm is not valid for the case of anisotropic expansions. However, it can be expected to perform better than conventional NMF even when there is anisotropic thermal expansion, especially if it is small. For example, in Fig. 9, we find a very good match to the Rietveld refinements of Marcasite, which is orthorhombic. More work is needed to establish the robustness of the algorithm in these cases. However, we note that it is not the goal of the work to replace a fully quantitative model dependent analysis of the data, such as a multi-phase Rietveld refinement, but to give a useful rapid, model independent, assessment of large sets of data that can help guide any later model dependent analysis.

Numerous factors influence the anisotropic expansion of materials, with symmetry playing a crucial role. Lower symmetry crystal systems are more prone to displaying significant anisotropies during expansion. Moreover, materials with low dimensionality, like layered materials, are highly inclined towards exhibiting anisotropic expansion behavior. A deeper study into the significance of symmetry, especially regarding its influence on anisotropic expansion in low-symmetry crystals and layered materials will be the topc of future work.

At the algorithmic level, incorporating additional transformations beyond stretching may help explore the anisotropic expansion further. We note that the current model only considers stretching, adding shift transforms it into a first-order polynomial transformation. In this case, only a new block is added to the computation, but a better approximation can be obtained. Incorporating higher-order polynomial transformations could further balance the computational and approximative accuracy of the model. Further research is needed to investigate and optimize the stretchedNMF model’s potential in overcoming these challenges.

We also note that experimental noise can affect the outcome. This has not been studied in detail in this paper, but we note that we obtained good results from real data that included noise. To further address the noise issue, different regularization techniques can be utilized.

Finally, we note that although the motivation for the development, and all the tests, were on diffraction data where underlying structures have undergone thermal expansion, the stretchedNMF algorithm will work on any signal decomposition that smooth continuous variations in a stretching fact as a characteristic of the signal and it is not limited to use on diffraction data.

Methods

Stretched non-negative matrix factorization

Non-negative matrix factorization (NMF) is a mathematical tool to approximate a given matrix $Z\in {{\mathbb{R}}}^{N\times M}$ by the product of two low-rank non-negative matrices,

$$Z\,\approx\, XY,$$

(1)

where $X\in {{\mathbb{R}}}^{N\times K}$ and $Y\in {{\mathbb{R}}}^{K\times M}$, and K ≪ N, M³⁵. Its description and use are described in detail in multiple places^36,37. The common NMF model uses the square of Euclidian distance (SED) as the objective function, and the corresponding optimization problem is written as

$$\begin{array}{rcl}&\mathop{\min }\limits_{X\in {{\mathbb{R}}}^{N\times K},Y\in {{\mathbb{R}}}^{K\times M}}&\frac{1}{2}{\left\Vert XY-Z\right\Vert }_{F}^{2},\\ &\,\text{s.t.}\,&X\ge 0\,\,\text{and}\,\,Y\ge 0.\end{array}$$

(2)

Similar to principal component analysis³⁸, the NMF decomposition will find components that explain variability in the signals in the set of data. Unlike PCA, a constraint of positivity is applied to both the components and the weights. Since many real physical signals, and their weights, obey positivity, NMF is more likely to find components that resemble signals from different physical components contributing to a compound signal coming from multiple sources. As such, it is finding extensive use in scientific applications^9,39,40.

Here we address a situation where one aspect of the variability, a stretching of the signal on the axis of its independent variable, is not of scientific interest, for example, due to the thermal expansion of a material affecting its diffraction pattern. We formulate an approach named stretchedNMF which extends the conventional NMF decomposition whilst accounting for the stretching in the algorithm.

Suppose the experimental signals, which are columns of Z, z^m for m = 1…M, and the components, which are columns in X, x_k for k = 1…K, are continuous functions of an independent variable r. Then the conventional NMF optimization problem may be written as

$$\min\limits_{y_k^m\geq 0,x_k\geq 0} \quad {\sum\limits_{m=1}^M} \left\|\sum\limits_{k=1}^K y_k^m x_k(r)-z^m(r) \right\|_{L_2}^2,$$

(3)

where ${y}_{k}^{m}$ is the weight of the kth component at the mth position in the dataset. Now, we assume that there is an m-dependent stretching of the signal along the r axis. The component signals stretch with component-dependent rates that we capture in a stretching factor, ${\{{a}_{k}^{m}\}}_{m = 1,\ldots ,M}$. We add the stretching factors ${a}_{k}^{m}$ into Eq. (3) and the optimization problem becomes

$$\min\limits_{a_k^m\geq 0,y_k^m\geq 0,x_k\geq 0} \quad{\sum\limits_{m=1}^M} \left\|\sum\limits_{k=1}^K y_k^m x_k\left(r/a_k^m\right)-z^m(r) \right \|_{L_2}^2.$$

(4)

Notice that if ${a}_{k}^{m} \,>\, 1,{x}_{k}$ is stretched, and if ${a}_{k}^{m} \,<\, 1,{x}_{k}$ is compressed. In practice, we consider a finite r range $[0,{r}_{\max }]$. Therefore, without loss of generality, we define x_k(r) = 0 for $r\ge {r}_{\max }$. Thus, when ${a}_{k}^{m} \,>\, 1,{x}_{k}(r/{a}_{k}^{m})=0$ for $r\ge {r}_{\max }/{a}_{k}^{m}$. Now we are able to expand the L₂ norm in Eq. (4) as an integral over the r range as

$$\begin{array}{rcl}&\mathop{\min }\limits_{{a}_{k}^{m}\ge 0,{y}_{k}^{m}\ge 0,{x}_{k}\ge 0}&\mathop{\sum}\limits_{m = 1}^{M}\mathop{\int}\nolimits_{\!0}^{{r}_{\max }}{\left(\mathop{\sum}\limits_{k = 1}^{K}{y}_{k}^{m}{x}_{k}(r/{a}_{k}^{m})-{z}^{m}(r)\right)}^{2}dr,\\ &\,\text{s.t.}\,&{x}_{k}(r)=0,\,\text{if}\,\,r\ge {r}_{\max }.\end{array}$$

(5)

For fixed component $k,{\{{a}_{k}^{m}\}}_{m = 1,\cdots ,M}$ is a series of stretching factors, which usually change smoothly with time m. However, the optimization problem in Eq. (5) is non-convex, and hence the smoothness of ${\{{a}_{k}^{m}\}}_{m = 1,\cdots ,M}$ may be violated when we solve it numerically. Therefore, we add a regularization term to the objective function to make it favor smooth a_k, i.e.,

$$\begin{array}{rcl}&\mathop{\min }\limits_{{a}_{k}^{m}\ge 0,{y}_{k}^{m}\ge 0,{x}_{k}\ge 0}&\mathop{\sum}\limits_{m = 1}^{M}\mathop{\int}\nolimits_{\!0}^{{r}_{\max }}{\left(\mathop{\sum}\limits_{k = 1}^{K}{y}_{k}^{m}{x}_{k}(r/{a}_{k}^{m})-{z}^{m}(r)\right)}^{2}dr\\ &&+\rho \mathop{\sum}\limits_{k = 1}^{K}\mathop{\sum}\limits_{m = 1}^{M-2}{({a}_{k}^{m}-2{a}_{k}^{m+1}+{a}_{k}^{m+2})}^{2},\\ &\,\text{s.t.}\,&{x}_{k}(r)=0,\,\text{if}\,\,r\ge {r}_{\max },\end{array}$$

(6)

where $\mathop{\sum }\nolimits_{k = 1}^{K}\mathop{\sum }\nolimits_{m = 1}^{M-2}{({a}_{k}^{m}-2{a}_{k}^{m+1}+{a}_{k}^{m+2})}^{2}$ is the smoothness regularization and ρ is the parameter to control the effect of regularization. In our numerical testing section, we initiate a large ρ and gradually decrease it in subsequent iterations.

Numerical solution of stretchedNMF

In this section, we describe the numerical implementation of the stretchedNMF.

In order to numerically solve the functional optimization problem Eq. (6), we discretize the functionals and solve the corresponding vector optimization problem. Unlike Shifted NMF¹⁶, we cannot get benefits from discretizing the frequency domain of the components after applying the Fourier transform. So we choose to discretize the problem in the real r space, without loss of generality, using a uniform grid on $[0,{r}_{\max }]$. Since we have introduced the stretching factors, when we discretize the functionals ${x}_{k}(r/{a}_{k}^{m})$, on this uniform grid the arguments $r/{a}_{k}^{m}$ are actually not on the grid nodes. Therefore, we apply a spline interpolation, that is we approximate ${x}_{k}(r/{a}_{k}^{m})$ from x_k(r), where the interpolant is a piecewise polynomial. In terms of the order of the spline, we need at least a quadratic order, i.e., a piecewise quadratic polynomial with continuous derivatives on the grid points. The smoothness of the spline will help the convergence of the discretized optimization solution. In this paper, we use a quadratic spline interpolation to approximate ${x}_{k}(r/{a}_{k}^{m})$ in the optimization problem Eq. (6). Explicitly, let 0 = r₀ < r₁ < ⋯ < r_N = r_max be the uniform grid nodes, resulting in an interval of h = r_max/N. The quadratic piecewise polynomial approximation, S_i(r), of x(r) for r ∈ [r_i, r_i+1] is

$${S}_{i}(r)={q}_{i}(r-{r}_{i})(r-{r}_{i+1})+\left[x({r}_{i+1})-x({r}_{i})\right](r-{r}_{i})/h+x({r}_{i}),$$

(7)

where q_i is the quadratic coefficient to be determined. The derivatives of the polynomials S_i(r) and S_i+1(r) are

$${S}_{i}^{{\prime} }(r)={q}_{i}(2r-{r}_{i}-{r}_{i+1})+\left[x({r}_{i+1})-x({r}_{i})\right]/h,$$

(8)

$${S}_{i+1}^{{\prime} }(r)={q}_{i+1}(2r-{r}_{i+1}-{r}_{i+2})+\left[x({r}_{i+2})-x({r}_{i+1})\right]/h.$$

(9)

Notice the fact that the second-order spline should have continuous derivatives over the entire domain, which means that ${S}_{i}^{{\prime} }({r}_{i+1})={S}_{i+1}^{{\prime} }({r}_{i+1})$ at positions r_i+1 for i = 0, …, N − 2, using Eqs. (8) and (9), we get

$${q}_{i}+{q}_{i+1}=[x({r}_{i})-2x({r}_{i+1})+x({r}_{i+2})]/{h}^{2}.$$

(10)

Since we have x(r) = 0, for r ≥ r_max, we set S_N−1(r_N) = 0 and ${S}_{N-1}^{{\prime} }({r}_{N})=0$. Then we can write q as

$$\left(\begin{array}{c}{q}_{0}\\ {q}_{1}\\ \vdots \\ {q}_{N-1}\end{array}\right)=\frac{1}{{h}^{2}}{\left(\begin{array}{ccccc}1&1&&&\\ &1&1&&\\ &&\ddots &\ddots &\\ &&&1&1\\ &&&&1\end{array}\right)}^{-1}\left(\begin{array}{ccccc}1&-2&1&&\\ &1&-2&1&\\ &&\ddots &\ddots &\ddots \\ &&&1&-2\\ &&&&1\end{array}\right)\left(\begin{array}{c}x({r}_{0})\\ x({r}_{1})\\ \vdots \\ x({r}_{N-1})\end{array}\right).$$

(11)

Now we can write ${x}_{k}(r/{a}_{k}^{m})$ in terms of x_k(r_i) as a linear transformation

$${x}_{k}(r/{a}_{k}^{m})={q}_{i}(r/{a}_{k}^{m}-{r}_{i})(r/{a}_{k}^{m}-{r}_{i+1})+[x({r}_{i+1})-x({r}_{i})](r/{a}_{k}^{m}-{r}_{i})/h+x({r}_{i}),$$

(12)

if $r/{a}_{k}^{m}\in [{r}_{i},{r}_{i+1}]$ and ${x}_{k}(r/{a}_{k}^{m})$ is set to zero if $r/{a}_{k}^{m}\ge {r}_{max}$. Since the leading coefficient q is also linearly dependent on x as shown in Eq. (11), we can conclude the linear transformation ${x}_{k}({r}_{i}/{a}_{k}^{m})={b}_{i,{a}_{k}^{m}}^{T}{x}_{k}$ for i = 0, 1, …, N in a matrix form

$$\left(\begin{array}{c}{x}_{k}({r}_{0}/{a}_{k}^{m})\\ {x}_{k}({r}_{1}/{a}_{k}^{m})\\ \vdots \\ {x}_{k}({r}_{N}/{a}_{k}^{m})\end{array}\right)=\left(\begin{array}{ccc}\cdots \,&{b}_{0,{a}_{k}^{m}}^{T}&\cdots \\ \cdots \,&{b}_{1,{a}_{k}^{m}}^{T}&\cdots \\ &\vdots &\\ \cdots \,&{b}_{N,{a}_{k}^{m}}^{T}&\cdots \end{array}\right)\left(\begin{array}{c}{x}_{k}({r}_{0})\\ {x}_{k}({r}_{1})\\ \vdots \\ {x}_{k}({r}_{N})\end{array}\right),$$

(13)

and denote the coefficient matrix as ${B}_{{a}_{k}^{m}}$. Now we are ready to write the discretization of the optimization problem in Eq. (6) as

$$\mathop{\min }\limits_{{a}_{k}^{m}\ge 0,\,{y}_{k}^{m}\ge 0,\,{x}_{k}\ge 0}\mathop{\sum}\limits_{m = 1}^{M}{\left\Vert \mathop{\sum}\limits_{k = 1}^{K}{y}_{k}^{m}{B}_{{a}_{k}^{m}}{x}_{k}-{z}^{m}\right\Vert }^{2}+\rho \mathop{\sum}\limits_{k = 1}^{K}\mathop{\sum}\limits_{m = 1}^{M-2}{({a}_{k}^{m}-2{a}_{k}^{m+1}+{a}_{k}^{m+2})}^{2},$$

(14)

where ${y}_{k}^{m},{x}_{k}$ and z^m are discretized functionals on the uniform grid $0={r}_{0} < {r}_{1} < \cdots < {r}_{N}={r}_{\max }$.

If the theoretical convergence is neglected, linear interpolation may be used as an approximation. In this case, we set q_i = 0 in Eq. (12). The final form of the optimization problem is still Eq. (14), but with a different ${B}_{{a}_{k}^{m}}$ with higher sparsity.

Among the existing methods, a popular approach to solve the conventional NMF is alternating non-negative least squares (ANLS)^{41,42,43,44,45}. ANLS alternatively adjusts X and Y to minimize the objective function and each subproblem can be solved by the non-negative linear least square method. In fact, this framework is also called the block coordinate descent (BCD) method with two blocks. In our problem Eq. (14), which can be simplified as

$$\mathop{\min }\limits_{A\ge 0,Y\ge 0,X\ge 0}f(A,Y,X),$$

(15)

there are three blocks A, Y and X. Applying the BCD method with three blocks, we can solve problem Eq. (15) using Algorithm 1.

Algorithm 1

Block Coordinate Descent (BCD) Method

1: for t = 1, 2, ⋯ do

2: $A:= \arg \mathop{\min }\nolimits_{A\ge 0}f(A,Y,X)$

3: $Y:= \arg \mathop{\min }\nolimits_{Y\ge 0}f(A,Y,X)$

4: $X:= \arg \mathop{\min }\nolimits_{X\ge 0}f(A,Y,X)$

5: end for

Similar to conventional NMF, the subproblems of Y and X in Lines 3 and 4 are convex quadratic programming problems that can be easily solved by existing solvers. But the subproblem of A in Line 2 is highly non-convex and therefore we cannot efficiently solve it for its global minimum. In practice, we use a subspace trust-region method⁴⁶ to find a local minimum.

The convergence of the BCD method for 3 blocks is not guaranteed⁴⁷. Here we use an algorithm that can guarantee its convergence for a quadratic spline approximation that is called the linearized block coordinate descent method⁴⁸. The outline of the framework is presented in Algorithm 2, where α_t is the step size and $\hat{A}/\hat{X}/\hat{Y}$ are the extrapolations of the current A/X/Y and previous A/X/Y. In each iteration, the algorithm randomly chooses one block and minimizes the corresponding linear approximation and a proximal term. One can refer to ref. ⁴⁸ for more information about parameter selections.

Algorithm 2

linearized block coordinate descent method

1: for t = 1, 2, ⋯ do

2: pick one of the following to implement in a deterministic or random manner;

3: $\begin{array}{rcl}A:&=&\arg \mathop{\min }\nolimits_{A\ge 0}\langle {\nabla }_{A}\,f(\hat{A},Y,X),A\rangle +\frac{1}{{\alpha }_{t}}\parallel A-\hat{A}{\parallel }^{2}\\ &=&\max \{\hat{A}-\frac{{\alpha }_{t}}{2}{\nabla }_{A}\,f(\hat{A},Y,X),0\}\end{array}$

4: $\begin{array}{rcl}Y:&=&\arg \mathop{\min }\nolimits_{Y\ge 0}\langle {\nabla }_{Y}f(A,\hat{Y},X),Y\rangle +\frac{1}{{\alpha }_{t}}\parallel Y-\hat{Y}{\parallel }^{2}\\ &=&\max \{\hat{Y}-\frac{{\alpha }_{t}}{2}{\nabla }_{Y}f(A,\hat{Y},X),0\}\end{array}$

5: $\begin{array}{rcl}X:&=&\arg \mathop{\min }\nolimits_{X\ge 0}\langle {\nabla }_{X}\,f(A,Y,\hat{X}),X\rangle +\frac{1}{{\alpha }_{t}}\parallel X-\hat{X}{\parallel }^{2}\\ &=&\max \{\hat{X}-\frac{{\alpha }_{t}}{2}{\nabla }_{X}\,f(A,Y,\hat{X}),0\}\end{array}$

6: end for

In Algorithm 2, the gradient projection method is utilized in each iteration, which is a common technique in the ANLS framework for solving the classical NMF problem. The primary computational cost of this method lies in computing gradients, rendering the computation relatively lightweight. Specifically, the computational complexity of computing the gradient for A/X/Y is O(NMK), where the constant factor is influenced by the interpolation method. For example, by employing linear interpolation instead of quadratic spline, it can avoid the computation in Eq. (11) and thus reduce the computational cost. On one hand, due to the non-convex and non-quadratic nature of the subproblem of A, the gradient projection technique is quite suitable. On the other hand, in the updates of X/Y, the computational cost in each iteration using gradient projection is much smaller compared to other methods such as the interior-point method⁴⁹ (O((N + MK)M²K²)).

When considering the overall convergence of the algorithm, the reference⁴⁸ guarantees a sublinear convergence rate theoretically, while in the numerical experiments of this paper, it is observed that the algorithm can achieve the desired results within a reasonable number of iterations.

Algorithm developments

In the case of PDF data, we apply stretchedNMF to time-series data according to the workflow shown in the chart in Fig. 10. A common experimental function (for example, the output of xPDFsuite⁵⁰ and PDFgetX3⁵¹, is the G(r) function⁵². This function oscillates above and below zero. NMF works on the basis that signals are positive and in order to avoid the loss of signal where the function goes negative, we need to modify the signal into a non-negative form. Here we use an offset method, by taking the smallest of all data values and adding its absolute value to all data. This approach has the advantage of being simple and has been successfully applied to the deep learning method of PDF analysis⁹.

**Fig. 10: The workflow of STRETCHEDNMF.**

After running the NMF solvers, we must restore the components to valid G(r) functions (oscillating around zero). To do this we utilize the solved weights and stretching factors to recover the components according to

$$\mathop{\min }\limits_{{x}_{k}}\mathop{\sum}\limits_{m = 1}^{M}{\left\Vert \mathop{\sum}\limits_{k = 1}^{K}{y}_{k}^{m}{B}_{{a}_{k}^{m}}{x}_{k}-{z}^{m}\right\Vert }^{2},$$

(16)

where, z^m is the original data rather than the data after the offset pre-processing and the other symbols are described alongsided Eq. (14). The weight, y, and stretching factors, a, are fixed to be those obtained from the NMF solution, and we remove the constraint that the components must be non-negative. Functions resembling G(r) are then recovered from the NMF components and may be fit using standard PDF modeling protocols. This is reasonable because it is based on our trust in the weights and stretching factors of the NMF solver’s solution of the preprocessed data. This approach is highly automated and can be applied to both conventional NMF and stretchedNMF, because the stretching factor of the conventional NMF is always 1.

For the case of PXRD data from highly crystalline samples, we have the additional observation that the diffraction patterns consists of a sparse set of sharp peaks. That is, the function value is zero in between the Bragg peaks (neglecting backgrounds and any diffuse scattering). We can make use of this property to enhance our ability to decompose signals by adding a sparse regularization term to the optimization problem. For the case where there are smooth backgrounds in experimental PXRD data, the background can be easily and automatically eliminated to make the data sparse. In this case, we make two modifications to the optimization problem in Eq. (14). The first is adding the l_1/2 sparse regularization term to x⁵³. The second is adding an upper bound on y, in order to prevent x from collapsing to zero as a whole, resulting in

$$\begin{array}{ll}\,\mathop{\min }\limits_{{a}_{k}^{m}\ge 0,\,0\le {y}_{k}^{m}\le 1,\,{x}_{k}\ge 0}\,\mathop{\sum}\limits_{m = 1}^{M}{\left\Vert \mathop{\sum}\limits_{k = 1}^{K}{y}_{k}^{m}{B}_{{a}_{k}^{m}}{x}_{k}-{z}^{m}\right\Vert }^{2}\\ \,\,+\rho \mathop{\sum}\limits_{k = 1}^{K}\mathop{\sum}\limits_{m = 1}^{M-2}{\left({a}_{k}^{m}-2{a}_{k}^{m+1}+{a}_{k}^{m+2}\right)}^{2}+\eta \mathop{\sum}\limits_{k = 1}^{K}\mathop{\sum}\limits_{i = 1}^{N}{({x}_{k,i})}^{\frac{1}{2}}.\end{array}$$

(17)

We refer to this as sparse-stretchedNMF.

In this optimization model, there are two regularization parameters, ρ and η. From experience, the smoothness parameter ρ is not sensitive and is usually adjusted by multiplying by 10. The sparsity parameter η can be adjusted by doubling. Problem Eq. (17) is still solved using Algorithm 2. However, it is worth mentioning that when updating Y, a constraint of Y ≤ 1 is enforced, leading to the update rule:

$$Y:= \min \left\{\max \left\{\hat{Y}-\frac{{\alpha }_{t}}{2}{\nabla }_{Y}f(A,\hat{Y},X),0\right\},1\right\}.$$

The update for X is formulated as:

$$X:= \arg \mathop{\min }\limits_{X\ge 0}\frac{1}{2}{\Vert X-\hat{X}+\frac{{\alpha }_{t}}{2}{\nabla }_{X}\,f(A,Y,\hat{X})\Vert }^{2}+\eta \parallel X{\parallel }_{\frac{1}{2}}^{\frac{1}{2}},$$

which has a closed-form solution as demonstrated in ref. ⁵⁴.

Data availability

The datasets used in this study are available at https://github.com/guran1214/Stretched-NMF.

Code availability

A python package for carrying out stretchedNMF is under development as an open source project. It will be released on conda-forge and pypi. Installation instructions, documentation and the code itself are at https://github.com/diffpy/diffpy.snmf. Fully operational matlab scripts that were used to run the examples in this paper are available at https://github.com/guran1214/Stretched-NMF.

References

Buciu, I., Nikolaidis, N. & Pitas, I. Nonnegative matrix factorization in polynomial feature space. IEEE Trans. Neural Netw. 19, 1090–1100 (2008).
Article PubMed Google Scholar
Sra, S. & Dhillon, I. Nonnegative matrix approximation: Algorithms and applications. Technical Report TR-06-27 (Computer Science Department, 2006).
Cichocki, A. & Phan, A.-H. Fast local algorithms for large scale nonnegative matrix and tensor factorizations. IEICE Trans. Fundamentals Electron., Commun. Comput. Sci. E92-A, 708–721 (2009).
Article Google Scholar
Buciu, I. Nonnegative matrix factorization, a new tool for feature extraction: theory and applications. Int. J. Comput., Commun. Control (IJCCC) 3, 67–74 (2008).
Google Scholar
Jolliffe, I. T.Principal Component Analysis. Springer Series in Statistics, 2nd edn (Springer, New York, 2002).
Long, C. J., Bunker, D., Li, X., Karen, V. L. & Takeuchi, I. Rapid identification of structural phases in combinatorial thin-film libraries using x-ray diffraction and non-negative matrix factorization. Rev. Sci. Instrum. 80, 103902 (2009).
Article CAS PubMed Google Scholar
Kusne, A. G., Keller, D., Anderson, A., Zaban, A. & Takeuchi, I. High-throughput determination of structural phase diagram and constituent phases using GRENDEL. Nanotechnology 26, 444002 (2015).
Article CAS PubMed Google Scholar
Hua, X. et al. Non-equilibrium metal oxides via reconversion chemistry in lithium-ion batteries. Nat. Commun. 12, 561 (2021).
Article CAS PubMed PubMed Central Google Scholar
Liu, C.-H. et al. Validation of non-negative matrix factorization for rapid assessment of large sets of atomic pair distribution function data. J. Appl. Crystallogr. 54, 763–775 (2021).
Thatcher, Z. et al. nmfMapping: a cloud-based web application for non-negative matrix factorization of powder diffraction and pair distribution function datasets. Acta Crystallogr. Sect. A: Found. Adv. 78, 242–248 (2022).
Rakita, Y. et al. Mapping structural heterogeneity at the nanoscale with scanning nano-structure electron microscopy (SNEM). Acta Mater. 242, 118426 (2023).
Article CAS Google Scholar
Beauvais, M. L., Chupas, P. J., O’Nolan, D., Parise, J. B. & Chapman, K. W. Resolving single-layer nanosheets as short-lived intermediates in the solution synthesis of FeS. ACS Mater. Lett. 3, 698–703 (2021).
Article CAS Google Scholar
O’Nolan, D. et al. A multimodal analytical toolkit to resolve correlated reaction pathways: the case of nanoparticle formation in zeolites. Chem. Sci. 12, 13836–13847 (2021).
Article PubMed PubMed Central Google Scholar
Chen, Z. et al. Node distortion as a tunable mechanism for negative thermal expansion in metal–organic frameworks. J. Am. Chem. Soc. 145, 268–276 (2023).
Article CAS PubMed Google Scholar
Rayder, T. M. et al. Unveiling unexpected modulator-CO2 dynamics within a zirconium metal–organic framework. J. Am. Chem. Soc. 145, 11195–11205 (2023).
Article CAS PubMed Google Scholar
Morup, M., Madsen, K. H. & Hansen, L. K. Shifted non-negative matrix factorization. In 2007 IEEE Workshop on Machine Learning for Signal Processing, 139–144 (2007).
Rakita, Y. et al. Active reaction control of cu redox state based on real-time feedback from in situ synchrotron measurements. J. Am. Chem. Soc. 142, 18758–18762 (2020).
Article CAS PubMed Google Scholar
Guccione, P., Palin, L., Milanesio, M., Belviso, B. D. & Caliandro, R. Improved multivariate analysis for fast and selective monitoring of structural dynamics by in situ X-ray powder diffraction. Phys. Chem. Chem. Phys. 20, 2175–2187 (2018).
Article CAS PubMed Google Scholar
Pecharsky, V. & Zavalij, P. Fundamentals of Powder Diffraction and Structural Characterization of Materials, Second Edition (Springer Science & Business Media, 2008).
Dinnebier, R. E. & Billinge, S. J. L.Powder Diffraction: Theory and Practice (Royal Society of Chemistry, 2008).
Egami, T. & Billinge, S. J. L. Underneath the Bragg Peaks: Structural Analysis of Complex Materials. 2nd edn. No. in Pergamon Materials Series (Elsevier, 2012).
Yang, L., Juhás, P., Terban, M. W., Tucker, M. G. & Billinge, S. J. L. Structure-mining: screening structure models by automated fitting to the atomic pair distribution function over large numbers of models. Acta Crystallogr. Sect. A: Found. Adv. 76, 395–409 (2020).
Article CAS Google Scholar
Liu, C.-H., Tao, Y., Hsu, D. J., Du, Q. & Billinge, S. J. L. Using a machine learning approach to determine the space group of a structure from the atomic pair distribution function. Acta Crystallogr. A 75, 633–643 (2019).
Article CAS Google Scholar
Yang, L. et al. A cloud platform for atomic pair distribution function analysis: PDFitc. Acta Crystallogr. Sect. A: Found. Adv. 77, 2–6 (2021).
Article CAS Google Scholar
Keler, E.K. & Andreyeva, A.B. Further data on solid solutions in {ZrO}2-{TiO}2 system}. Refractories 1, 257–260 (1960).
Article Google Scholar
Andreev, A., Bulanyi, M., Hayward, S., Mozharovsikii, L. Synthesis and some properties of single crystals of the {Zn$_{x}$Cd$_{1-x}$S} and {ZnS$_{y}$Se$_{1-y}$} solid solutions. (Russian) J. Inorg. Chem. (translated from Zhurnal Neorganicheskoi Khimii) 40, 1079–1082 (1995).
Bland, J. A. The thermal expansion of cubic barium titanate (BaTiO3) FROM 350 °C TO 1050 °C. Can. J. Phys. 37, 417–421 (1959).
Article CAS Google Scholar
Su, C.-H., Feth, S. & Lehoczky, S. L. Thermal expansion coefficient of ZnSe crystal between 17 and 1080 ^∘c by interferometry. Mater. Lett. 63, 1475–1477 (2009).
Article CAS Google Scholar
Porter, D. DanPorter/dans_diffraction. Zenodo https://zenodo.org/record/3859501. (2020).
Juhás, P., Farrow, C., Yang, X., Knox, K. & Billinge, S. Complex modeling: a strategy and software program for combining multiple information sources to solve ill posed structure and nanostructure inverse problems. Acta Crystallogr. Sect. A: Found. Adv. 71, 562–568 (2015).
Article Google Scholar
O’Nolan, D. et al. A thermal-gradient approach to variable-temperature measurements resolved in space. J. Appl. Crystallogr. 53, 662–670 (2020).
Article PubMed PubMed Central Google Scholar
Ashiotis, G. et al. The fast azimuthal integration python library: pyFAI. J. Appl. Crystallogr. 48, 510–519 (2015).
Article CAS PubMed PubMed Central Google Scholar
Rietveld, H. M. The rietveld method. Phys. Scr. 89, 098002 (2014).
Article CAS Google Scholar
Martinolich, A. J., Kurzman, J. A. & Neilson, J. R. Polymorph selectivity of superconducting CuSe ₂ through kinetic control of solid-state metathesis. J. Am. Chem. Soc. 137, 3827–3833 (2015).
Article CAS PubMed Google Scholar
Lee, D. D. & Seung, H. S. Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999).
Article CAS PubMed Google Scholar
Berry, M. W., Browne, M., Langville, A. N., Pauca, V. P. & Plemmons, R. J. Algorithms and applications for approximate nonnegative matrix factorization. Comput. Stat. Data Anal. 52, 155–173 (2007).
Article Google Scholar
Wang, Y.-X. & Zhang, Y.-J. Nonnegative matrix factorization: a comprehensive review. IEEE Trans. Knowl. Data Eng. 25, 1336–1353 (2013).
Article Google Scholar
Abdi, H. & Williams, L. J. Principal component analysis. WIREs Comput. Stat. 2, 433–459 (2010).
Article Google Scholar
Ren, B., Pueyo, L., Zhu, G. B., Debes, J. & Duchêne, G. Non-negative matrix factorization: robust extraction of extended structures. Astrophys. J. 852, 104 (2018).
Article Google Scholar
Gobinet, C., Perrin, E. & Huez, R. Application of non-negative matrix factorization to fluorescence spectroscopy. In 2004 12th European Signal Processing Conference, p. 1095–1098 (2004).
Paatero, P. & Tapper, U. Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data values. Environmetrics 5, 111–126 (1994).
Article Google Scholar
Lin, C.-J. Projected gradient methods for nonnegative matrix factorization. Neural Comput. 19, 2756–2779 (2007).
Article PubMed Google Scholar
Kim, H. & Park, H. Nonnegative matrix factorization based on alternating nonnegativity constrained least squares and active set method. SIAM J. Matrix Anal. Appl. 30, 18 (2008).
Article Google Scholar
Guan, N., Tao, D., Luo, Z. & Yuan, B. NeNMF: an optimal gradient method for nonnegative matrix factorization. IEEE Trans. Signal Process. 60, 2882–2898 (2012).
Article Google Scholar
Huang, Y., Liu, H. & Zhou, S. Quadratic regularization projected Barzilai–Borwein method for nonnegative matrix factorization. Data Min. Knowl. Discov. 29, 1665–1684 (2015).
Article Google Scholar
Coleman, T. F. & Li, Y. An interior trust region approach for nonlinear minimization subject to bounds. SIAM J. Optim. 6, 28 (1996).
Article Google Scholar
Grippo, L. & Sciandrone, M. On the convergence of the block nonlinear Gauss–Seidel method under convex constraints. Oper. Res. Lett. 26, 127–136 (2000).
Article Google Scholar
Xu, Y. & Yin, W. A globally convergent algorithm for nonconvex optimization based on block coordinate update. J. Sci. Comput. 72, 700–734 (2017).
Article Google Scholar
Gu, R., Billinge, S. J. L. & Du, Q. A fast two-stage algorithm for non-negative matrix factorization in smoothly varying data. Acta Crystallogr. Sect. A 79, 203–216 (2023).
Article CAS Google Scholar
Yang, X., Juhas, P., Farrow, C. L. & Billinge, S. J. L. xPDFsuite: an end-to-end software solution for high throughput pair distribution function transformation, visualization and analysis. arXiv https://doi.org/10.48550/arXiv.1402.3163 (2015).
Juhás, P., Davis, T., Farrow, C. L. & Billinge, S. J. Pdfgetx3: a rapid and highly automatable program for processing powder diffraction data into total scattering pair distribution functions. J. Appl. Crystallogr. 46, 560–566 (2013).
Article Google Scholar
Farrow, C. L. & Billinge, S. J. L. Relationship between the atomic pair distribution function and small-angle scattering: implications for modeling of nanoparticles. Acta Crystallogr. Sect. A: Found. Crystallogr. 65, 232–239 (2009).
Article CAS Google Scholar
Xu, Z., Zhang, H., Wang, Y., Chang, X. & Liang, Y. L1/2 regularization. Sci. China Inf. Sci. 53, 1159–1169 (2010).
Article Google Scholar
Xu, Z., Chang, X., Xu, F. & Zhang, H. l_1/2 regularization: a thresholding representation theory and a fast solver. IEEE Trans. Neural Netw. Learn. Syst. 23, 1013–1027 (2012).
Article PubMed Google Scholar

Download references

Acknowledgements

We would like to thank Dr. Daniel Olds, for assistance during the measurements of the experimental PDF data. The work described here was funded by the Next Generation Synthesis Center (GENESIS), an Energy Frontier Research Center funded by the U.S. Department of Energy, Office of Science, Basic Energy Sciences under Award Number DE-SC0019212. X-ray PDF measurements were conducted on beamline 28-ID-2 of the National Synchrotron Light Source II, a US DOE Office of Science User Facility operated for the DOE Office of Science by Brookhaven National Laboratory under contract No. DESC0012704. Q.D. is also partially supported by DOE-ASCRDE-SC0022317. G.E.K received training and support as a part of QuADS: Quantitative Analysis of Dynamic Structures National Science Foundation Research Traineeship Program, grant number NSF DGE 1922639.

Author information

Authors and Affiliations

NITFID, School of Statistics and Data Science, Nankai University, Tianjin, China
Ran Gu
Department of Applied Physics and Applied Mathematics, Fu Foundation School of Engineering & Applied Sciences, Columbia University, New York, NY, USA
Yevgeny Rakita, Ling Lan, Zach Thatcher, Qiang Du & Simon J. L. Billinge
Department of Chemistry, Stony Brook University, Stony Brook, NY, USA
Gabrielle E. Kamm, Daniel O’Nolan & Karena W. Chapman
Department of Chemistry, Colorado State University, Fort Collins, CO, USA
Brennan Mcbride, Allison Wustrow & James R. Neilson

Authors

Ran Gu
View author publications
You can also search for this author in PubMed Google Scholar
Yevgeny Rakita
View author publications
You can also search for this author in PubMed Google Scholar
Ling Lan
View author publications
You can also search for this author in PubMed Google Scholar
Zach Thatcher
View author publications
You can also search for this author in PubMed Google Scholar
Gabrielle E. Kamm
View author publications
You can also search for this author in PubMed Google Scholar
Daniel O’Nolan
View author publications
You can also search for this author in PubMed Google Scholar
Brennan Mcbride
View author publications
You can also search for this author in PubMed Google Scholar
Allison Wustrow
View author publications
You can also search for this author in PubMed Google Scholar
James R. Neilson
View author publications
You can also search for this author in PubMed Google Scholar
Karena W. Chapman
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Du
View author publications
You can also search for this author in PubMed Google Scholar
Simon J. L. Billinge
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.G. proposed the methodology, contributed to manuscript writing and revision, performed data analysis, and handled data processing. Y.R. participated in manuscript writing and revision, conducted data analysis, contributed to experiment design, and assisted in data processing. L.L. was involved in manuscript writing, revision, data analysis, and data processing. Z.T. contributed to manuscript writing and data analysis. G.E.K. participated in manuscript writing, executed experiments, collected data, and performed data analysis. D.O. assisted in experiment execution, data collection, and data analysis. B.M. contributed to manuscript writing and data analysis. A.W. participated in manuscript writing, experiment execution, data collection, and data analysis. J.R.N. contributed to manuscript revision, experiment design, project supervision, and provided financial support. K.W.C. was involved in manuscript revision, experiment design, project supervision, and provided financial support. Q.D. contributed to manuscript writing, revision, final review, financial support, and project supervision. S.J.L.B. participated in manuscript writing, revision, financial support, and project supervision.

Corresponding authors

Correspondence to Qiang Du or Simon J. L. Billinge.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Cite this article

Gu, R., Rakita, Y., Lan, L. et al. Stretched non-negative matrix factorization. npj Comput Mater 10, 193 (2024). https://doi.org/10.1038/s41524-024-01377-5

Download citation

Received: 11 March 2024
Accepted: 04 August 2024
Published: 27 August 2024
DOI: https://doi.org/10.1038/s41524-024-01377-5
Springer Nature Limited

Stretched non-negative matrix factorization

Abstract

Similar content being viewed by others

Efficient and generalized processing of multidimensional NUS NMR data: the NESTA algorithm and comparison of regularization terms

Off-diagonal symmetric nonnegative matrix factorization

Deep data analysis via physically constrained linear unmixing: universal framework, domain examples, and a community-wide platform

Introduction

Results

Diffraction use case

Simulated data

Results on Simulated PDF

Results on simulated PXRD

Results on simulated PDF and PXRD data with small expansion coefficients

Experimental PXRD - thermal expansion

Experimental PXRD - thermal expansion and reaction

Discussion

Methods

Stretched non-negative matrix factorization

Numerical solution of stretchedNMF

Algorithm 1

Algorithm 2

Algorithm developments

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary information

Rights and permissions

About this article

Cite this article

Navigation

Stretched non-negative matrix factorization

Abstract

Similar content being viewed by others

Efficient and generalized processing of multidimensional NUS NMR data: the NESTA algorithm and comparison of regularization terms

Off-diagonal symmetric nonnegative matrix factorization

Deep data analysis via physically constrained linear unmixing: universal framework, domain examples, and a community-wide platform

Introduction

Results

Diffraction use case

Simulated data

Results on Simulated PDF

Results on simulated PXRD

Results on simulated PDF and PXRD data with small expansion coefficients

Experimental PXRD - thermal expansion

Experimental PXRD - thermal expansion and reaction

Discussion

Methods

Stretched non-negative matrix factorization

Numerical solution of stretchedNMF

Algorithm 1

Algorithm 2

Algorithm developments

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation