Reconfigurable Stochastic neurons based on tin oxide/MoS2 hetero-memristors for simulated annealing and the Boltzmann machine

Yan, Xiaodong; Ma, Jiahui; Wu, Tong; Zhang, Aoyang; Wu, Jiangbin; Chin, Matthew; Zhang, Zhihan; Dubey, Madan; Wu, Wei; Chen, Mike Shuo-Wei; Guo, Jing; Wang, Han

doi:10.1038/s41467-021-26012-5

Article
Open access
Published: 29 September 2021

Volume 12, article number 5710, (2021)

Abstract

Neuromorphic hardware implementation of the Boltzmann machine (BM) using a network of stochastic neurons can allow non-deterministic polynomial-time (NP) hard combinatorial optimization problems to be solved efficiently. Efficient implementation of such a BM with simulated annealing requires the statistical parameters of the stochastic neurons to be dynamically tunable; however, there has been limited research on stochastic semiconductor devices with controllable statistical distributions. Here, we demonstrate a reconfigurable tin oxide (SnO_x)/molybdenum disulfide (MoS₂) heterogeneous memristive device that can realize tunable stochastic dynamics in its output sampling characteristics. The device can sample exponential-class sigmoidal distributions analogous to the Fermi–Dirac distribution of physical systems with a quantitatively defined, tunable “temperature” effect. A BM composed of these tunable stochastic neuron devices, which can enable simulated annealing with designed “cooling” strategies, is implemented to solve MAX-SAT, a representative NP-hard combinatorial optimization problem. Quantitative insights into the effect of different “cooling” strategies on improving the efficiency of the BM optimization process are also provided.

Introduction

Stochastic neuron devices are essential for the neural network implementation of key emerging non-von-Neumann computing concepts such as Boltzmann machines (BMs), which are recurrent artificial neural networks with stochastic features analogous to the thermodynamics of real-world physical systems. BMs can be used to solve a broad range of combinatorial optimization problems^1,2 with applications in classification³, pattern recognition⁴, feature learning, and other emerging computing systems. Deriving its name from the Boltzmann distribution of statistical mechanics, a BM possesses an artificial notion of “temperature”, and the controlled evolution of this “temperature” parameter during the optimization process^5,6, i.e., the “cooling” strategy, can affect the convergence efficiency of the BM and its chance of reaching a better cost-energy minimum (or maximum, depending on the problem definition). To realize a hardware implementation of the BM that also allows “temperature” control, and hence the precise execution of a desired “cooling” strategy, it is essential to have electronic devices that can generate exponential-class stochastic sampling with dynamically tunable distribution parameters.

The deterministic properties of memristors have been commonly used in applications such as multiply-and-accumulate matrix calculation⁷ and resistor-logic demultiplexers^8,9,10. Their stochastic properties are often intentionally suppressed^11,12,13 in such applications in order to achieve accurate and reproducible computational results^14,15. On the other hand, the rich stochastic properties of memristors, which rely on ensembles of random movements of atoms and ions, offer opportunities in energy-efficient computing applications^16,17,18,19,20. With these stochastic properties, one can generate random numbers²¹ to encrypt information, implement physical unclonable functions²², and realize artificial neurons²³ with integrate-and-fire activations. Furthermore, emerging computing schemes can use stochastic memristive devices as building blocks to emulate biological neural networks^24,25, whose functions, such as decision-making, can leverage the stochastic dynamics of neurons and synapses. However, a common challenge with previous stochastic memristors is the lack of means to precisely control and modulate the probability distribution associated with their randomness. Realizing such devices has been difficult because many device-generated random features in stochastic memristors or oscillators lack a stable probability distribution, which limits the chance of controlling it experimentally^19,26,27. Additionally, a common memristor has only two terminals, so its probability distribution can only be influenced through the two-terminal bias and the distribution of the device output cannot be tuned flexibly and precisely.

In this work, we overcome this challenge with a three-terminal stochastic hetero-memristor based on a tin oxide/MoS₂ heterostructure, which demonstrates tunable statistical distributions enabled by gate modulation. The inherent exponential-class stochastic characteristics of the device, which arise from the intrinsic randomness and energy distribution in its ionic motions, are explored to realize sampling of exponential-class sigmoidal distributions that resemble the Fermi–Dirac distribution in physical systems. The device incorporates gate modulation that allows efficient control of the stochastic features in the device output characteristics. The device enables the realization of a reconfigurable stochastic neuron and the implementation of a Boltzmann machine in which the reconfigurable statistics of the device allow different “cooling” strategies to be implemented during the optimization process. The effect of different “cooling” strategies on improving the efficiency of the BM optimization process is demonstrated experimentally.

Results

Figure 1a shows the schematic of this reconfigurable heteromemristor, where tin oxide serves as the filament-switching layer and is sandwiched between a MoS₂ layer and Cr/Au top electrodes (TE). The Si substrate serves as a gate that applies a modulating bias (V_g), which can influence the filament-formation dynamics in the tin oxide layer. The high-resolution scanning transmission electron microscopy (HR-STEM) image in Fig. 1b shows the cross section of the fabricated device and reveals that the tin oxide layer is amorphous. An energy-dispersive X-ray spectroscopy (EDX) scan in Fig. 1c indicates the elemental composition. Figure 1d plots the Raman spectra for the SnSe sample before and after oxidation, which leads to the formation of the SnO_x layer. All signature modes of SnSe, including the shear mode A_g¹, the in-plane modes A_g² and B_3g, and the out-of-plane mode A_g³, are observed before oxidation and are not detected after oxidation, indicating the full oxidation and amorphization of the SnSe sample²⁸. The tin oxide film can also be synthesized using atomic-layer deposition (ALD)^29,30,31, which produces films of similar quality to the direct oxidation method.

**Fig. 1: Device structure and electrical characteristics.**

Unipolar electrical switching characteristics of the device at V_g = 0 V are shown in Fig. 1e. It sets and resets at around 3.2 V and 2.8 V, respectively, in the positive bias, and at −3.4 V and −3 V, respectively, in the negative bias³². Both Joule heating and the electric-field driven effect can play roles in the device operation. The filament formation can be due to a breakdown-like process with random creation of voltage-stress-induced vacancy or defect sites, which is electric-field driven. Joule heating can be the main effect in filament rupturing. The insertion of the MoS₂ layer in the device makes it possible to adjust the electron energy level in MoS₂ by externally modulating the gate bias V_g, which can modulate both the contact-energy barrier between the MoS₂ and SnO_x, and the conductivity of the MoS₂ sheet itself (see supplementary information section 4). Hence, as shown in Fig. 1f, as the gate bias decreases from 30 V to −20 V, the electrostatic doping in MoS₂ and the associated energy level decrease, leading to a reduction in the series conductivity and hence a gradual increase in the set voltage.

The filament-formation process is stochastic due to the inherent random motion of oxygen ions. To extract this stochastic property quantitatively, a statistical study is carried out on the set process. As shown in Fig. 2, the device is initially reset to the high-resistance state and a bias V_TE is applied to the device for up to 2 s. During each set process, it takes a certain amount of time t (t ≤ 2 s) after the bias voltage is applied for the device to be set. This required bias time until set varies stochastically from trial to trial. Furthermore, there is a certain chance that the device remains in the high-resistance state after 2 s. Figure 2a plots the device current as a function of time when this reset and set process was repeated 30 times at V_TE = 6 V, 5 V, 4 V, and 3 V, respectively, with V_g fixed at 0 V. At V_TE = 6 V, the device is successfully set within the first 2 s in all 30 trials. At V_TE = 5 V, 4 V, and 3 V, the device failed to set within the first 2 s in some trials. Figure 2b shows the histogram of the time required for the device to set, extracted from the 30 trials. If we consider t as a random variable, the probability that the set occurs within an infinitesimal interval $\Delta t$ at time t can be described by an exponential-class distribution³³ function $P=\frac{\Delta t}{\tau}\cdot e^{-\frac{t}{\tau}}$, i.e., t behaves like the wait time of a Poisson process with characteristic time $\tau$ (see supplementary information section 6), and this function fits the experimental data well (red lines, Fig. 2b). This experimental observation of Poisson-like random wait times underlying the filament-formation process in the tin oxide memristive device is indicative of its exponential-class stochastic nature.
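The wait-time picture above can be illustrated with a short numerical sketch. It assumes that the set time is exponentially distributed with a characteristic time `tau` and that a trial only counts as a successful set if it completes within the 2 s bias window; the function names and the value of `tau` are hypothetical and are not taken from the measurements.

```python
import math
import random


def sample_set_times(tau, n_trials, window=2.0, seed=0):
    """Draw exponential wait times with mean tau; keep those inside the window."""
    rng = random.Random(seed)
    times = [rng.expovariate(1.0 / tau) for _ in range(n_trials)]
    # Trials whose wait time exceeds the window leave the device unset.
    return [t for t in times if t <= window]


def set_probability(tau, window=2.0):
    """Closed-form probability that an exponential wait time ends within the window."""
    return 1.0 - math.exp(-window / tau)


tau = 1.0  # hypothetical characteristic time, in seconds
n_trials = 10000
observed = len(sample_set_times(tau, n_trials)) / n_trials
print(f"simulated: {observed:.3f}  analytic: {set_probability(tau):.3f}")
```

Under this assumption, a shorter `tau` (for example at a higher V_TE) pushes the fraction of successful sets toward 1, which is the qualitative trend described for Fig. 2a.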

**Fig. 2: Sampling of exponential-class sigmoidal distribution.**

Moreover, Fig. 2c plots P_ss,t<2s as a function of V_TE−V_TE0 under different gate voltages, which follows an exponential-class sigmoidal distribution function. Here, P_ss,t<2s is the probability that the device successfully sets within 2 s and V_TE0 is the 50% probability bias-voltage point, i.e., P_ss,t<2s (V_TE = V_TE0) = 0.5. With the gate voltage fixed, the chance of the device being set within t < 2 s becomes higher with increasing V_TE, following a sigmoidal distribution. This shows that V_TE can tune the stochastic property of the set event in the device when V_g is fixed. Microscopically, V_TE tunes the filament-formation process by modulating the vacancy-hopping barrier height and thus the ion-hopping rate. The device is therefore easier to set at high V_TE than at low V_TE. Under different gate voltages, P_ss,t<2s shows a sharper 0-to-1 transition when V_g is 30 V and a wider spread in its 0-to-1 transition as V_g decreases. Here V_g tunes the Fermi level and charge density in the MoS₂ layer, which modulates the potential distribution between the MoS₂ and tin oxide layers under V_TE bias. V_TE is more effective in modulating the device when V_g is higher, i.e., when the MoS₂ layer has a higher electron carrier density and higher conductivity, which leads to a sharper 0-to-1 transition in the sigmoidal distribution curve.

The set process is achieved by the filament formation through stochastic vacancy generation and hopping-transport processes. Applying a voltage can reduce the generation and hopping-barrier height and exponentially enhance the generation and hopping rates. Analytically, the set probability, P_ss,t<2s, can be derived as $P_{\mathrm{ss},\,t<2\,\mathrm{s}} = 1-e^{-\beta e^{\alpha (V_{\mathrm{TE}}-V_{\mathrm{TE}0})}}$, where $\alpha$ and $\beta$ are parameters related to the material and device structure (see supplementary information section 7). After further approximation, P_ss,t<2s can be simplified to a distribution function that resembles the Fermi–Dirac distribution (see supplementary information section 8):

$$P_{\mathrm{ss},\,t<2\,\mathrm{s}} \approx \frac{1}{1+\exp\left(-\frac{V_{\mathrm{TE}}-V_{\mathrm{TE}0}}{T_{\mathrm{eff}}}\right)}$$

(1)

where T_eff is an effective “temperature” term that can be tuned by the gate bias. This expression fits the experimental data in Fig. 2c very well. The above analytical description is also in agreement with kinetic Monte Carlo simulations, which describe the microscopic stochastic processes of vacancy generation, hopping, and recombination in filament formation^34,35. T_eff corresponding to various gate voltages is extracted from the fitting, and Fig. 2d plots T_eff versus gate voltage V_g. A behavioral model is developed to understand the dependence of T_eff on the gate-bias voltage. The device is modeled as a memristor in series with a MoS₂ layer whose resistance (both the sheet resistance and its contact property with the memristive filament) can be modulated by the gate electric field. As a result, T_eff can be expressed as $T_{\mathrm{eff}}(V_{\mathrm{g}})=T_{\mathrm{V0}}\left[1+\frac{Z}{V_{\mathrm{g}}-V_{\mathrm{T}}}\right]$, where $T_{\mathrm{V0}}$ and Z are constants and V_T is the threshold voltage (see supplementary information section 9). As shown in Fig. 2d, this model fits the experimental data well and describes the modulation of T_eff by V_g. We note that T_eff has the unit of volts. However, to avoid confusion with the actual electrical bias voltages applied to the device, the unit of T_eff is omitted in the subsequent discussions. The stochastic filament-formation process discussed above, together with the gate-voltage-dependent “temperature” effect, can be used to construct exponential-class distribution sampling that has broad applications in statistical modeling and computing, with the Boltzmann machine as a typical example.
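The two expressions above, the sigmoidal set probability of Eq. (1) and the behavioral model for T_eff(V_g), can be evaluated with a short sketch. The constants `t_v0`, `z`, and `v_t` below are illustrative assumptions, chosen only so that the model roughly reproduces the T_eff values quoted later in the text for V_g = 20 V, 0 V, and −20 V; they are not the fitted values from Fig. 2d.

```python
import math


def t_eff(v_g, t_v0=3.51, z=42.9, v_t=-23.24):
    """Behavioral model T_eff(V_g) = T_V0 * (1 + Z / (V_g - V_T)).

    The default constants are illustrative assumptions, not fitted values.
    """
    if v_g <= v_t:
        # The expression diverges at V_g = V_T, so this sketch rejects that range.
        raise ValueError("v_g must be larger than the threshold voltage v_t")
    return t_v0 * (1.0 + z / (v_g - v_t))


def set_probability(v_te, v_te0, temperature):
    """Eq. (1): logistic set probability with an effective temperature."""
    return 1.0 / (1.0 + math.exp(-(v_te - v_te0) / temperature))


for v_g in (20.0, 0.0, -20.0):
    temp = t_eff(v_g)
    p = set_probability(v_te=1.0, v_te0=0.0, temperature=temp)
    print(f"V_g = {v_g:6.1f}  T_eff = {temp:5.1f}  P(set) = {p:.3f}")
```

With these assumed constants, a higher V_g gives a lower T_eff and therefore a sharper transition of the set probability around V_TE0, in line with the trend described for Fig. 2c.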

To demonstrate the unique advantages of these tunable exponential-class stochastic heteromemristors in computing applications, a version of the Boltzmann machine that contains a network of stochastic neurons is implemented. The stochastic neurons may fire in response to the input signals and thus drive the searching dynamics of the BM. The BM iterates over candidate solutions to search for the best one by minimizing the system-energy function. Hardware implementations^36,37 of such a BM are challenging with conventional transistors and would require a large number of devices and complex circuitry. Here we build a BM in which each stochastic neuron is based on a single tin oxide/MoS₂ hetero-memristor acting as the stochastic switch, together with simple peripheral circuitry (more details in Methods: BM construction). This implemented BM is used to solve a maximum satisfiability problem (MAX-SAT), which is an NP-hard combinatorial optimization problem underlying a wide range of key applications, including Max-Clique³⁸, correlation clustering³⁹, treewidth computation⁴⁰, Bayesian network structure learning⁴¹, and argumentation dynamics⁴².

Given a set of Boolean clauses, where each clause is a disjunction of Boolean variables and their negations, the MAX-SAT problem⁴³ aims to maximize the number of clauses that are true when truth values are assigned to the Boolean variables. Without loss of generality, the set of Boolean clauses to be solved in this work is selected to be $\{C_i \mid i = 1, 2, \ldots, 5\}$, where the clause C1 is $(x \vee y \vee z)$; C2 is $(x' \vee y \vee z)$; C3 is $(x' \vee y' \vee z)$; C4 is $(x \vee y' \vee z')$; and C5 is $(x' \vee y \vee z')$ (shown in Fig. 3a; the Boolean variable $x'$ is the negation of the Boolean variable $x$). The optimization task here is to find a state vector $\mathbf{X}=(x_1,\cdots,x_6)=(x,y,z,x',y',z')$ that maximizes the number of true clauses. A MAX-SAT problem can be converted equivalently to a problem that is solvable by the BM^44,45. Six stochastic units are used in the BM to realize the activation for each Boolean variable in the state vector $\mathbf{X}$. Then we build a weight matrix $\mathbf{W}$. The weight $w_{ij}$ between every two Boolean variables is assigned based on the MAX-SAT problem. Solving the MAX-SAT problem is equivalent to minimizing the total energy $E=\mathbf{X}^{\mathrm{T}}\mathbf{W}\mathbf{X}$ of the BM, where $\mathbf{X}^{\mathrm{T}}$ is the transpose of $\mathbf{X}$.
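The clause set C1 to C5 and the six-element state vector can be written down directly, which makes it easy to check how many clauses a given assignment satisfies. The sketch below is only a software illustration of the problem definition; the data layout and function names are hypothetical, and it does not reproduce the weight matrix W used in the hardware BM.

```python
# Each clause is a list of (variable, is_positive) literals, following C1..C5 above.
CLAUSES = [
    [("x", True), ("y", True), ("z", True)],     # C1: x or y or z
    [("x", False), ("y", True), ("z", True)],    # C2: x' or y or z
    [("x", False), ("y", False), ("z", True)],   # C3: x' or y' or z
    [("x", True), ("y", False), ("z", False)],   # C4: x or y' or z'
    [("x", False), ("y", True), ("z", False)],   # C5: x' or y or z'
]


def clause_values(assignment):
    """Truth value of every clause for a dict such as {"x": True, "y": False, "z": False}."""
    return [any(assignment[var] == positive for var, positive in clause)
            for clause in CLAUSES]


def satisfied_count(assignment):
    """Number of clauses that are true under the assignment."""
    return sum(clause_values(assignment))


def state_vector(assignment):
    """State vector (x, y, z, x', y', z') with 1 for true and 0 for false."""
    values = [assignment[name] for name in ("x", "y", "z")]
    return tuple(int(v) for v in values) + tuple(int(not v) for v in values)


example = {"x": True, "y": False, "z": False}  # makes every literal of C4 true
print(state_vector(example))     # (1, 0, 0, 0, 1, 1)
print(clause_values(example))    # [True, False, True, True, True]
print(satisfied_count(example))  # 4
```

In this example every literal of C4 is true, so C2, which contains the three complementary literals, is the one clause left unsatisfied.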

**Fig. 3: Boltzmann machine implementation using tin oxide/MoS₂ heteromemristor.**

The constructed BM utilizing the tin oxide/MoS₂ heteromemristors is shown in Fig. 3b and the schematic of the circuit blocks with six stochastic neurons is shown in Fig. 3c. In each iteration step, if the hetero-memristor sets, the Boolean value of $x_i$ would be flipped. If the heteromemristor does not set, the stochastic neuron would not fire and $x_i$ remains the same. The stochastic neurons are sequentially updated until the BM reaches the optimal solution. In Fig. 3d, we experimentally demonstrated the evolution of the state vector and total energy when the BM started from three different initial states and found the same optimal solution, which is $\mathbf{X}=(x,y,z,x',y',z')=(0,1,1,1,0,0)$.
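The sequential update described above can be sketched in software. The sketch makes two assumptions that are not stated in the text: the flip probability of a unit is taken to be a logistic function of the energy change that the flip would cause, scaled by T_eff, and the weight matrix `w` is a small hypothetical example rather than the matrix derived from the MAX-SAT problem.

```python
import math
import random


def energy(x, w):
    """Total energy E = X^T W X for a binary state vector x."""
    n = len(x)
    return sum(x[i] * w[i][j] * x[j] for i in range(n) for j in range(n))


def flip_probability(delta_e, t_eff):
    """Assumed logistic flip probability: energy-lowering flips are more likely."""
    arg = max(-700.0, min(700.0, delta_e / t_eff))  # clamp to avoid overflow
    return 1.0 / (1.0 + math.exp(arg))


def sweep(x, w, t_eff, rng):
    """Update every unit once, in order; a unit flips if its stochastic switch 'sets'."""
    x = list(x)
    for i in range(len(x)):
        flipped = x[:i] + [1 - x[i]] + x[i + 1:]
        delta_e = energy(flipped, w) - energy(x, w)
        if rng.random() < flip_probability(delta_e, t_eff):
            x = flipped
    return x


# Hypothetical 3-unit weight matrix: units 0 and 1 penalize being on together.
w = [[-1, 2, 0],
     [2, -1, 0],
     [0, 0, -1]]
rng = random.Random(0)
state = sweep([0, 0, 0], w, t_eff=0.01, rng=rng)
print(state, energy(state, w))
```

At a very low T_eff the sweep behaves almost deterministically and only accepts energy-lowering flips, while a high T_eff makes every flip close to a coin toss, which is the trade-off explored in the following paragraphs.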

As previously shown in Fig. 2d, V_g can tune the tin oxide/MoS₂ heteromemristor to have different T_eff during the BM optimization process. T_eff of the BM describes the average behavior of all the stochastic units, in close analogy to the temperature parameter in the Boltzmann distribution that describes the average behavior of particles under different thermal equilibrium states in physical systems. Thus, by controlling T_eff during the optimization process, which can be achieved by tuning V_g, it is possible to avoid premature convergence and improve the convergence efficiency of the BM. Figure 3e shows the effect of different V_g biases on the BM optimization process. In three different runs of the BM, all the tin oxide/MoS₂ stochastic hetero-memristors are biased at V_g = −20 V, 0 V, and 20 V, respectively. The energy evolves differently in each of these runs. The BM is at T_eff = 7 when V_g = 20 V and converges easily for this particular problem. On the other hand, the BM is at T_eff = 50 when V_g = −20 V and is less efficient in reaching convergence. For V_g = 0 V, the BM is at T_eff = 10 and converges at an intermediate rate among the three cases. By counting how many times the BM reaches the global optimal solution out of 50 trial runs, the success rate as a function of V_g and T_eff is obtained statistically, as shown in Fig. 3f. It indicates that V_g, and hence T_eff, can substantially affect the performance of the BM.

Simulated annealing^46,47 can be implemented with our BM, where T_eff can gradually change during the optimization process to emulate different “cooling” strategies. It is an important approach for efficiently reaching better optimization solutions and for avoiding premature convergence. Using the gate-tunable tin oxide/MoS₂ device, such “cooling” procedures can be quantitatively implemented during the simulated annealing by translating the designated sequential evolution of T_eff into the corresponding series of gate-voltage bias conditions following the relation in Fig. 2d. To study the effect of different “cooling” strategies on the efficiency of the BM, four different T_eff variation strategies were experimentally applied to the BM. Strategy 1: high T_eff in the first three iteration steps followed by low T_eff for the remaining iterations in one optimization process (HT to LT); Strategy 2: low T_eff in the first three iterations followed by high T_eff for the remaining iterations (LT to HT); Strategy 3: maintaining a low T_eff in the entire optimization process (LT); and Strategy 4: maintaining a high T_eff in the entire optimization process (HT). Figure 4a shows a qualitative schematic of how the system energy (color dots) would evolve in the process of searching for optimal solutions among multiple possible energy minima (gray line). To analyze the effect of these “cooling” strategies, typical evolutions of the energy (cost function) during the BM optimization process for the four different strategies were experimentally obtained. As shown in Fig. 4b, using the HT strategy (T_eff = 50), the BM is highly active but loses the selectivity needed to reach proper convergence. Using the LT strategy (T_eff = 5), the BM is significantly less active but possesses higher selectivity, which makes it prone to converging to a premature state. Finally, simulated annealing using a “cooling” strategy (HT to LT) enables active initial searches at HT (T_eff = 50) and then steady convergence to the minimum energy state at LT (T_eff = 5), as shown in the experimental results. Furthermore, Figs. 4c and 4d show the experimentally obtained statistics of the success rate in finding the global optimal solution when the different “cooling” strategies are used. Different initial values for the state vectors are used in Figs. 4c and 4d to show the effect of different initial conditions. Both figures indicate that the HT to LT strategy has the highest success rate for reaching the global optimal solution for this particular problem, while the HT strategy has the lowest success rate. The results are consistent with the simulated performance of the BM (see supplementary information section 10).
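The four strategies can be expressed as per-iteration T_eff sequences, and each T_eff value can then be translated into a gate voltage by inverting the behavioral model T_eff(V_g) = T_V0[1 + Z/(V_g − V_T)]. The sketch below is a hedged illustration: the strategy labels and function names are hypothetical, and the model constants passed to `gate_voltage_for` in the usage line are placeholders rather than fitted device parameters.

```python
def t_eff_schedule(strategy, n_steps, high=50.0, low=5.0, switch_step=3):
    """Return the T_eff value for each iteration of one optimization run."""
    if strategy == "HT_to_LT":
        return [high if k < switch_step else low for k in range(n_steps)]
    if strategy == "LT_to_HT":
        return [low if k < switch_step else high for k in range(n_steps)]
    if strategy == "LT":
        return [low] * n_steps
    if strategy == "HT":
        return [high] * n_steps
    raise ValueError(f"unknown strategy: {strategy}")


def gate_voltage_for(t_eff, t_v0, z, v_t):
    """Invert T_eff = T_V0 * (1 + Z / (V_g - V_T)) to obtain V_g."""
    if t_eff <= t_v0:
        # The model approaches T_V0 only in the limit of very large V_g.
        raise ValueError("t_eff must be larger than t_v0")
    return v_t + z / (t_eff / t_v0 - 1.0)


print(t_eff_schedule("HT_to_LT", 6))  # [50.0, 50.0, 50.0, 5.0, 5.0, 5.0]
# Placeholder constants, used only to show the inversion:
print(gate_voltage_for(1.2, t_v0=1.0, z=10.0, v_t=-30.0))
```

Keeping the schedule separate from the voltage conversion mirrors the procedure in the text, where the cooling strategy is designed in terms of T_eff and only then mapped onto gate-bias conditions.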

**Fig. 4: Implementing the simulated annealing in tin oxide/MoS₂-based BM.**

To quantitatively understand why T_eff can make such a significant difference in the BM optimization process, we analyze the Russell–Rao (RR) similarity⁴⁸ between all the clauses for this particular MAX-SAT problem. This is because, as illustrated in Fig. 5a, all five clauses C1–C5 bear inherent similarity to each other due to the following two constraints: the variable constraint and the clause constraint. On the variable side, a Boolean variable and its negation (two variables connected by red lines) are always logically opposite. For example, $x$ and $x'$ will always have opposite values. On the clause side, the chance of two clauses both being true is lower if they contain more complementary Boolean variables. By assigning true values to the variables $x$, $y'$, and $z'$ (yellow circle), the number of complementary variables (blue circle) between clauses can be easily observed. Counting the number of complementary variables directly reflects the inner connection and constraint of the clauses. In Fig. 5a, for example, if the clause C4: $(x \vee y' \vee z')$ is true, then the probability that the clause C2: $(x' \vee y \vee z)$ is also true is much smaller than for the other three clauses, since C4 and C2 contain three pairs of complementary variables.

**Fig. 5: Russell–Rao similarity matrix underlying the clauses employing different “cooling” strategies in a MAX-SAT problem.**

With the BM set to different T_eff, the RR similarity matrix among the five clauses is constructed from the experimental data in Figs. 5b, 5c and 5d. The color and number in each cell quantify the similarity between the pair of clauses indexed by the row and column. Each entry is the fraction of runs in which both clauses are true. For example, an RR similarity of 0.84 between C1 and C2 in Fig. 5b means that, when the BM was run 50 times at T_eff = 50, C1 and C2 were both true at the end of 42 of the 50 runs.
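The RR similarity matrix described above can be computed from the final clause truth values of repeated runs. The following sketch assumes that each run is recorded as a tuple of Boolean clause values; the function name and the synthetic two-clause data are hypothetical and only mirror the 42-out-of-50 example in the text.

```python
def russell_rao_matrix(runs):
    """RR similarity: fraction of runs in which clause i and clause j are both true.

    runs is a list of tuples, one per run, holding the truth value of each clause.
    """
    n_runs = len(runs)
    n_clauses = len(runs[0])
    return [
        [sum(1 for run in runs if run[i] and run[j]) / n_runs
         for j in range(n_clauses)]
        for i in range(n_clauses)
    ]


# Synthetic data for two clauses: both true in 42 runs, only the first true in 8 runs.
runs = [(True, True)] * 42 + [(True, False)] * 8
rr = russell_rao_matrix(runs)
print(rr[0][1])  # 0.84
```

The matrix is symmetric by construction, and each diagonal entry is simply the fraction of runs in which that clause alone is true.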

The effect of T_eff can be explained as follows. We view the RR similarity as a distance measure of the statistical relationship between each pair of clauses (distance = 1 − RR coefficient) in solution space⁴⁹. In other words, clauses with RR similarity close to 1 are seen as closely clustered, while clauses with RR similarity close to 0 are the most widely separated. When T_eff is tuned to 50 (Fig. 5b), all the clauses have similar distances in the solution space, since all pairs show similar RR similarity. As a consequence, the BM tends to search widely in the solution space with high robustness, high stochasticity, and low selectivity, since all solutions look the same to the BM. When T_eff is 20 (Fig. 5c), clauses with small distances are closely clustered, giving an RR similarity close to unity for pairs of clauses that can easily be satisfied simultaneously, such as C1 and C2, and a low RR similarity for pairs of clauses that can hardly be satisfied at the same time, such as C1 and C4. At T_eff = 20, the BM gains more selectivity in solution space. When T_eff is 5 (Fig. 5d), all the clauses are either strongly clustered or strongly separated, with RR similarity of distinctly either 1 or 0. The BM then behaves more like a deterministic “machine”. This tends to cause premature convergence, as the BM is significantly less active.

Next, a simulated annealing process in the BM with linear cooling is simulated in Fig. 5e. The evolution of the RR similarity matrix indicates that the BM evolves through all the cases discussed above, from fully stochastic toward nearly deterministic, as T_eff decreases linearly. Thus, the simulated annealing process of a BM can be understood as follows: at high T_eff, the BM searches the solution space globally with high robustness and low selectivity, which favors large descent steps; as the BM cools down, it gains selectivity toward some solutions and can still jump out of local minima, since T_eff provides enough perturbation; as the BM cools down to the limit, its selectivity outweighs its robustness, which prevents it from jumping out of the optimal zone. Hence, more efficient performance of the BM can be achieved with an appropriate “cooling” strategy.

In summary, tunable stochastic behavior is demonstrated in the tin oxide/MoS₂ heteromemristor, showing inherent exponential-class statistical characteristics. The device can sample exponential-class sigmoidal distributions resembling the Fermi–Dirac distribution in physical systems with tunable distribution parameters to emulate the “temperature” effects. Simulated annealing with control of the “cooling” strategies is demonstrated in the implemented Boltzmann machine for solving combinatorial optimization with respect to a MAX-SAT problem. These stochastic neurons based on tin oxide/MoS₂ heteromemristors with reconfigurable statistical behavior pave the way for implementing selected “cooling” strategies in BM to reach optimal convergence efficiency and can find broad applications in energy-efficient computing for learning, clustering, and classification.

Methods

Device fabrication

A thin MoS₂ layer is first deposited on a Si wafer with a 285-nm thermally grown SiO₂ layer on top. The sample is then treated in an Ar/H₂-mixed gas environment at 350 °C to clean the MoS₂ surface. Subsequently, a thin tin oxide layer oxidized from SnSe is deposited on the MoS₂ and serves as the filament-switching layer. Electron beam lithography is then used to transfer the patterns, followed by the evaporation of a 10-nm/40-nm Cr/Au metal stack, which forms the top electrode.

STEM and EDX

An FEI Titan Themis G2 system with four detectors and spherical-aberration correction was used to acquire the HRSTEM images. To observe the cross section, the sample was pretreated by depositing chromium and carbon-capping layers, then thinned by a focused-ion beam (FIB, FEI Helios 450 S) with an acceleration voltage of 30 kV. The HRSTEM image was acquired with an acceleration voltage of 200 kV. EDX signals, collected with the EDX unit integrated within the STEM system, were used to identify the elemental composition of the cross section.

Raman spectroscopy

A Renishaw inVia Qontor system equipped with a ×100 objective lens, a grating (1800 grooves mm⁻¹), and a charge-coupled device camera was used to measure the Raman spectra. The wavelength of the excitation laser was 532 nm (from a solid-state laser). The Raman spectral resolution is 1.2 cm⁻¹ per pixel.

BM construction

The implemented BM prototype contains 24 5-bit digital-to-analog converters (DACs). The digital pattern generation interface (DPGI) and training data acquisition interface (TDAI) are controlled by a Xilinx ML605 FPGA board that carries out information storage and computations. This forms a feedback loop that adjusts both input and output patterns at each BM iteration. Depending on the input signals, the BM system adjusts the corresponding output training data accordingly. The BM prototype has six stochastic units, each containing peripheral circuitry and a tin oxide/MoS₂ heteromemristor whose switching probability is approximately sigmoidal in the applied voltage. The peripheral circuitry consists of four DACs, which convert digital voltage values into analog voltages applied to the heteromemristor, a dynamic comparator for generating the discrete-state readout, and output-level shifters.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Kirkpatrick, S., Gelatt, C. D. & Vecchi, M. P. Optimization by simulated annealing. Science 220, 671–680 (1983).
Smith, K. A. Neural networks for combinatorial optimization: a review of more than a decade of research. INFORMS J. Comput. 11, 15–34 (1999).
Larochelle, H., Mandel, M., Pascanu, R. & Bengio, Y. Learning algorithms for the classification restricted Boltzmann machine. J. Mach. Learn. Res. 13, 643–669 (2012).
Fischer, A. & Igel, C. In Iberoamerican Congress on Pattern Recognition. pp. 14–36 (Springer, 2012).
Li, G. et al. Temperature based restricted Boltzmann machines. Sci. Rep. 6, 19133 (2016).
Salazar, D. S. Nonequilibrium thermodynamics of restricted Boltzmann machines. Phys. Rev. E 96, 022131 (2017).
Prezioso, M. et al. Training and operation of an integrated neuromorphic network based on metal-oxide memristors. Nature 521, 61–64 (2015).
Kuekes, P. J. et al. Resistor-logic demultiplexers for nanoelectronics based on constant-weight codes. Nanotechnology 17, 1052 (2006).
Kuekes, P. J., Robinett, W. & Williams, R. S. Improved voltage margins using linear error-correcting codes in resistor-logic demultiplexers for nanoelectronics. Nanotechnology 16, 1419 (2005).
Pan, C. et al. Reconfigurable logic and neuromorphic circuits based on electrically tunable two-dimensional homojunctions. Nat. Electron. 3, 383–390 (2020).
Ambrogio, S. et al. Equivalent-accuracy accelerated neural-network training using analogue memory. Nature 558, 60–67 (2018).
Boybat, I. et al. Neuromorphic computing with multi-memristive synapses. Nat. Commun. 9, 1–12 (2018).
Sangwan, V. K. & Hersam, M. C. Neuromorphic nanoelectronic materials. Nat. Nanotechnol. 15, 517–528 (2020).
Wong, H.-S. P. & Salahuddin, S. Memory leads the way to better computing. Nat. Nanotechnol. 10, 191–194 (2015).
Yu, S., Wu, Y., Jeyasingh, R., Kuzum, D. & Wong, H.-S. P. An electronic synapse device based on metal oxide resistive switching memory for neuromorphic computation. IEEE Trans. Electron Devices 58, 2729–2737 (2011).
Hu, M., Wang, Y., Wen, W., Wang, Y. & Li, H. Leveraging stochastic memristor devices in neuromorphic hardware systems. IEEE J. Emerg. Sel. Top. Circuits Syst. 6, 235–246 (2016).
Gaba, S., Sheridan, P., Zhou, J., Choi, S. & Lu, W. Stochastic memristive devices for computing and neuromorphic applications. Nanoscale 5, 5872–5878 (2013).
Gaba, S., Knag, P., Zhang, Z. & Lu, W. In 2014 IEEE International Symposium on Circuits and Systems (ISCAS). 2592–2595 (IEEE, 2014).
Cai, F. et al. Power-efficient combinatorial optimization using intrinsic noise in memristor Hopfield neural networks. Nat. Electron. 3, 409–418 (2020).
Zhu, X., Li, D., Liang, X. & Lu, W. D. Ionic modulation and ionic coupling effects in MoS₂ devices for neuromorphic computing. Nat. Mater. 18, 141–148 (2019).
Jiang, H. et al. A novel true random number generator based on a stochastic diffusive memristor. Nat. Commun. 8, 1–9 (2017).
Zhang, R. et al. Nanoscale diffusive memristor crossbars as physical unclonable functions. Nanoscale 10, 2721–2726 (2018).
Wang, Z. et al. Fully memristive neural networks for pattern classification with unsupervised learning. Nat. Electron. 1, 137–145 (2018).
Zhang, W. et al. Neuro-inspired computing chips. Nat. Electron. 3, 371–382 (2020).
Baek, E. et al. Intrinsic plasticity of silicon nanowire neurotransistors for dynamic memory and learning functions. Nat. Electron. 3, 398–408 (2020).
Serb, A. et al. Unsupervised learning in probabilistic neural networks with multi-state metal-oxide memristive synapses. Nat. Commun. 7, 1–9 (2016).
Huang, C.-Y., Shen, W. C., Tseng, Y.-H., King, Y.-C. & Lin, C.-J. A contact-resistive random-access-memory-based true random number generator. IEEE Electron Device Lett. 33, 1108–1110 (2012).
Zhao, S. et al. Controlled synthesis of single-crystal SnSe nanoplates. Nano Res. 8, 288–295 (2015).
Park, B.-E. et al. Phase-controlled synthesis of SnO_x thin films by atomic layer deposition and post-treatment. Appl. Surf. Sci. 480, 472–477 (2019).
Lee, J.-H. et al. Selective SnO_x atomic layer deposition driven by oxygen reactants. ACS Appl. Mater. Interfaces 10, 33335–33342 (2018).
Hoffmann, L. et al. Atmospheric pressure plasma enhanced spatial atomic layer deposition of SnO_x as conductive gas diffusion barrier. J. Vac. Sci. Technol. A Vac. Surf. Films 36, 01A112 (2018).
Nagashima, K., Yanagida, T., Oka, K. & Kawai, T. Unipolar resistive switching characteristics of room temperature grown SnO₂ thin films. Appl. Phys. Lett. 94, 242902 (2009).
Jo, S. H., Kim, K.-H. & Lu, W. Programmable resistance switching in nanoscale two-terminal devices. Nano Lett. 9, 496–500 (2009).
Sadi, T., Badami, O., Georgiev, V. & Asenov, A. In International Conference on Large-Scale Scientific Computing. 429–437 (Springer, 2019).
Wu, T., Zhao, H., Liu, F., Guo, J. & Wang, H. Machine Learning Approach for Device-Circuit Co-Optimization of Stochastic-Memristive-Device-Based Boltzmann Machine. arXiv preprint arXiv:1905.04431 (2019).
Kim, S. K., McAfee, L. C., McMahon, P. L. & Olukotun, K. In 2009 International Conference on Field Programmable Logic and Applications. 367–372 (IEEE, 2009).
Kim, L.-W., Asaad, S. & Linsker, R. A fully pipelined FPGA architecture of a factored restricted Boltzmann machine artificial neural network. ACM Trans. Reconfigurable Technol. Syst. 7, 1–23 (2014).
Heras, F. & Larrosa, J. In International Conference on Theory and Applications of Satisfiability Testing. 139–152 (Springer, 2008).
Berg, J. & Järvisalo, M. In 2013 IEEE 13th International Conference on Data Mining Workshops. 750–757 (IEEE, 2013).
Berg, J. & Järvisalo, M. In 2014 IEEE 26th International Conference on Tools with Artificial Intelligence. 328–335 (IEEE, 2014).
Cussens, J. Bayesian network learning by compiling to weighted MAX-SAT. arXiv preprint arXiv:1206.3244 (2012).
Wallner, J. P., Niskanen, A. & Järvisalo, M. Complexity results and algorithms for extension enforcement in abstract argumentation. J. Artif. Intell. Res. 60, 1–40 (2017).
Ansótegui, C., Bonet, M. L. & Levy, J. SAT-based MaxSAT algorithms. Artif. Intell. 196, 77–105 (2013).
d’Anjou, A., Grana, M., Torrealdea, F. J. & Hernandez, M. Solving satisfiability via Boltzmann machines. IEEE Trans. Pattern Anal. Mach. Intell. 15, 514–521 (1993).
Bojnordi, M. N. & Ipek, E. In 2016 IEEE International Symposium on High Performance Computer Architecture (HPCA). 1–13 (IEEE, 2016).
Shin, J. H., Jeong, Y. J., Zidan, M. A., Wang, Q. & Lu, W. D. In 2018 IEEE International Electron Devices Meeting (IEDM). pp. 3 (IEEE, 2018).
Yang, K. et al. Transiently chaotic simulated annealing based on intrinsic nonlinearity of memristors for efficient solution of optimization problems. Sci. Adv. 6, eaba9901 (2020).
Zhang, B. & Srihari, S. N. In Document Recognition and Retrieval X. Vol. 5010, 28–38 (International Society for Optics and Photonics, 2003).
Finch, H. Comparison of distance measures in cluster analysis with dichotomous data. J. Data Sci. 3, 85–100 (2005).

Acknowledgements

This work is supported in part by the Army Research Office (grant no. W911NF-21-2-0128) and National Science Foundation (grant no. CMMI-2036359). T.W. and J.G. acknowledge support by National Science Foundation (grant no. 1809770 and 1904580). W.W. acknowledges the support from Air Force Research Laboratory (grant no. FA8750-19-1-0503).

Author information

These authors contributed equally: Xiaodong Yan, Jiahui Ma.

Authors and Affiliations

Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, 90089, USA
Xiaodong Yan, Jiahui Ma, Aoyang Zhang, Jiangbin Wu, Wei Wu, Mike Shuo-Wei Chen & Han Wang
Department of Electrical and Computer Engineering, University of Florida, Gainesville, FL, 32611, USA
Tong Wu & Jing Guo
Sensors and Electron Devices Directorate, U.S. Army Research Laboratory, Adelphi, MD, 20723, USA
Matthew Chin & Madan Dubey
School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, 30332, USA
Zhihan Zhang
Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, CA, 90089, USA
Han Wang

Contributions

X.Y., J.M., and H.W. conceived the project idea. X.Y., J.M., and J.W. fabricated the devices, characterized their electrical performance, and constructed and measured the BM circuit. A.Z., X.Y., M.S.-W.C., and Z.Z. contributed to the design of the BM circuit. M.C. and M.D. contributed to the device fabrication. W.W. contributed to the understanding of the device operation. T.W., X.Y., J.M., and J.G. led the simulation and modeling of the device and BM circuit. H.W. coordinated and supervised the overall research activities. All coauthors contributed to the discussion of the data. X.Y., J.M., T.W., J.G., and H.W. cowrote the paper with input from all coauthors.

Corresponding author

Correspondence to Han Wang.

Ethics declarations

Competing interests

The authors declare the following competing interests: H.W. currently also leads the low-dimensional materials research at Taiwan Semiconductor Manufacturing Company (TSMC) Corporate Research. All other authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Gunuk Wang and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Yan, X., Ma, J., Wu, T. et al. Reconfigurable Stochastic neurons based on tin oxide/MoS₂ hetero-memristors for simulated annealing and the Boltzmann machine. Nat Commun 12, 5710 (2021). https://doi.org/10.1038/s41467-021-26012-5

Received: 09 January 2021
Accepted: 09 September 2021
Published: 29 September 2021
DOI: https://doi.org/10.1038/s41467-021-26012-5
Springer Nature Limited

This article is cited by

Free-standing two-dimensional ferro-ionic memristor
- Jinhyoung Lee
- Gunhoo Woo
- Taesung Kim
Nature Communications (2024)
Efficient combinatorial optimization by quantum-inspired parallel annealing in analogue memristor crossbar
- Mingrui Jiang
- Keyi Shan
- Can Li
Nature Communications (2023)
CMOS-compatible Ising and Potts annealing using single-photon avalanche diodes
- William Whitehead
- Zachary Nelson
- Luke Theogarajan
Nature Electronics (2023)
A two-dimensional MoS2 array based on artificial neural network learning for high-quality imaging
- Long Chen
- Siyuan Chen
- Jinhui Song
Nano Research (2023)
