Reconfigurable nonlinear photonic activation function for photonic neural network based on non-volatile opto-resistive RAM switch

Xu, Zefeng; Tang, Baoshan; Zhang, Xiangyu; Leong, Jin Feng; Pan, Jieming; Hooda, Sonu; Zamburg, Evgeny; Thean, Aaron Voon-Yew

doi:10.1038/s41377-022-00976-5

Reconfigurable nonlinear photonic activation function for photonic neural network based on non-volatile opto-resistive RAM switch

Article
Open access
Published: 06 October 2022

Volume 11, article number 288, (2022)
Cite this article

Download PDF

You have full access to this open access article

Light: Science & Applications

Reconfigurable nonlinear photonic activation function for photonic neural network based on non-volatile opto-resistive RAM switch

Download PDF

Zefeng Xu^1,2,
Baoshan Tang²,
Xiangyu Zhang²,
Jin Feng Leong ORCID: orcid.org/0000-0002-1939-8632²,
Jieming Pan²,
Sonu Hooda²,
Evgeny Zamburg ORCID: orcid.org/0000-0002-4672-2813² &
…
Aaron Voon-Yew Thean^1,2

7281 Accesses
16 Citations
25 Altmetric
3 Mentions
Explore all metrics

Abstract

Photonic neural network has been sought as an alternative solution to surpass the efficiency and speed bottlenecks of electronic neural network. Despite that the integrated Mach–Zehnder Interferometer (MZI) mesh can perform vector-matrix multiplication in photonic neural network, a programmable in-situ nonlinear activation function has not been proposed to date, suppressing further advancement of photonic neural network. Here, we demonstrate an efficient in-situ nonlinear accelerator comprising a unique solution-processed two-dimensional (2D) MoS₂ Opto-Resistive RAM Switch (ORS), which exhibits tunable nonlinear resistance switching that allow us to introduce nonlinearity to the photonic neuron which overcomes the linear voltage-power relationship of typical photonic components. Our reconfigurable scheme enables implementation of a wide variety of nonlinear responses. Furthermore, we confirm its feasibility and capability for MNIST handwritten digit recognition, achieving a high accuracy of 91.6%. Our accelerator constitutes a major step towards the realization of in-situ photonic neural network and pave the way for the integration of photonic integrated circuits (PIC).

Programmable phase-change metasurfaces on waveguides for multimode photonic convolutional neural network

Article Open access 04 January 2021

Nonlinear germanium-silicon photodiode for activation and monitoring in photonic neuromorphic networks

Article Open access 13 October 2022

Lithography-free reconfigurable integrated photonic processor

Article 27 April 2023

Introduction

Artificial Neural Network (ANN) is a computational model for mimicking the human brain in information processing¹. It consists of massive nodes, namely “neurons” connected to each other through synapses. The computational complexity of ANN in model iterations requires large computational ability for multiply-and-accumulate (MAC) operation². With the continuous advancement of ANN, the past decade has witnessed an exponential rise in the demand for high computing speed and low energy consumption^3,4. As this demand continues, graphics processing unit (GPU) and even central processing unit (CPU)/GPU heterogenous architectures become attractive options for the ANN acceleration since they offer more computational parallelism than CPU⁵. Besides, more electronics architectures have been also developed, such as Application-Specific Integrated Circuit (ASIC) and Field-Programmable Gate Array (FPGA) chips to increase the ANN computing speed and efficiency^6,7,8. However, these architectures are still limited by electrical interconnects with resistance and capacitance (RC) parasitic effects and the twilight of Moore’s law for CMOS technology⁹. As an alternative to electronics, photonics has been considered as a promising archetypal solution to address these issues, with ultra-low computation loss, sub-nanosecond latencies and abundant computing parallelism^10,11. Moreover, photonics can deliver higher bandwidth, better energy-efficiency, and more complex functionality¹².

Recent works have demonstrated the potential of photonic neural network in the acceleration of ANN. The first photonic ANN was implemented on a free-space light platform with optical lens¹³. However, it has a disadvantage of low integration. Along with the rapid development of integrated photonics, the combination of Micro-Ring-Resonator (MRR)-based weighting bank and Photodetector arrays achieves small-scale matrix multiplication with the assistance of Wavelength Division Multiplexing technology^14,15, but this method is not efficient enough due to the large footprint of MRRs. To enlarge the matrix computation scale, MZI mesh on an integrated photonic chip has been proposed for MAC operations^16,17. This corresponds to one of the basic functions of ANN, weighting layer, to interpret incoming signals, with superior propagation speed and power efficiency¹⁸. However, the lack of another necessary basic function, applying in-situ nonlinear activation function to the sum of weighted inputs after MAC functions, remains an open challenge in photonic neural network. It results in insufficient performance, including low recognition accuracy and slow convergence rate¹⁹. This originates from the limited and invariable network complexity. Although the number of linear layers can be increased, the linear photonic ANN model still cannot fit the real physical world problems, which hardly follow straightforward linearity.

To address this challenge, several approaches for in-situ nonlinear activation accelerator in photonics have been proposed and extensively investigated, providing suitable paths for achieving a complete suite of ANN in photonics. For example, two-section distributed-feedback (DFB) lasers²⁰, vertical-cavity surface-emitting laser (VCSEL)²¹ and disk lasers²² have shown promising results, but they are bottlenecked by network scale, frequency of access and power consumption. Moreover, their nonlinear activation responses tend to be fixed during accelerator fabrication, but the nonlinear activation forms should be reprogrammed according to different ANN models and data sets²³. Thus, as a complementary approach, a more straightforward and flexible implementation is attained by calculating the nonlinear functions in CPU, which connects physical photonic neural networks through electrical-to-optical (E/O) and optical-to-electrical (O/E) converters^24,25. Unfortunately, it still suffers from the limitations of low efficiency and high latency with frequent access, due to poor performance of parallel computation²⁶. Another challenge associated with this approach is the adoption of highly efficient optical-to-electrical and electrical-to-optical converter devices, which greatly influence the power consumption of the whole system^27,28. Therefore, to address these issues, one should concurrently research both sides: suitable devices to achieve direct communication between photon and electron, as well as efficient and programmable nonlinear activation accelerator structure.

Herein, we have proposed an optical-to-optical nonlinear activation accelerator in an optical-electrical hybrid structure which alleviates the aforementioned challenges on both device and accelerator structure sides. This accelerator has been developed based on a unique Opto-Resistive RAM Switch, whose memristive behaviour is sensitive to incident light, using solution-processed 2D MoS₂. The solution processed technology has an advantage of the ease of large-scale integration with a low thermal budget, which is critical in processing with highly sensitive optical components on a chip. Furthermore, the Opto-Resistive RAM Switch switching voltage from high resistance state to low resistance state shows a linear dependence to the input optical power, bridging the Opto-Resistive RAM Switch to the photonic ANN for nonlinear activation accelerator. Based on this unique photosensitive device, our proposed accelerator features a variety of nonlinear activation response. The nonlinear accelerator consists of Opto-Resistive RAM Switch, low-power control unit, and MZI with tunable phase change material (PCM). Additionally, this structure allows for the possibility of active tunability of nonlinear response under different initial conditions. In this way, we demonstrate the availability of our Opto-Resistive RAM Switch-based nonlinear activation accelerator in a multi-class MNIST handwritten digit recognition using photonic neural network, with high accuracy and fast convergence rate.

Results

Architecture of the novel photonic neural network

Our overall approach is summarized in Fig. 1. ANN necessitates multiple hidden layers, each with a weighting layer to compute weighting matrix and summation, and a nonlinear layer to execute nonlinear activation function. In the photonic neural network, a programmable MZI mesh contains inner phase-shifters (marked with blue colour) and outer phase-shifters (marked with orange colour) to multiply optical signal from input layer by an assigned weight value and sum over it. Following MZI mesh, nonlinear accelerators apply nonlinear activation functions to the output of the MZI mesh. By repeating such combination of MZI mesh and nonlinear accelerators, photonic neural network achieves in-situ ANN computation with a large number of nodes and connections. The diagram shown in Supplementary Fig. S1 visualises the performance of the photonic neural network equipped with nonlinear accelerators (“PIC + nonlinear accelerator”) against other acceleration architectures for the performance benchmark on ANN acceleration^29,30. It can be intuitively and conveniently identified that photonic neural network equipped with nonlinear accelerators has better overall performance than other computation architectures, including CPU, GPU, FPGA, ASICs and PIC.

**Fig. 1: Photonic neural network architecture.**

MZI-mesh based weighting layer is configured with some 2 × 2 MZIs as marked with a dash box in Fig. 1a. It has been demonstrated that MZI unit can perform all rotations in unitary group of degree two, U(2), by adjusting PCMs θ and φ^31,32. In this regard, any weighting matrix can be decomposed into the product of several U(2)s. Thus, MZI mesh is capable of adding any weighting matrix into optical input. The unitary transformation U(2) of MZI can be given by³³

$$U_{MZI} = \frac{1}{2}\left[ {\begin{array}{*{20}{c}} {e^{i\theta }\left( {e^{i\varphi } - 1} \right)} & {e^{i\theta }\left( {e^{i\varphi } + 1} \right)} \\ {i\left( {e^{i\varphi } + 1} \right)} & {1 - e^{i\varphi }} \end{array}} \right] = \left[ {\begin{array}{*{20}{c}} {u_{11}} & {u_{12}} \\ {u_{21}} & {u_{22}} \end{array}} \right]$$

(1)

where θ and φ are the phase shifts in PCMs (Fig. 1a). The detail of proposed nonlinear accelerator is shown in the magnified view of the nonlinear layer. It contains an optical coupler to split a fraction of light into the bent sub-waveguide from the main waveguide route, a micro-mirror to divert light into the top of sub-waveguide, a Opto-Resistive RAM Switch with MoS₂ switching material to capture the optical information in terms of optical power and incident wavelength, an electrical control unit (ECU) to drive Opto-Resistive RAM Switch and MZI simultaneously, and a MZI with PCM to achieve a feedback loop modulating the light passing through the main route. The principle of its operation will be explained later in the article. There is no need for extra footprint space for control unit compared with other methods introduced above, since control unit is small enough that can just occupy gaps within photonic network. Here, Opto-Resistive RAM Switch, integrated with micro-mirror, plays a key role in the accelerator function. The schematic of Opto-Resistive RAM Switch is shown in a detailed view in Fig. 1b. Opto-Resistive RAM Switch consists of an ITO-MoS₂-Au sandwich-like structure (Supplementary Fig. S2).

Opto-Resistive RAM Switch characteristic

Opto-Resistive RAM Switch employs solution-processed MoS₂ switching material, which is a film spin-coated on the bottom electrode from a MoS₂ high-concentrated ink. The ink is prepared through ion-intercalation-driven exfoliation of a MoS₂ bulk. However, MoS₂ should meet requirements on thickness (1–5 nm) and roughness (≤2 nm) to avoid excessive driving voltage and optical loss and should enable incident-angle-independent absorption at certain wavelength. Surface morphology of stack of 2D MoS₂ sheets measured using Atomic Force Microscopy (AFM) is shown in Supplementary Fig. S3. AFM-image demonstrates that MoS₂ film has low roughness of 1.2 nm, which meets the low refraction loss requirement of fabricating Opto-Resistive RAM Switch³⁴. This MoS₂ synthesis technology allows Opto-Resistive RAM Switch to be fabricated on the top of sub-waveguide. The Raman spectra, collected from MoS₂ film on SiO₂/Si substrate, shows strong peaks at 383.5 cm⁻¹ (E¹_2g) and 408.2 cm⁻¹ (A_1g) (Supplementary Fig. S4), which are consistent with previous reports^35,36, and it indicates the multi-layered structure of the MoS₂ 2D sheets. Moreover, MoS₂ exhibits incident-angle-independent absorption of light at wavelengths <600 nm (Supplementary Figs. S5–6). The analyses above raise a possibility of integrating Opto-Resistive RAM Switch with integrated photonic circuit. For accurate resistance switching characterization, an Opto-Resistive RAM Switch device is prepared on SiO₂/Si substrate.

Figure 2a, b shows the bipolar resistance switching characteristics of Opto-Resistive RAM Switch activated by different optical power of 520 nm and 405 nm guided light, respectively. For the typical current-voltage (I-V) measurement without light input (orange lines in Fig. 2a, b), a DC voltage is applied to the Au top electrode and the ITO bottom electrode is grounded. During the voltage sweep from 0 to 3 V, an obvious abrupt increase of current can be observed while applied voltage reaches a threshold voltage, which is defined as V_SET (e.g. V_SET ≈ 2.7 V without illumination), and Opto-Resistive RAM Switch is switched from high resistance state to low resistance state. In the reversed sweep, negative voltage (−2.2 V) makes Opto-Resistive RAM Switch completely return to high resistance state, termed as RESET process. The V_SET signifies that at this voltage the electrical resistance state of Opto-Resistive RAM Switch, with capacity of non-volatile memory, can be changed as previously reported resistance switching devices^37,38. This switching characteristic is conducted under different optical power with a fixed wavelength irradiance as shown in Fig. 2a, b. The light is absorbed in the MoS₂ material after transmission through bottom ITO electrode, as the photon energy of 2.38 eV and 3.06 eV are larger than the bandgap of MoS₂ material at room temperature (1.29–1.88 eV). Carrier concentration increases with increasing optical power that leads to the increase of high resistance state current with fixed wavelength (inset in Fig. 2a). Remarkably, during the SET process, V_SET steadily decreases from 2.7 to 0.6 V with the increased optical power from 70.7 to 282.9 pW·μm⁻² at 520 nm wavelength, followed by a saturation of V_SET. The similar phenomenon can be observed for 405 nm wavelength illumination as shown in Fig. 2b: V_SET declines from 2.7 to 1.2 V with increased optical power from 0 to 70.7 pW·μm⁻² before a saturation of V_SET. This effect related to input optical power is summarized in Fig. 2c, d for 520 nm and 405 nm, respectively, and it can be fitted perfectly in straight line with high coefficient of determination (R²), 0.9635 and 0.9994 for 520 nm and 405 nm respectively. This linear relationship can be expressed as,

$$V = kP_{abs} + b$$

(2)

where k is the slope, P_abs is absorbed optical power of Opto-Resistive RAM Switch, and b is the intercept. This allows the optical power to be converted into the electrical signal (V_SET) linearly. As for the working function in the process of the acceleration, the response of Opto-Resistive RAM Switch is nonlinear since briefly it is a sudden change of output in terms of current, which is a necessary signal driving the accelerator. Thus, Opto-Resistive RAM Switch’s optical characteristic is unique and different from normal photodetectors^39,40, which detect and convert the optical power into current in a linear way. The unique characteristic of our Opto-Resistive RAM Switch is critical to to the realization of the nonlinear activation accelerator.

**Fig. 2: Electrical characteristic of Opto-Resistive RAM Switch.**

As discussed above, the frequent access to nonlinear activation accelerator requires that Opto-Resistive RAM Switch can maintain its switching characteristic in many cycles. Furthermore, the resolution (R) of Opto-Resistive RAM Switch depends on the variation of its characteristic at each optical power input, which is defined as bellow,

$$\begin{array}{l}R = \mathop {{{{{\mathrm{argmax}}}}}}\limits_n \left| {\left\{ {V_1,V_2,V_3,V_4, \ldots V_n \in V_r} \right\}} \right|,V_i \cap V_j\\\quad = \emptyset ,{{{\mathrm{i}}}}\, \ne \,{{{\mathrm{j}}}} \,\le\, {{{\mathrm{n}}}}\end{array}$$

(3)

where |x| represents the number of elements in a set x. V_i means the V_SET variation of the i^th input power state, and V_r corresponds to the range of possible V_SET. To maximize the power perception resolution, the variation of V_SET at each optical power input should be as small as possible. Cycle-to-cycle evaluation of the Opto-Resistive RAM Switch at room temperature has been carried out. As shown in Fig. 2e–h, the Opto-Resistive RAM Switch exhibits stable and uniform switching over 200 cycles with negligible cycle-to-cycle variation in resistance states and switching voltages under both dark (Fig. 2g) and light circumstances (Fig. 2h) Moreover, the variation of V_SET ranges from 0.03 to 0.08 V for different optical input power, which means Opto-Resistive RAM Switch can differentiate up to 39 optical power independent states. Fig. 2i shows the comparison of switching characteristic for different input wavelength but with the same optical power at 70.7 pW·μm⁻². Obviously, higher input photon energy induces lower V_SET and smaller switching window.

Opto-Resistive RAM Switch operation mechanism

The resistance switching characteristic and optical response are contributed to the vacancy migration and photon-induced heat generation. The resistance switching processes are explained in Fig. 3a–c and corresponding energy band diagrams at different states are shown in Fig. 3d–f. For the MoS₂ solution-processed material, sulphur vacancies are created at the edge of each 2D sheets during solution-exfoliation process as evidenced by our previous work⁴¹. The electron affinity of MoS₂ is around 3.0 eV⁴², lower than work functions of Au and ITO (5.1 eV and 4.7 eV, respectively)⁴³, leading to the formation of Schottky barrier contacts on both interfaces of Au/MoS₂ and MoS₂/ITO. In this case, only few electrons can pass over or tunnel through the barrier and no sulphur vacancies filament is formed. In the SET process, the external bias reduces the width and height of Schottky barrier and therefore increases the electron thermal emission and tunnelling probability, resulting in the improved current. Simultaneously, the positively charged sulphur vacancies migrate along the edge of MoS₂ sheets under voltage bias, bridge the top and bottom electrodes and finally form a conducive path across the MoS₂ layers. The resistance states transit from the high resistance state to low resistance state due to much increased tunnelling electrons with higher vacancy defect concentration (quasi-continuous defect level) in the pathway. For photon-response behaviour of Opto-Resistive RAM Switch, by absorbing photons in the interfaces, photoelectric effect creates electron-hole pairs, and the generated electrons are excited into sulphur vacancies defect level and conductance band in the room temperature. Besides, photogating effect that originates from trapped photogenerated electrons can further lower the Schottky barriers⁴⁴. Thus, under illumination, the current increases with increasing carrier concentration (3.3 times as shown in the inset of Fig. 2a) and it produces more heat from joule heating. Current-induced Joule heating and optical power dissipation accelerate the sulphur vacancies movement to form the defect level with higher concentration. It reduces the dependency on external bias and thus V_SET decreases under illumination.

**Fig. 3: Mechanism of Opto-Resistive RAM Switch.**

Accelerator structure based on Opto-Resistive RAM Switch

Due to the ability of photon-sensitive nonlinear switching, Opto-Resistive RAM Switch plays an important role in photon-electron communication in the nonlinear accelerator. Schematically the accelerator structure shown in Fig. 1b can be represented by Fig. 4a, where the grey lines and black lines represent optical waveguides and electrical pathways, respectively. At the beginning, optical signal propagating through MZI (P_sub) enters a directional coupler which couples a portion (β) of signal into Opto-Resistive RAM Switch through bent sub-waveguide. The Opto-Resistive RAM Switch absorbs the light with absorption coefficient (α) and switches the resistance at V_SET, which is an indicator of the P_abs with linear relationship. Here, we assume input optical signal is with electric field intensity (E) and the corresponding optical power is given by

$$P = \frac{{ab}}{4}E^2\frac{1}{{Z_{TE}}}$$

(4)

$$Z_{TE} = \frac{{\upeta }}{{\sqrt {1 - \left( {\lambda /\lambda _C} \right)^2} }}$$

(5)

$${\upeta} = \sqrt {\mu /\varepsilon }$$

(6)

$$\lambda _C = 2a$$

(7)

where a and b are width and depth of the rectangular waveguide respectively, ε is dielectric constant, μ is magnetic permeability. The voltage driving Opto-Resistive RAM Switch is provided by electrical control unit, whose circuit constitution is given by Fig. 4b. Positive (V1) and negative (V2) power supplies power the Opto-Resistive RAM Switch through a reversed switch-pair, constituted by a PMOS transistor (T1) and a NMOS transistor (T2), after a specified RC delay (τ = R1C1, where τ is RC time constant). Next, it is followed by a trans-impedance amplifier (U1) to convert current into voltage, a hysteresis comparator (U2) to judge the state of Opto-Resistive RAM Switch (low or high resistance state), and a voltage reverser (U3). Initially increasing voltage V_C1 is applied to Opto-Resistive RAM Switch with T1 on and T2 off, and while the current of Opto-Resistive RAM Switch (I_ORS) suddenly increased due to the Opto-Resistive RAM Switch switching under illumination, output voltage of U3 reverses and induces T1 off and T2 on. In this case, V_C2 starts to be pulled down by V2. Besides, simultaneously, another route generates a pulse activated by reversed output of U3 through a specified RC delay (τ = R7C2) and a comparator (U4). This pulse opens one transistor switch (T3) within the pulse time to “read” the maximum voltage of V_C1 (V_SET) using a voltage follower (U5) and this V_SET is applied back to PCM on one arm of MZI to modulate the light go through the main route. The electrical modulation of MZI can be calculated as

$$\vec E_o = \frac{{\vec E_I}}{2}\left( {e^{ - j\left( {\frac{{\pi V}}{{V_\pi }}} \right)} + e^{ - j\delta }} \right)$$

(8)

$$V_\pi = \frac{\lambda }{{n^3}}\frac{1}{r}\frac{d}{L}$$

(9)

where $\vec E_I$ and $\vec E_o$ are the input and output electrical fields of MZI respectively and V_π is the half-wave voltage, which causes phase change π of phase shifter. And λ is the input wavelength, n is the corresponding refractive index, r is electro optic coefficient, L is the length of interferometric arms and d is the thickness of PCM. Combining the expressions above, the mathematical form of nonlinear activation function achieved by nonlinear accelerator can be written explicitly as

$$P_O = \frac{{P_I}}{2}cos^2\left( {\frac{{\frac{{\pi \left( {k\alpha \beta P_o + b} \right)}}{{V_\pi }} + \delta }}{2}} \right)$$

(10)

**Fig. 4: Accelerator structure and function.**

To explain the process of such runtime architecture intuitively, time-series diagram is plotted in Fig. 4c and Supplementary Fig. S7. While V_C1 increases before reaching at V_SET (t) (Fig. 4d), V_MZI (t-1) is applied constantly to PCM. Until V_SET changes the state of Opto-Resistive RAM Switch, V_MZI (t-1) suddenly turns into V_MZI (t) controlled by one pulse of V_U4. Subsequently, it is followed by a decreasing V_C1 (Fig. 4e) to V_RESET, at which Opto-Resistive RAM Switch switches back from low resistance state to high resistance state but V_MZI (t) is still held until next cycle of resistance switching in Opto-Resistive RAM Switch (Fig. 4f). As shown in Fig. 4c, a perfect response of input optical signal in several loops can be viewed, and such nonlinear accelerator easily satisfies one important requirement for photonic neural network: response frequency (voltage sweeping frequency) must be higher than optical signal changing frequency, since the voltage sweeping frequency depends on controllable R1C1 delay. The formula for sweeping voltage is given by

$$V_{C1} = \left\{ {\begin{array}{*{20}{c}} {(V_1 - V_2)\left( {1 - e^{\frac{{ - t}}{{R1C1}}}} \right) + V_2,V_{RESET} < V_{C1 \uparrow } < V_{SET}} \\ {(V_2 - V_1)\left( {1 - e^{\frac{{ - t}}{{R1C1}}}} \right) + V_1,V_{RESET} < V_{C1 \downarrow } < V_{SET}} \end{array}} \right.$$

(11)

Moreover, a benefit of having an adjustable PCM (δ) in another arm of MZI as shown in Fig. 4a is that, in principle, this nonlinear accelerator can be programmed to synthesize different activation functions. Figure 4g–i show various nonlinear activation functions, sigmoid, softplus and clamped rectified linear unit (ReLU), at different initial δ values. Notably, every loop in Fig. 4c corresponds to different states of nonlinear function in Fig. 4g. This reconfigurability opens up the possibility of selecting suitable nonlinear functions for different specific tasks and distinguishes this method from previous nonlinear function approaches^20,45.

Discussion

To validate the functionality of the proposed nonlinear accelerator, a fully connected photonic neural network using Opto-Resistive RAM Switch-based nonlinear accelerator is implemented in the simulation. The schematic of this network for the MNIST handwritten digits classification task is shown in Fig. 5a. This MNIST dataset contains 70,000 greyscale images with 28 × 28 pixel, which is a representative database for neural network model training.

**Fig. 5: Photonic neural network with nonlinear accelerator.**

To reduce the input data dimension, Fast Fourier Transform (FFT) and edge-removal are used to convert real images into k-space images. The FFT of 2D image is given by the following equation

$$F\left( {k_x,k_y} \right) = \mathop {\sum}\limits_{m = 0}^{M - 1} {\mathop {\sum}\limits_{n = 0}^{N - 1} {f(m,n)e^{ - j2\pi \left( {k_x\frac{m}{M} + k_y\frac{n}{N}} \right)}} }$$

(12)

where F(k_x, k_y) is the value of the images in frequency domain corresponding to the coordinates k_x and k_y, f(m, n) is the real pixel at coordinates (m, n), and M and N are the dimensions of the image. The dimension of images is unchanged (28 × 28) after FFT, and the features of images experience centralization since FFT represents spatial frequency distribution of grey level gradients with the lowest frequency in the centre and the highest frequency at four corners. Afterwards, removal of fours edges in each image reduces the dimension from 28 × 28 into 8 × 8 but preserves most of frequency features. The reasons for using FFT include not only dimensionality reduction but also the feasibility of FFT in integrated photonics^46,47.

At the input of photonic neural network using Opto-Resistive RAM Switch-based accelerator, input images in a form of 8 × 8 pixel array are reconfigured into 64 × 1 array. This photonic neural network starts from several staggered weighting layers (WL) and nonlinear layers (NL) to drop layer (DL), which maps 64 inputs into 10 outputs for ten dights recognition. At the end, photodetectors (PD) convert optical signal into electrical signal for backpropagation calculation, which will optimize weighting layers in the training process. It is worth mentioning, here, the nonlinear layer adopts softplus nonlinear function as shown in Fig. 4h. On account of using nonlinear accelerator, this photonic neural network architecture is more efficient and simplified compared with other photonic neural networks in previous works¹⁶, which consume more energy and generate more delay during optical-to-electrical and electrical-to-optical conversions. And the previous methods are limited by on-chip space or complexity of network connection with CPU. Specifically, compared with previous methods for nonlinear activation function, our accelerator reduces the average power consumption by 20.2× and shrinks the footprint by around 40%.

To observe the dependence of recognition accuracy on the layer number, Fig. 5b shows the testing accuracy of the photonic neural network with different number of weighting-nonlinear layers. The accuracy reaches a peak at 91.6% with 5 weighting-nonlinear layers. The corresponding loss has an abrupt dropdown, equivalently fast iteration, before 50 epochs with a batch size of 500 in network training as shown in Fig. 5c. The confusion matrix for 5-layer photonic neural network computed over the testing dataset (Fig. 5d) shows the correct prediction for each digit image. Overall, these demonstrate the possibility of accelerating photonic neural network using proposed Opto-Resistive RAM Switch-based nonlinear accelerator.

This nonlinear accelerator based on MoS₂ Opto-Resistive RAM Switch provides a promising approach for the realization of in-situ photonic neural network. Meanwhile, its simple architecture, low energy consumption and small chip size make it practical to have a wide field of application with good prospects. It can be further extended into the acceleration of more types of neural network that in photonics there has been a number of research works about, such as convolutional neural networks⁴⁸, recurrent neural networks⁴⁹ and long short term memory networks⁵⁰. Moreover, with the incorporation of Wavelength Division Multiplexing technology, it may be capable of computing with high parallelism using different wavelengths, as shown in Fig. 2i.

In conclusion, we have developed a programmable nonlinear accelerator based on Opto-Resistive RAM Switch, which consists of solution-processed MoS₂. By cleverly leveraging the linear relationship that exists between the input optical power and the voltage that leads to abrupt resistance switching, Opto-Resistive RAM Switch proves the advantage of having the unique functionality to perform as a nonlinear switch that is critical to the functionality of the accelerator, compared to typical photonic components, like photodetector. Using this novel Opto-Resistive RAM Switch, our proposed nonlinear accelerator offers remarkable flexibility to use, because it allows generation of different nonlinear activation functions programmatically. The implementation of our nonlinear accelerator surpasses the limitation of outsourced nonlinear activation functions and achieves a comparable classification accuracy and fast iteration on an in-situ fully connected photonic neural network for MNIST classifier application. On the other hand, from a viewpoint of architecture, our nonlinear accelerator has the potential to significantly outperform the previous nonlinear activation architectures in terms of energy efficiency and complexity. In addition, it is very compact with small footprint. It paves the way for promising in-situ photonic neural network with ultra-high computation speed and parallelism.

Materials and methods

Solution-processed MoS₂ preparation

High-quality semiconducting MoS₂ nanosheets were fabricated with an electrochemical intercalation assisted exfoliation method⁵¹. Subsequently, the exfoliated MoS₂ nanosheets were dispersed in isopropanol to obtain the final MoS₂ ink, which as used for device fabrication.

Opto-Resistive RAM Switch fabrication and characterization

Solution-processed MoS₂ is spin-coated on p-Si wafer with 90 nm SiO₂ layer, followed by electron beam lithography and rapid thermal annealing. The surface height image is characterized by Atomic Force Microscopy and the Raman spectroscopy. ITO (40 nm) was deposited by sputtering system followed by lithography patterning and ICP-RIE etching to form electrodes. Top Au electrode (40 nm) is formed by electron beam photolithography and deposition using electron beam evaporator followed by lift-off process. The electrical and optical measurements were conducted by Agilent parameter analyzer B1500A and Lakeshore Cryogenic probe station with fixed-wavelength lasers.

Accelerator and photonic neural network simulation

Accelerator architecture function is analysed using co-simulation of Cadence PSpice design tool and Synopsys OptSim platform. The Neuroptica Python package is used for photonic neural network simulation. In MNIST digit classification task, input port number of MZI mesh is set to 64.

Data availability

The main data supporting the findings of this study are available within the article. Extra data are available from the corresponding authors on reasonable request.

References

Abiodun, O. I. et al. State-of-the-art in artificial neural network applications: a survey. Heliyon 4, e00938 (2018).
Article Google Scholar
Abiodun, O. I. et al. Comprehensive review of artificial neural network applications to pattern recognition. IEEE Access 7, 158820–158846 (2019).
Article Google Scholar
Wang, Y. H., Ribeiro, J. M. L. & Tiwary, P. Past–future information bottleneck for sampling molecular reaction coordinate simultaneously with thermodynamics and kinetics. Nat. Commun. 10, 3573 (2019).
Article ADS Google Scholar
Zhang, J. S. et al. Penetrating the influence of regularizations on neural network based on information bottleneck theory. Neurocomputing 393, 76–82 (2020).
Article Google Scholar
Ambrogio, S. et al. Equivalent-accuracy accelerated neural-network training using analogue memory. Nature 558, 60–67 (2018).
Article ADS Google Scholar
Merolla, P. A. et al. A million spiking-neuron integrated circuit with a scalable communication network and interface. Science 345, 668–673 (2014).
Article ADS Google Scholar
Benini, L. & De Micheli, G. System-level power optimization: techniques and tools. ACM Trans. Des. Autom. Electron. Syst. 5, 115–192 (2000).
Article Google Scholar
Guo, K. Y. et al. [DL] A survey of FPGA-based neural network inference accelerators. ACM Trans. Reconfigurable Technol. Syst. 12, 2 (2019).
Article Google Scholar
Schaller, R. R. Moore’s law: past, present and future. IEEE Spectr. 34, 52–59 (1997).
Article Google Scholar
Won, R. Integrating silicon photonics. Nat. Photonics 4, 498–499 (2010).
Article Google Scholar
Solli, D. R. & Jalali, B. Analog optical computing. Nat. Photonics 9, 704–706 (2015).
Article ADS Google Scholar
Ying, Z. F. et al. Electronic-photonic arithmetic logic unit for high-speed computing. Nat. Commun. 11, 2154 (2020).
Article ADS Google Scholar
Abu-Mostafa, Y. S. & Psaltis, D. Optical neural computers. Sci. Am. 256, 88–95 (1987).
Article Google Scholar
Xu, S. F., Wang, J. & Zou, W. W. Optical patching scheme for optical convolutional neural networks based on wavelength-division multiplexing and optical delay lines. Opt. Lett. 45, 3689–3692 (2020).
Article ADS Google Scholar
Mehrabian, A. et al. A winograd-based integrated photonics accelerator for convolutional neural networks. IEEE J. Sel. Top. Quantum Electron. 26, 6100312 (2020).
Article Google Scholar
Shen, Y. C. et al. Deep learning with coherent nanophotonic circuits. Nat. Photonics 11, 441–446 (2017).
Article ADS Google Scholar
Ma, W. et al. Deep learning for the design of photonic structures. Nat. Photonics 15, 77–90 (2021).
Article ADS Google Scholar
Sui, X. B. et al. A review of optical neural networks. IEEE Access 8, 70773–70783 (2020).
Article Google Scholar
Williamson, I. A. D. et al. Reprogrammable electro-optic nonlinear activation functions for optical neural networks. IEEE J. Sel. Top. Quantum Electron. 26, 7700412 (2020).
Article Google Scholar
Peng, H. T. et al. Neuromorphic photonic integrated circuits. IEEE J. Sel. Top. Quantum Electron. 24, 6101715 (2018).
Article Google Scholar
Nahmias, M. A. et al. A leaky integrate-and-fire laser neuron for ultrafast cognitive computing. IEEE J. Sel. Top. Quantum Electron. 19, 1800212 (2013).
Article Google Scholar
Cai, W. S., Vasudev, A. P. & Brongersma, M. L. Electrically controlled nonlinear generation of light with plasmonics. Science 333, 1720–1723 (2011).
Article ADS Google Scholar
Yuen, B. et al. Universal activation function for machine learning. Sci. Rep. 11, 18757 (2021).
Article ADS MathSciNet Google Scholar
Xiang, S. Y. et al. A review: photonics devices, architectures, and algorithms for optical neural computing. J. Semiconductors 42, 023105 (2021).
Article Google Scholar
Vandoorne, K. et al. Experimental demonstration of reservoir computing on a silicon photonics chip. Nat. Commun. 5, 3541 (2014).
Article ADS Google Scholar
Shastri, B. J. et al. Photonics for artificial intelligence and neuromorphic computing. Nat. Photonics 15, 102–114 (2021).
Article ADS Google Scholar
Zarei, S., Marzban, M. R. & Khavasi, A. Integrated photonic neural network based on silicon metalines. Opt. Express 28, 36668–36684 (2020).
Article ADS Google Scholar
Amin, R. et al. ITO-based electro-absorption modulator for photonic neural activation function. APL Mater. 7, 081112 (2019).
Article ADS Google Scholar
Sunny, F. P. et al. A survey on silicon photonics for deep learning. ACM J. Emerg. Technol. Comput. Syst. 17, 61 (2021).
Article Google Scholar
Talib, M. A. et al. A systematic literature review on hardware implementation of artificial intelligence algorithms. J. Supercomputing 77, 1897–1938 (2021).
Article Google Scholar
Shokraneh, F., Nezami, M. S. & Liboiron-Ladouceur, O. Theoretical and experimental analysis of a 4 × 4 reconfigurable MZI-based linear optical processor. J. Lightwave Technol. 38, 1258–1267 (2020).
Article ADS Google Scholar
Reck, M. et al. Experimental realization of any discrete unitary operator. Phys. Rev. Lett. 73, 58–61 (1994).
Article ADS Google Scholar
Miller, D. A. B. Meshing optics with applications. Nat. Photonics 11, 403–404 (2017).
Article ADS Google Scholar
Guo, C. et al. Simultaneous determination of optical constants, thickness, and surface roughness of thin film from spectrophotometric measurements. Opt. Lett. 38, 40–42 (2013).
Article ADS Google Scholar
Luo, R. C. et al. Van der Waals interfacial reconstruction in monolayer transition-metal dichalcogenides and gold heterojunctions. Nat. Commun. 11, 1011 (2020).
Article ADS Google Scholar
Liang, L. B. & Meunier, V. First-principles Raman spectra of MoS₂, WS₂ and their heterostructures. Nanoscale 6, 5394–5401 (2014).
Article ADS Google Scholar
Zidan, M. A., Strachan, J. P. & Lu, W. D. The future of electronics based on memristive systems. Nat. Electron. 1, 22–29 (2018).
Article Google Scholar
Lin, C. Y. et al. Adaptive synaptic memory via lithium ion modulation in RRAM devices. Small 16, 2003964 (2020).
Article Google Scholar
Youngblood, N. et al. Waveguide-integrated black phosphorus photodetector with high responsivity and low dark current. Nat. Photonics 9, 247–252 (2015).
Article ADS Google Scholar
Bonaccorso, F. et al. Graphene photonics and optoelectronics. Nat. Photonics 4, 611–622 (2010).
Article ADS Google Scholar
Tang, B. S. et al. Wafer-scale solution-processed 2D material analog resistive memory array for memory-based computing. Nat. Commun. 13, 3037 (2022).
Article ADS Google Scholar
Baik, S. S., Im, S. & Choi, H. J. Work function tuning in two-dimensional MoS₂ field-effect-transistors with graphene and titanium source-drain contacts. Sci. Rep. 7, 45546 (2017).
Article ADS Google Scholar
Huang, P. R. et al. The origin of the high work function of chlorinated indium tin oxide. NPG Asia Mater. 5, e57 (2013).
Article ADS Google Scholar
Liu, C. Y. et al. Silicon/2D-material photodetectors: from near-infrared to mid-infrared. Light Sci. Appl. 10, 123 (2021).
Article ADS Google Scholar
Zuo, Y. et al. All-optical neural network with nonlinear activation functions. Optica 6, 1132–1137 (2019).
Article ADS Google Scholar
Sheridan, P. A method to perform a fast fourier transform with primitive image transformations. IEEE Trans. Image Process. 16, 1355–1369 (2007).
Article ADS MathSciNet Google Scholar
Ghani, H. A. et al. A review on sparse Fast Fourier Transform applications in image processing. Int. J. Electr. Computer Eng. 10, 1346–1351 (2020).
Google Scholar
Gu, J. X. et al. Recent advances in convolutional neural networks. Pattern Recognit. 77, 354–377 (2018).
Article ADS Google Scholar
Schuster, M. & Paliwal, K. K. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45, 2673–2681 (1997).
Article ADS Google Scholar
Rodriguez, P. et al. Deep pain: exploiting long short-term memory networks for facial expression classification. IEEE Trans. Cybern. 52, 3314–3324 (2022).
Article Google Scholar
Lin, Z. Y. et al. Solution-processable 2D semiconductors for high-performance large-area electronics. Nature 562, 254–258 (2018).
Article ADS Google Scholar

Download references

Acknowledgements

This work is supported by Agency for Science, Technology and Research (A*STAR), Singapore National Research Foundation’s Returning Singapore Scientist Scheme (NRF-RSS2015-003), Singapore under its AME Programmatic Funds (A1892b0026), National Research Foundation Grant RSS2015-003, the Singapore Hybrid-Integrated Next-Generation μ-Electronics (SHINE) Centre hosted at the National University of Singapore (NUS).

Author information

Authors and Affiliations

Integrative Sciences and Engineering Programme, NUS Graduate School, National University of Singapore, Singapore, Singapore
Zefeng Xu & Aaron Voon-Yew Thean
Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, Singapore, 117583, Singapore
Zefeng Xu, Baoshan Tang, Xiangyu Zhang, Jin Feng Leong, Jieming Pan, Sonu Hooda, Evgeny Zamburg & Aaron Voon-Yew Thean

Authors

Zefeng Xu
View author publications
You can also search for this author in PubMed Google Scholar
Baoshan Tang
View author publications
You can also search for this author in PubMed Google Scholar
Xiangyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jin Feng Leong
View author publications
You can also search for this author in PubMed Google Scholar
Jieming Pan
View author publications
You can also search for this author in PubMed Google Scholar
Sonu Hooda
View author publications
You can also search for this author in PubMed Google Scholar
Evgeny Zamburg
View author publications
You can also search for this author in PubMed Google Scholar
Aaron Voon-Yew Thean
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.X. and A.V.-Y.T. conceived the project and designed the experiments. Z.X., B.T., J.F.L., S.H. and E.Z. performed the device fabrication and characterization. Z.X. and X.Z. carried out the circuit design and simulation. Z.X. conducted architecture and neural network simulations. E.Z., J.P., B.T. and X.Z. contributed towards discussion and data interpretation. All authors are involved in the discussions and preparation of the manuscript.

Corresponding authors

Correspondence to Zefeng Xu or Aaron Voon-Yew Thean.

Ethics declarations

Competing interests

The authors declare no competing interests.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Xu, Z., Tang, B., Zhang, X. et al. Reconfigurable nonlinear photonic activation function for photonic neural network based on non-volatile opto-resistive RAM switch. Light Sci Appl 11, 288 (2022). https://doi.org/10.1038/s41377-022-00976-5

Download citation

Received: 23 February 2022
Revised: 15 August 2022
Accepted: 30 August 2022
Published: 06 October 2022
DOI: https://doi.org/10.1038/s41377-022-00976-5
Springer Nature Limited

This article is cited by

Graphene/silicon heterojunction for reconfigurable phase-relevant activation function in coherent optical neural networks
- Chuyu Zhong
- Kun Liao
- Hongtao Lin
Nature Communications (2023)

Reconfigurable nonlinear photonic activation function for photonic neural network based on non-volatile opto-resistive RAM switch

Abstract

Similar content being viewed by others

Programmable phase-change metasurfaces on waveguides for multimode photonic convolutional neural network

Nonlinear germanium-silicon photodiode for activation and monitoring in photonic neuromorphic networks

Lithography-free reconfigurable integrated photonic processor

Introduction

Results