Introduction

Advances in information technology have fueled rapid developments in AI, machine learning, big data, and the Internet of Things. Current growth rates in data traffic exceed 24% per year, while the data created and replicated are predicted to exceed 175 zettabytes (1 zettabyte = 1021 bytes) in 2025.1 Coping with data of this magnitude has so far been addressed by employing large-volume memory and high-speed processor clusters, a strategy that is proving unsustainable based on the projected requirements. The von Neumann architecture in particular, which makes up the foundation of modern computers, requires continuous data shuttling between the physically separated processing and memory units, a process where nearly two-thirds of the total energy is consumed.2

Inspired by biological neurons, in-memory computing architectures are gaining popularity, wherein data processing within or in the vicinity of memory units reduces latency and dissipation.3 Chalcogenide-based phase-change materials (PCMs) widely used in rewriteable compact discs (CDs) and digital video discs (DVDs), are particularly well suited to achieve this task because the memory states of the material are nonvolatile and can be read out both optically and electrically.4 For an example, in a CD, binary data are stored by switching the PCM between its two structural phases (i.e., the disordered amorphous state (top) and the ordered crystalline state (bottom) illustrated in Figure 1a). Phase transitions can be triggered by heating up the PCMs with optical or electrical pulse stimuli. It allows precise tailoring of the optical and electrical properties of the materials. In the electronic domain, in-memory computing is achieved by storing analogue conductance values in the PCM cells, which are commonly arranged in a crossbar array.5 Implementations in the electronic domain, however, suffer from limitations in bandwidth and data throughput.

Figure 1
figure 1

Optically and electrically programmable integrated phase-change photonic waveguide devices for nonvolatile photonic memory. (a) Illustration of phase transition between amorphous and crystalline states of Ge2Sb2Te5 (GST) induced by optical and electrical stimulus.6 (b) Three-dimensional schematic of an integrated phase-change photonic memory based on the ion-implanted silicon-on-insulator (SOI) waveguide platform. (c) Light transmission (wavelength at 1550 nm) along the SOI waveguides (top view) with 4-µm-long and 10-nm-thick amorphous and crystalline GST thin films on top. The 10-nm-thick indium tin oxide (ITO) capping is a protective layer to avoid oxidation.7 (d) Pulse scheme and transient thermal dynamics for writing (melt-quenching) and erasing (annealing) of a phase-change material (PCM) thin film on top of a SOI waveguide. Figures are reproduced from (a) Reference 6 with the permission of AAAS; (c) Reference 7 with the permission of Optica Publishing Group.

Addressing these limitations, photonic implementations of analogue in-memory photonic computing are projected to play a game-changing role in AI hardware. These systems leverage the ultrahigh bandwidth of optical interconnects, and data storage capacity of PCMs to collocate the memory and processing units and perform operations at the speed of light.8 Importantly, performing linear operations such as multiply accumulate (MAC) and matrix–vector multiplication (MVM) in the optical domain not only improves the speed, but also consumes considerably less energy.9 This results in favorable, sublinear scaling of energy consumption with respect to the number of operations for photonic hardware. Taken together, these properties lead to a significant increase in energy efficiency and compute power.

Working principle of optoelectronic waveguide memory cells

In the optical domain, transmission of light can be modulated by the structural phase transition of the PCM thin film deposited on top of an optoelectronic waveguide device (as shown in the schematic in Figure 1b).10 Specifically, a shallow-etched silicon photonic rib waveguide confines light with low transmission loss in its waveguide core, which consists of a silicon slab with a strip on it. However, a full-etched ridge waveguide only has a strip on top of the insulator layer. The PCM while nearly lossless in its amorphous state, becomes absorptive upon phase transition to the crystalline state. Figure 1c shows simulated light transmission through a silicon waveguide with 4-μm-long and 10-nm-thick amorphous Ge2Sb2Te5 (aGST) and crystallized Ge2Sb2Te5 (cGST) as the PCM.7 Here, optical transmission is defined as the ratio between output and input optical intensity, which decreases from 95% in the case of aGST to 26% with cGST. In this optical memory cell, the signal information is stored as the PCM structural phase.

To switch the structural state of the PCM thin film, optical pulses (red) in the waveguide/free-space and electrical pulses (yellow) are sent to the photonic waveguide memory cell. When an optical pulse is sent, the PCM thin film absorbs the optical energy and generates heat. When an electrical pulse is sent instead, the net current from the electrical pulse passes through an ion-implanted resistive region in a silicon waveguide, and thus generates heat.11 Upon phase transition by pulse heating, the signal modulation (attenuation) induced by the different PCM states can be applied to mimic the workings of a biological synapse, which likewise, contain both synaptic plasticity modulation and memory functions.12 The PCM states are highly stable for years at room temperature.10,13 Generally, a short and high-amplitude optical/electrical pulse is used to melt-quench (i.e., temperature sharply rising above the melting temperature (Tm) and quickly dissipates to room temperature) the PCM to an amorphous state as shown in Figure 1d. In contrast, a longer and lower amplitude optical/electrical pulse is used to anneal the PCM with temperature ramping up between Tm and transition temperature (Tx) for a certain period (to recrystallize the atomic lattice), before finally ramping the temperature down to room temperature. Specifically, Tx is 620–660 K with pulse time duration from 1 μs down to 100 ns and Tm is ~900 K for GST.14

Optical programming of PCM waveguide devices

In 2012, the concept of nonvolatile photonic memories was first proposed.15,16 In 2013, Rudé et al. first demonstrated an on–off switch based on a silicon racetrack resonator by far-field free-space optical pulse switching of a GST patch on the ring as shown in Figure 2a.17 In an amorphous (crystalline) state, light is on (off) resonance and is filtered by (pass) the ring. Due to flexible programming on an arbitrary location, this approach based on free-space switching is widely used for patterning PCM metasurface devices for manipulating light scattering, deflection, and focusing.18,19 Recently, Delaney et al. demonstrated a reprogrammable 1 × 2 waveguide power splitter using digital patterning of ultralow-loss PCMs Sb2S3 and Sb2Se320 as shown in Figure 2b.21 By patterning phase states of a PCM, light propagating in the waveguide can be either equally split or routed to one of the output ports, which is promising for reconfigurable photonic circuits. However, a portion of the pump pulse passes through the thin PCM film in far-field free-space programming, which leads to relatively large energy consumption. For an example, energy of an amorphization pulse is at least 900 pJ.17

Figure 2
figure 2

(a, b) Free-space optical programming of an on–off switch based on a silicon racetrack resonator (a)17 and a 1 × 2 power splitter (b).21 (c–f) Waveguide optical programming with a scanning electron microscope image of device (c),10 picosecond ultrafast switching (d),22 5-bit multilevel operation (e),23 and million-level cycling endurance (f).24 Among them, one of the lowest energy consumptions for an optical switching event is 60 pJ.24 Figures are reproduced from (a) Reference 17 with the permission of AIP Publishing; (b) Reference 21 with the permission of AAAS; (c) Reference 10 with the permission of Springer Nature; (d) Reference 22 with the permission of Wiley; (e) Reference 23 with the permission of Optica Publishing Group; (f) Reference 24 with the permission of Springer Nature. MMI, multimode interference.

In contrast to free-space optical switching, sending pump pulses along the waveguide leads to in-plane light–PCM interaction with perfect absorption and reduced programming energies. In 2015, Ríos et al. developed an integrated multilevel waveguide memory (Figure 2c) using this optical near-field effect with switching energies as low as 13.4 pJ and speeds approaching 1 GHz.10 This fully integrated approach is more robust and reliable for practical applications without movable and bulky free-space optical components (deflection mirrors and lens) for focusing the pump pulses. Figure 2d shows switching dynamics of an amorphization event in an add-drop ring resonator trigged by a single 1-ps optical pulse. Rapid phase transition within 250 ps is probed.22 Multilevel programming was performed using pulse amplitude modulation, and a 5-bit (over 32 unique levels) nonvolatile photonic memory was demonstrated.23 Note that a 6-bit waveguide memory was recently demonstrated using phase-change waveguide metasurfaces.25 It also shows over 1-million cycling endurance.24 These examples indicate that waveguide optical programming provides ultrafast, robust, multilevel, and energy-efficient operations.

Electrical programming of PCM waveguide devices

In addition to optical programming, electrical stimulus can also induce phase transition of PCMs based on Joule heating. In 2019, Zhang et al. developed an ion-implanted silicon waveguide microheater with a 1-μm-wide P++ heavily doped strip region (black color) as shown in Figure 3a.11 It shows 300% on–off switching contrast, and energy consumption of 10 nJ and 9 nJ for the amorphization and crystallization processes, respectively. The advantage of a microheater is electrical switching of low-loss Ge2Sb2Se4Te1 (GSST)26 and lossless PCMs (Sb2S3 and Sb2Se3),20 which is not easily achievable using optical switching due to extremely low absorption of pump light by PCMs. Figure 3b shows a reprogrammable and nonvolatile phase shifter based on a N-type-doped silicon waveguide microheater.27 It shows phase modulation up to 0.09 π/µm and a low insertion loss of 0.3 dB/π. A PIN diode microheater as shown in Figure 3c is developed to largely reduce insertion loss to 0.02 dB/μm because light is mainly confined in the intrinsic silicon region (i-Si) to avoid light absorption by free carriers.28 Energy consumptions are 650 nJ and 11 nJ for the amorphization and crystallization processes, respectively. As for other waveguide platforms (e.g., SiC, Si3N4, LiNbO3) with large bandgaps, microheaters can be achieved by capping resistive ITO29 and graphene strips30 on top of waveguides to heat up PCMs as shown in Figure 3d–e, respectively. A recent study predicts that graphene microheaters display two orders of magnitude higher figure of merits (FOMs) for heating and overall performance compared with those of ITO and silicon PIN microheaters.31

Figure 3
figure 3

(a–c) Electrical programming of phase-change materials based on ion-implanted silicon waveguide microheaters with P++ doping, DGST is diameter of a GST disk on top of a waveguide microheater (a),11N-type doping (b),32 and a PIN diode structure (c).28 (d, e) Waveguide memories based on indium tin oxide (ITO)29 and graphene microheaters.30 In Reference 11, the authors demonstrated electrical switching of a nanosized phase-change cell showing one of the lowest energy consumptions with a 9-nJ crystallization pulse. Figures are reproduced from (a) Reference 11 with the permission of ACS Publications; (b) Reference 32 with the permission of the authors; (c) Reference 28 with the permission of Wiley; (d) Reference 29 with the permission of Wiley; (e) Reference 30 with the permission of Wiley. GST, Ge2Sb2Te5; GSST, Ge2Sb2Se4Te1

Mixed-mode waveguide memory cells

To be programmed and probed both optically and electrically, plasmonic nanogap devices support mixed-mode operation as a nonvolatile optoelectronic interface.6 Plasmonic nanogap devices in particular have been used to locally enhance light–matter interactions in the phase-change cell, thereby combining low energy programming and a small footprint. Here, the state of the PCM can be programmed using optical or electrical pulses and can be further read simultaneously in the optical and electrical domains. Its working principle is detailed in Figure 4a. Optical programming of the mixed-mode memory cell is based on the absorption of optical pulses by the PCM. Tailoring the fraction of amorphous-to-crystalline volume of the PCM results in a nonvolatile change in electrical resistance and optical transmission. In electrical programming, a conductive filament (cPCM) in aPCM element may be either formed or broken between the two gold pads based on either the threshold switching or Joule heating and subsequent melt-quenching of the cGST filament.33 Figure 4b shows the 3D schematic of a plasmonic memory cell interfaced with input and output tapered photonic waveguides for enhanced optical signal delivery. A GST patch (blue) is placed between a 50-nm-wide gold nanogap. Switching energy required to initiate a phase transformation in the GST cell can be largely reduced due to extreme light intensity enhancement in the plasmonic nanogap due to excitation of surface plasmons polaritons (i.e., coupling of light with collective oscillations of the electrons at the surface of a metal). The size of the active region is 50 nm × 50 nm in cross section. The gold nanogap also serves as electrical connections to probe the resistance change in the GST. As such, it supports mixed-mode operation, wherein electrical and optical signals can be employed to pump (switch) and probe (read) the device, with any combination of the above. Optical switching threshold using the approach is only (16 ± 2) pJ. Figure 4c shows a mixed-mode device based on a GeTe nanowire suspended on top of a Si3N4 waveguide. GeTe nanowire can be optically switched using near-field evanescent coupling and probed both optically and electrically.34 Figure 4d shows a VO2-based electro-optic modulator with optical and electrical probing of the semiconductor-to-metal phase transition in VO2.35

Figure 4
figure 4

Mixed-mode waveguide devices. (a) Principle of mixed-mode programming and signal readout.6 (b) Three-dimensional schematic of a plasmonic nanogap device with a Ge2Sb2Te5 embedded between two Au contact pads. (c) Scanning electron microscope image of a mixed-mode device based on the GeTe nanowire.34 (d) Au nanogap on top of a vanadium dioxide (VO2) waveguide device.35 Optical switching threshold in a plasmonic nanogap device is only (16 ± 2) pJ.6 Figures are reproduced from (a, b) Reference 6 with the permission of AAAS. (c) Reference 34 with the permission of ACS Publications; (d) Reference 35 with the permission of ACS Publications.

Performance comparison of PCM devices

Table I summarizes performances of the state-of-the-art PCM devices using optical, electrical, and mixed-mode programming. Note that energy consumption can be reduced from nJ to pJ level using near-field waveguide optical switching instead of far-field free-space optical switching. One of the lowest energy consumptions (60 pJ/190 pJ) is achieved by using ps pump pulses with a relatively high switching contrast (20%) and a programming speed approaching GHz.24 A mixed-mode nanogap device offers one of the lowest switching thresholds (16 ± 2 pJ).6 Thus, performances of photonic memory cells may be further improved by exploiting both plasmonic enhancement and ultrashort optical pulse programming. Electrical programming is also promising for achieving large contrasts (300%) with energy consumption of a few nJ.11 In this scenario, a large area of PCM can be switched by a single electrical pulse. For an example, a patterned PCM metasurface with an area up to 0.4 × 0.4 mm2 was almost uniformly heated up and switched back and forth using microheaters.36 Another advantage of using microheaters is flexible programming of low-loss/lossless PCMs (GSST, Sb2S3, Sb2Se3).

Table I Experimentally characterized performances of the state-of-the-art PCM devices.

In-memory photonic tensor core

Based on the previously discussed energy-efficient optoelectronic waveguide memory cells, the development of large-scale memory chips is more promising as these photonic memory cells can be used to implement in-memory computing, such as a photonic tensor core for performing multiply accumulate (MAC) operations and matrix–vector multiplications (MVMs) using photonic integrated circuits. A matrix with m × n weight banks can be represented on a hardware in the form of a photonic crossbar array (illustrated in Figure 5a).37 Output pulse intensity (c = a × b) through a PCM can be calculated using multiplication between PCM transmittance (a) and input pulse intensity (b).38 Collected light intensity (Yi) at each output port using the incoherent superposition is basically a series of MAC operations (i.e., a dot product of a vector (X1, X2, …, Xm) encoded in the input pulse intensities and a vector (a1i, a2i, …, ami) (kernel i) stored in the transmission value of the memory cells). Thus, MAC and MVM operations are achieved by collecting summation of modulated optical intensity at each output port and all output ports of a crossbar array, respectively. Four sets of vectors (X1, X2, X3, X4) (labeled with different colors) can be encoded on light intensities with 16 discrete wavelengths. At the output ports of the tensor core, wavelength demultiplexers (DEMUXs) are used to only separate a single-wavelength channel into one output path for detecting each Yi. This allows emulation of massively parallel processing in neural networks. As illustrated in Figure 5b, parallel processing of several images (indicated by three red frames) and one kernel simultaneously based on WDM is a huge advantage of photonic architectures compared with serial information processing in the digital computer. This results in high computational density and efficiency with low latency as multiple MVMs are performed in a single clock cycle. Figure 5c shows an optical micrograph of a photonic tensor core based on SiN waveguides with a mesh of 16 × 16 fabricated in the cleanroom using electron-beam lithography and reactive ion etching.37 Figure 5d shows an optical micrograph of a photonic tensor core based on SOI waveguides with a mesh of 3 × 3 fabricated by multi-project wafer (MPW) service.39 Recently, Wu et al. proposed a phase-gradient metasurface mode converter (PMMC) as shown in Figure 5e for mode conversion from the fundamental mode to the 1st-order mode. As shown in Figure 5f, a MAC core is realized with four inputs biased by weights setting of PMMCs.25 Compute density and compute efficiency are, respectively, predicted to be 880 TOPS/mm2 and 5.1 TOPS/W for a 64 × 64 crossbar-based photonic tensor core operating at 25-GHz clock speed. Compared with digital electronic accelerator (ASIC and GPU), the photonic core has 1−3 orders of magnitude improvement in both compute density and efficiency. Detailed estimation can be found in Reference 37.

Figure 5
figure 5

(a) Schematic of parallel processing in a photonic tensor core. (b) A photonic tensor core for parallel convolutional operations. (c, d) Optical micrographs of fabricated photonic tensor cores based on Si3N4 waveguide platform37 (c) and silicon-on-insulator waveguide platform39 (d). (e) Three-dimensional schematic of a phase-gradient metasurface mode converter (PMMC). (f) Optical micrograph of a photonic core consisting of four PMMCs. The predicted compute energy efficiency of a photonic tensor core is 5.1 TOPS/W (TOPS: Tera operations per second).37 Figures are reproduced from (a–c) Reference 37 with the permission of Springer Nature; (d) Reference 39 with the permission of IEEE; (e) and (f) Reference 25 with the permission of Springer Nature. GST, Ge2Sb2Te5; MVMs, matrix–vector multiplications; DSP, digital signal processing.

Summary

In summary, we have overviewed recent progress in the development of energy-efficient waveguide memory cells for applications in in-memory and neuromorphic photonic computing. Waveguide memory cells leverage the near-field interaction to reduce energy consumption. Crucially, energy consumption is only 60 pJ in an all-optical abacus with operating speed approaching 1 GHz.24 And optical switching threshold is drastically reduced to (16 ± 2) pJ in a plasmonic nanogap device.6 High-performance waveguide memory cells have been deployed to achieve neuromorphic40,41 and in-memory computing.38 When used as a neuromorphic tensor core, it is scalable for various AI applications, for example to enable edge detection capabilities via convolutional filter and linear transformation as a fully connected layer in artificial neural networks.37 The PCM memory is fully passive after nonvolatile weight setting. In addition, it has a large data throughput using parallel processing based on the WDM. Compared with digital electronic accelerators (ASIC and GPU), photonic cores show 1−3 orders of magnitude improvement in both compute density and efficiency.37