Interfacing PDM MEMS Microphones with PFM Spiking Systems: Application for Neuromorphic Auditory Sensors

Gutierrez-Galan, Daniel; Rios-Navarro, Antonio; Dominguez-Morales, Juan Pedro; Duran-Lopez, Lourdes; Jimenez-Moreno, Gabriel; Jimenez-Fernandez, Angel

doi:10.1007/s11063-022-10936-0

Interfacing PDM MEMS Microphones with PFM Spiking Systems: Application for Neuromorphic Auditory Sensors

Open access
Published: 25 July 2022

Volume 55, pages 1281–1292, (2023)
Cite this article

Download PDF

You have full access to this open access article

Neural Processing Letters Aims and scope Submit manuscript

Interfacing PDM MEMS Microphones with PFM Spiking Systems: Application for Neuromorphic Auditory Sensors

Download PDF

Daniel Gutierrez-Galan ORCID: orcid.org/0000-0003-3100-0604^1,2^na1,
Antonio Rios-Navarro^1,2^na1,
Juan Pedro Dominguez-Morales^1,2^na1,
Lourdes Duran-Lopez^1,2^na1,
Gabriel Jimenez-Moreno^1,2 &
…
Angel Jimenez-Fernandez^1,2

1560 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Neuromorphic computation processes sensors output in the spiking domain, which presents constraints in many cases when converting information to spikes, loosing, as example, temporal accuracy. This paper presents a spike-based system to adapt audio information from low-power pulse-density modulation (PDM) microelectromechanical systems microphones into rate coded spike frequencies. These spikes could be directly used by the neuromorphic auditory sensor (NAS) for frequency decomposition in different bands, avoiding the analog or digital conversion to spike streams. This improves the time response of the NAS, allowing its use in more time restrictive applications. This adaptation was conducted in VHDL as an interface for PDM microphones, converting their pulses into temporal distributed spikes following a pulse-frequency modulation scheme with an accurate inter-spike-interval, known as PDM to spikes interface (PSI). We introduce a new architecture of spike-based band-pass filter to reject DC components and distribute spikes in time. This was tested in two scenarios, first as a stand-alone circuit for its characterization, and then integrated with a NAS for verification. The PSI achieves a total harmonic distortion of $-$46.18 dB and a signal-to-noise ratio of 63.47 dB, demands less than 1% of the resources of a Spartan-6 FPGA and its power consumption is around 7 mW.

Neuromorphic analog spiking-modulator for audio signal processing

Article 26 October 2020

Hardware Spiking Artificial Neurons, Their Response Function, and Noises

Decoding speech from spike-based neural population recordings in secondary auditory cortex of non-human primates

Article Open access 11 December 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Pulse-Density Modulation (PDM) is a sigma-delta modulation technique used to digitize an analog signal with a 1-bit data stream and a high sample rate. In recent years, many low-power microelectromechanical systems (MEMS) [1] microphones designed for mobile applications, such as tablets, laptops and cell phones, among others, have appeared in the market. In PDM data streams, a logic ‘1’ corresponds to a pulse of the maximum positive polarity (+A), and a logic ‘0’represents the maximum negative polarity ($-$A). A signal value of 0 is codified by an alternation of ‘1’ and ‘0’. Commonly, this type of modulation is associated with neuromorphic information codification, in the sense of being a rate-coded signal [2].

Neuromorphic computation allows processing Pulse-Frequency Modulation (PFM) information only when it is needed, avoiding periodic or redundant data processing, which is present in Pulse-Coded Modulation (PCM) signals, thus saving power and computational resources [3]. A similar approach can be found in [4], where the authors use the PDM microphones to encode the input sound direclty into a spike train, thus having the information ready to be processed using Spiking Neural Networks (SNNs). In addition, a PDM coding strategy is used in [5] for the communication between neurons when sound information is used.

Currently, we can find diverse neuromorphic cochleae, both analog [6,7,8,9,10] and digital [11, 12], inspired by Lyon’s cascade model [13] modeling the Inner Hair Cells (IHC). In [14], a Neuromorphic Auditory Sensor (NAS) is presented, based on on spike signal processing (SSP) techniques [15].

Figure 1 shows a global scheme of the NAS architecture. First, the audio information is provided by a digital audio codec, whose discrete audio samples output is converted into spike (1-clock cycle width pulses inside the FPGA) streams, following the Pulse-Frequency Modulation (PFM). The NAS filters these spikes directly, spike after spike, using a set of Spike-based Low-Pass Filter (SLPF) connected in a cascade fashion. Finally, spikes are transmitted to the next layers using the Address-Event Representation (AER) protocol.

NAS has been currently used for many practical applications, such as sound source localization [16], heart murmur diagnosis [17], and speech recognition [18, 19], among others. Great effort has been dedicated to improve NAS features, as it is the input layer of all these systems, improving responses and spreading for new applications of this technology.

The circuit proposed in this paper provides a novel interface with PDM microphones which output a rate-coded signal with a higher sampling rate than audio codecs (3.125 MHz in this case), and a time resolution of 320 ns, while audio codecs have a sampling period from 22.67 $\upmu \hbox {s}$ to 10.41 $\upmu \hbox {s}$, limiting the temporal capabilities for some applications, e.g., sound localization systems [20]. The output of the proposed circuit supplies a spike stream which can be directly processed by the NAS’ filter banks [14], avoiding the use of discrete audio codecs. We also introduce a new Spike-based Band-Pass Filter (SBPF) for spikes distribution.

The main contributions of this paper include the following:

A novel interface for PDM to rate-coded spikes conversion is presented.
The proposed interface have been simulated in software and implemented in reconfigurable hardware.
The hardware resources and power consumption needed have been analyzed.
The proposed interface is capable of working in real time and could be used for a wide range of neuromorphic applications.

The rest of the paper is structured as follows. In Sect. 2 an interface to convert PDM into rate-coded spikes is presented, which is divided in two main blocks: the front-end circuit (Sect. 2.1) and a spike-based second-order band-pass filter (Sect. 2.2). The hardware resources and power consumption of the presented interface are analyzed in Sect. 2.3. In Sect. 3, the experimental results are presented, including hardware simulation (Sect. 3.1) and implementation (Sect. 3.2). Finally, in Sect. 4, the conclusions of this paper are presented.

2 PDM to Spikes Interface (PSI)

PDM information codification technique is substantially different from rate-coded spike-based signals. In rate-coded spike-based signals, the information is given by the spikes frequency, which means that the information is inversely proportional to the temporal Inter-Spike Interval (ISI). Spike-based systems use PFM to distribute the spikes in time properly, in order to accurately represent the signal’s information. On the other hand, in PDM signals the information is contained in the density of pulses, and one pulse is generated every clock cycle, where a logic ‘1’ represents a positive value, and a logic ‘0’ a negative one. For example, when there are more ‘1’s than ‘0’s the information is positive, and the more ‘1’s, the greater the amplitude. Thus, to reconstruct the signal’s amplitude, it is necessary to collect PDM pulses during a temporal window, performing a downsampling operation, obtaining discrete samples with a fixed period. However, using PFM spike-based signals, with only two spikes, it is possible to reconstruct the amplitude of the original signal if the ISI is accurate enough, allowing it to process information spike-by-spike, and only performing computation when a spike is fired. Consequently, when there are no audio stimuli, spikes are not fired, and no computation is performed, thus saving power.

Digital systems convert PDM signals to digital values using the PCM. PCM is reconstructed from PDM with a digital decimation stage, commonly performing a downsampling by a factor of 64, and providing a multiple-bit word (e.g., 16 bits @ 48.8 kSamples/s) with high frequency noise added. After this stage, an infnite impulse response (IIR) filter is commonly used as a band- pass filter (BPF) to remove DC component and high frequency quantization noise.

The main goal of PSI is to read PDM pulses and redistribute them in time as rate-coded spikes, with an ISI proportional to the sound pressure, performing the similar operations of digital systems but in the context of spike-based signals.

To convert PDM information into rate-coded spikes, a two stages circuit (see Fig. 2) is proposed. The first stage is a Finite State Machine (FSM) circuit for PDM MEMS microphones interfacing, that generates a spike of a single clock cycle for each PDM pulse, detailed in Sect. 2.1. The next stage consists of one (monaural) or two (binaural) banks of SBPF (depending on target applicaction requirements), which process raw spikes from the previous stage, to give a temporal distributed spikes stream, detailed and analyzed in Sect. 2.2.

Since spikes can be both positive and negative, we use different wires to represent signed spikes. The FSM output generates a stream of signed spikes that are still not distributed in time, with the ISI being constant and equal to the PDM clock period. Figure 3 presents an example of a positive increasing audio signal, and how spikes evolve. PDM DATA represents the increasing input signal amplitude which is captured by PDM edge detector using the PDM CLK. This stage generates PFC OUT which are processed by SBPF using Eq. (1).

2.1 PDM Front-End Circuit

The PDM front-end circuit (PFC) has two main functionalities: to generate the PDM clock and to convert long PDM pulses into one clock cycle spikes (PFC OUT (P) and (N) in Fig. 3). The hardware platform used to implement these blocks is called AER-Node [21] and it has a base clock frequency of 50 MHz. Dividing this clock by a factor of 16, we get a PDM clock of 3.125 MHz, which is the maximum value commonly allowed by MEMS microphones. In every PDM clock cycle (PDM CLK signal in Fig. 3) there is a MEMS pulse in the PDM DAT signal. If PDM DAT has a value of ‘1’ then a positive spike is transmitted to the next stage, and if there is a ‘0’ it will be a negative spike (PFC OUT (P) and PFC OUT (N), respectively, in Fig. 3).

2.2 Second-Order Spikes Band-Pass Filter (SBPF)

The next stage is a Spike-based Band-Pass Filter (SBPF), which has been designed combining neurons for SSP. Two different kinds of neurons were used: Spikes Hold and Fire (SHF) and Spikes Integrate and Generate (SIG). SHF is used to subtract two input spike streams, generating a new stream of spikes with a frequency equivalent to the difference between input frequencies. SIG works like a spikes integrator. Internally, it consists of an up/down counter and a Synthetic Spikes Generator (SSG), with the output of the counter being the SSG’s input. Input spikes increase or decrease the counter value according to their polarity, thus performing the integration operation. Finally, the SSG provides an output spiking rate proportional to the counter’s value (i.e., integrated value). We can build frequency filters in the context of PFM spikes connecting these neuron models with different typologies. For example, from the point of view of ISI, a Spike-based High-Pass Filter (SHPF) only allows propagating spike rate changes, while a Spike-based Low-Pass Filter (SLPF) rejects high-frequency changes in spike rate.

Figure 4 shows the SBPF architecture, which is composed of two stages. Both stages are closed-loops with a SIG, although with a different SIG placement. These neurons are connected with 2-bit buses to propagate positive and negative spikes between them, in order to represent the bipolar nature of audio.

On the first stage, the SIG is placed in the feedback, while in the second stage it is placed in the main loop. Making an analogy to Laplace Transform, as was done in [14], with $k_{SIG1}$ being the gain of the SIG in the first stage, and $k_{SIG2}$ the one in the second stage, the full transfer function is presented in Eq. (1). It consists of one zero, which rejects the DC component, and two negative poles to filter quantification noise, implemented in the same way as for sampled digital systems, but in the context of PFM spike-coded signals.

$$\begin{aligned} SBPF(s)= & {} \frac{1}{1+\frac{k_{SIG1}}{s}}*\frac{\frac{k_{SIG2}}{s}}{\frac{k_{SIG1}}{s}+1}\nonumber \\= & {} \frac{k_{SIG2}s}{(s+ k_{SIG1})(s+k_{SIG2})} \end{aligned}$$

(1)

2.3 Hardware Resources and Power Consumption

The PSI design was synthesized and implemented on a Xilinx Spartan 6 FPGA (XC6LX150T) to measure the required resources and its power consumption. Table 1 presents the resources that are needed to implement PSI in FPGA. As can be seen, the amount of resources needed is under 0.1% of the total slice registers and logic (LUT) of the FPGA. The PSI can operate at a clock frequency of up to 148.34 MHz. To measure real power consumption, a 64-channel binaural NAS, was deployed on an AER-Node working at 50 MHZ [21]. We made two power consumption measurements to isolate the PSI module consumption. The first is the power consumption of the entire system, and the second is the power consumption of the entire system at reset without the PSI module. The difference between both measurements, 7 mW, corresponds to the real dynamic power consumption of the PSI module. This power consumption is significantly lower than that of similar works [22], which consumes around 33.78 mW. The AER-Node platform with the 64-channel binaural NAS deployed on the FPGA consumes 1.33 W, to which the power consumption of the microphones used should be added.

Table 1 PSI hardware requirements

Full size table

3 Experimental Results

For testing purposes, a scenario was built to analyze the PSI’s standalone behavior. Figure 5 presents the testing setup, where two PDM microphones from ST Microelectronics (MP34DT02) were connected to an AER-Node board, which was in turn connected to an USB-AERmini2 board. MP34DT02 are omnidirectional MEMS microphones with PDM interfaces, with an acoustic overload point of 120 $\hbox {dB}_{\mathrm{SPL}}$, an SNR of 60 dBm, a dynamic range of 86 dB, and a maximum power consumption of 0.98 mW.

The AER-Node board holds the PSI, a 128-channel binaural NAS, and a set of AER interfaces. Its parallel AER output was connected to the USB-AERmini2 board [23], which works like a bridge between AER buses and USB ports, allowing the AER events to be sent from the AER-Node board to a host computer. In the computer, two software tools were running: jAER [24], to visualize and log AER information; and MATLAB, to analyze and process the events. The sound used to excite the system was played using a flat response audio speaker, in this case a BEHRITONE C5A from Behringer, placed at a 1-meter distance from the PDM microphones and at a fixed gain in order to have an audio level of 65 $\hbox {dB}_{\mathrm{SPL}}$ on the microphones’ side. This kind of equipment was used to avoid the influence of audio equalizers and the compensation of domestic Hi-Fi equipment. Thus, no prepossessed sounds were used and, instead, we tried to reproduce sound waves in the most ideal way possible. This will potentially open our system to many stand-alone applications, such as robotics.

3.1 PSI Experimental Results

For the first experiment, the system was stimulated with a clear 500 Hz pure tone audio signal played by the flat response speaker. Figure 6 represents the spikes from each stage of the PSI. Higher addresses (3 and 2) correspond to the spikes fired by the PDM front-end circuit, and lower addresses (1 and 0) to the SBPF output. Spike addresses 3 and 1 are positive, while 2 and 0 are negative.

Figure 6 depicts how the addresses that contain the output of the PDM front-end overlap the information between positive and negative, which does not happen after filtering it with the PSI. In PDM, information makes sense for the average activity of a temporal window. However, in the spikes domain, the information is coded with the time between two consecutive spikes. From the signal sign point of view, we can say that zero-crossing is performed when the polarity of the spikes changes (i.e., after a positive spike, a negative one is produced). In the case of the PDM front-end output, there are several spikes overlapping positive (address 3) and negative activity (address 2). From the point of view of ISI, this represents a considerable amount of high-frequency noise. However, if we check the SBPF output spikes, there is no overlapping between positive (address 1) and negative (address 0) activity, rejecting high frequency noise.

Figure 7 shows the reconstruction of the original signal using the spikes’ ISI. First, the green signal represents the reconstruction from PDM front-end’s output. This is a noisy signal with a measured Total Harmonic Distortion (THD) of $-$38.11 dB and a Signal-to-Noise Ratio (SNR) of 51.31 dB, with an offset of $-$487 kSpikes/s introduced by the PDM microphones. On the other hand, the blue signal is the reconstruction from SBPF’s output. A tone with less noise and offset can be seen, improving the previous audio signal quality. Analyzing this response, we achieve a THD of $-$46.18 dB and a SNR of 63.47 dB.

To measure the number of zero-crossings, a one second recording was analyzed and the amount of changes from positive spike to negative and vice versa were counted. In the PDM front-end’s output, more than 80k zero-crossings were found. However, in SBPF’s output, 1k zero-crossings were found, which exactly matches a 500 Hz signal.

Our second experiment consisted in a frequency sweep from 20 Hz to 20 kHz to analyze the behavior of the system with different frequencies. For this experiment we have generated a set of sinusoidal signals with 1V peak-to-peak amplituded using a Realtek ALC1150 audio codec in a computer. ALC1150 has an SNR of 115 dB and a THD of $-$88 dB. Figure 8 shows the frequency sweep results as a bode plot. The top curve in Fig. 8 presents the gain for diverse frequencies. PSI gain starts to increase from 70 Hz to 12 kHz, and then decreases, rejecting higher frequencies. This bandwidth is enough for many applications related to speech and speakers recognition. The spike-based filters in the PSI introduce a temporal deviation. It was measured as signal phase (in rads) and the results are included in Fig. 8 bottom. PSI has a mean phase of $-$4.5 rads, approximately, increasing when frequency is close to the cut-off frequency, as expected from a low-pass filter.

3.2 NAS Integration

In order to validate the PSI on a real scenario, it was integrated in a 128-channel binaural NAS.^{Footnote 1} This NAS was fed with a male voice saying: “Si vis pacem, para bellum”, and the output activity was recorded using an USB-AERMini2 board as an AER-DATA file. Figure 9 contains the spikegram and the sonogram of this recording, respectively. Each word is clearly distinguishable, and activates middle channels between 200 Hz and 5 kHz. These figures were obtained by using NAVIS software [25].

4 Conclusions

In this paper, a PDM to PFM Spikes circuit is presented. PDM MEMS microphones are useful for low-power, stand-alone, embedded applications. Their output is based on spike density, and it needs to be adapted in order to be used as input to the NAS. A two-stage circuit for FPGA was designed, which is able to convert PDM information to PFM spikes with a consistent ISI. The PSI was synthesized for a Spartan 6 FPGA with low resources and power requirements. It was then tested with real audio stimuli, analyzing its behavior in terms of temporal response and zero-crossings. The PSI was also integrated in a full NAS to demonstrate the viability of the combination of this kind of systems. The use of PDM microphones with NAS considerably simplifies the system, enabling the development of compact and portable spike-based auditory systems with lower power consumption. However, this approach could have some limitations in very specific applications. For instance, the use of PDM microphones limits the preprocessing steps before the digitalization of the sound, like the automatic gain control, amplification, and other analog-based processing, due to the direct conversion from sound to digital pulses. Future work aims at developing adaptative capabilities in real time, as human ear does. Current research is focused on implementing a local automatic gain control [10] for input sound, and SBPF parameters (central pass frequency and Q factor) according to sound features performing an adaptation task in real time.

Code availability

The code related to this work is included in the OpenNAS project. See the OpenNAS GitHub project (https://github.com/RTC-research-group/OpenNAS) for more information.

Notes

This NAS was generated by using OpenNAS tool. Source code available in https://github.com/RTC-research-group/OpenNAS.

References

Maluf N, Williams K (2004) An introduction to microelectromechanical systems engineering. Artech House
Smith LS (2010) Neuromorphic systems: Past, present and future. Brain Inspired Cognitive Systems 2008:167–182
Google Scholar
Liu S, Rueckauer B, Ceolini E, Huber A, Delbruck T (2019) Event-driven sensing for efficient perception: Vision and audition algorithms. IEEE Signal Processing Magazine 36(6):29–37. https://doi.org/10.1109/MSP.2019.2928127
Article Google Scholar
Bensimon M, Greenberg S, Haiut M (2021) Using a Low-Power Spiking Continuous Time Neuron (SCTN) for Sound Signal Processing. Sensors 21(4):1065
Article Google Scholar
Sevuktekin NC, Varshney LR, Hanumolu PK, Singer AC (2019) Signal processing foundations for time-based signal representations: Neurobiological parallels to engineered systems designed for energy efficiency or hardware simplicity. IEEE Signal Processing Magazine 36(6):38–50
Article Google Scholar
Chan V, Liu S-C, van Schaik A (2007) AER EAR: A matched silicon cochlea pair with address event representation interface. IEEE Transactions on Circuits and Systems I: Regular Papers 54(1):48–59
Article Google Scholar
Hamilton TJ, Jin C, Van Schaik A, Tapson J (2008) An active 2-D silicon cochlea. IEEE Transactions on biomedical circuits and systems 2(1):30–43
Article Google Scholar
Wen B, Boahen K (2009) A silicon cochlea with active coupling. IEEE transactions on biomedical circuits and systems 3(6):444–455
Article Google Scholar
Liu S.-C, Van Schaik A, Mincti B.A, Delbruck T (2010) Event-based 64-channel binaural silicon cochlea with q enhancement mechanisms. In: Proceedings of 2010 IEEE International Symposium on Circuits and Systems, pp. 2027–2030 . IEEE
Kiselev I, Gao C, Liu S-C (2022) Spiking Cochlea with System-level Local Automatic Gain Control. Regular Papers, IEEE Transactions on Circuits and Systems I
Google Scholar
Mugliette C, Grech I, Casha O, Gatt E, Micallef J (2011) FPGA active digital cochlea model. In: 2011 18th IEEE International Conference on Electronics, Circuits, and Systems, pp. 699–702 . IEEE
Thakur C.S, Hamilton T.J, Tapson J, van Schaik A, Lyon R.F (2014) FPGA Implementation of the CAR Model of the Cochlea. In: 2014 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1853–1856 . IEEE
Lyon R (1982) A computational model of filtering, detection, and compression in the cochlea. In: ICASSP’82. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 7, pp. 1282–1285 . IEEE
Jiménez-Fernández A, Cerezuela-Escudero E, Miró-Amarante L, Domínguez-Morales MJ, de Asís Gómez-Rodríguez F, Linares-Barranco A, Jiménez-Moreno G (2016) A binaural neuromorphic auditory sensor for FPGA: A spike signal processing approach. IEEE transactions on neural networks and learning systems 28(4):804–818
Article Google Scholar
Jimenez-Fernandez A, Linares-Barranco A, Paz-Vicente R, Jiménez G, Civit A (2010) Building blocks for spikes signals processing. In: The 2010 International Joint Conference on Neural Networks (IJCNN), pp. 1–8 . IEEE
Schoepe T, Gutierrez-Galan D, Dominguez-Morales J.P, Jimenez-Fernandez A, Linares-Barranco A, Chicca E (2019) Neuromorphic sensory integration for combining sound source localization and collision avoidance. In: 2019 IEEE Biomedical Circuits and Systems Conference (BioCAS), pp. 1–4 . IEEE
Dominguez-Morales JP, Jimenez-Fernandez AF, Dominguez-Morales MJ, Jimenez-Moreno G (2017) Deep neural networks for the recognition and classification of heart murmurs using neuromorphic auditory sensors. IEEE transactions on biomedical circuits and systems 12(1):24–34
Article Google Scholar
Dominguez-Morales J.P, Liu Q, James R, Gutierrez-Galan D, Jimenez-Fernandez A, Davidson S, Furber S (2018) Deep spiking neural network model for time-variant signals classification: a real-time speech recognition approach. In: 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–8 . IEEE
Rasetto M, Dominguez-Morales J.P, Jimenez-Fernandez A, Benosman R (2021) Event Based Time-Vectors for auditory features extraction: a neuromorphic approach for low power audio recognition. arXiv preprint arXiv:2112.07011
Indiveri G, Sandamirskaya Y (2019) The importance of space and time for signal processing in neuromorphic agents: The challenge of developing low-power, autonomous agents that interact with the environment. IEEE Signal Processing Magazine 36(6):16–28. https://doi.org/10.1109/MSP.2019.2928376
Article Google Scholar
Iakymchuk T, Rosado A, Serrano-Gotarredona T, Linares-Barranco B, Jiménez-Fernandez A, Linares-Barranco A, Jiménez-Moreno G (2014) An AER handshake-less modular infrastructure PCB with x8 2.5 Gbps LVDS serial links. In: 2014 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1556–1559 . IEEE
da Silva B, Segers L, Braeken A, Steenhaut K, Touhafi A (2018) A low-power FPGA-based architecture for microphone arrays in wireless sensor networks. In: Voros, N., Huebner, M., Keramidas, G., Goehringer, D., Antonopoulos, C., Diniz, P.C. (eds.) Applied Reconfigurable Computing. Architectures, Tools, and Applications, pp. 281–293. Springer, Cham
Berner R, Delbruck T, Civit-Balcells A, Linares-Barranco A (2007) A 5 Meps \$100 USB 2.0 address-event monitor-sequencer interface. In: 2007 IEEE International Symposium on Circuits and Systems, pp. 2451–2454 . IEEE
Delbruck T (2008) Frame-free dynamic digital vision. In: Proceedings of Intl. Symp. on Secure-Life Electronics, Advanced Electronics for Quality Life and Society, pp. 21–26 . Tokyo
Dominguez-Morales JP, Jimenez-Fernandez A, Dominguez-Morales M, Jimenez-Moreno G (2017) NAVIS: Neuromorphic Auditory VISualizer tool. Neurocomputing 237:418–422
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank the Spanish Agencia Estatal de Investigación (AEI) for supporting this work and Prof. Alejandro Linares-Barranco for his active participation and expert suggestions. The work of Daniel Gutierrez-Galan was supported by a Formación de Personal Investigador Scholarship from the Spanish Ministry of Education, Culture and Sport.

Funding

Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. This work was supported by the Spanish Agencia Estatal de Investigación (AEI) project MINDROB and by an H2020 European Regional Development Fund project CHIST-ERA SMALL (Grant Numbers PID2019- 105556GB-C33/ AEI/10.13039/501100011033 and PCI2019-111841-2)

Author information

Antonio Rios-Navarro, Juan Pedro Dominguez-Morales, Lourdes Duran-Lopez, Gabriel Jimenez-Moreno and Angel Jimenez-Fernandez contributed equally to this work.

Authors and Affiliations

Arquitectura y Tecnologîa de Computadores, Universidad de Sevilla Spain, Avd. Reina Mercedes s/n, Sevilla, Spain
Daniel Gutierrez-Galan, Antonio Rios-Navarro, Juan Pedro Dominguez-Morales, Lourdes Duran-Lopez, Gabriel Jimenez-Moreno & Angel Jimenez-Fernandez
I3US, Score Lab, Avd. Reina Mercedes s/n, Sevilla, Spain
Daniel Gutierrez-Galan, Antonio Rios-Navarro, Juan Pedro Dominguez-Morales, Lourdes Duran-Lopez, Gabriel Jimenez-Moreno & Angel Jimenez-Fernandez

Authors

Daniel Gutierrez-Galan
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Rios-Navarro
View author publications
You can also search for this author in PubMed Google Scholar
Juan Pedro Dominguez-Morales
View author publications
You can also search for this author in PubMed Google Scholar
Lourdes Duran-Lopez
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel Jimenez-Moreno
View author publications
You can also search for this author in PubMed Google Scholar
Angel Jimenez-Fernandez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniel Gutierrez-Galan.

Ethics declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gutierrez-Galan, D., Rios-Navarro, A., Dominguez-Morales, J.P. et al. Interfacing PDM MEMS Microphones with PFM Spiking Systems: Application for Neuromorphic Auditory Sensors. Neural Process Lett 55, 1281–1292 (2023). https://doi.org/10.1007/s11063-022-10936-0

Download citation

Accepted: 16 June 2022
Published: 25 July 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s11063-022-10936-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Interfacing PDM MEMS Microphones with PFM Spiking Systems: Application for Neuromorphic Auditory Sensors

Abstract

Similar content being viewed by others

Neuromorphic analog spiking-modulator for audio signal processing

Hardware Spiking Artificial Neurons, Their Response Function, and Noises

Decoding speech from spike-based neural population recordings in secondary auditory cortex of non-human primates

1 Introduction