Introduction

Biological neural networks, and especially the human brain, achieve astonishing performance in dynamic information processing and energy efficiency. For instance, the human brain does not consume significantly more power than a 20 W light bulb (Furber 2012), whereas the large matrix processing units used for machine learning consume far more energy (Wang et al. 2020). How is it possible that general intelligence emerges in the human brain despite significant biological constraints? On the one hand, the answer to this question has the potential to significantly boost artificial intelligence (AI) research by implementing the underlying biological principles in artificial neural networks [neuroscience-inspired AI, (Hassabis et al. 2017), e.g. (Yang et al. 2021; Schilling et al. 2022)]. On the other hand, better artificial neural networks could help us understand how the brain works, as these networks can serve as model systems that can be analyzed in much more detail than their biological counterparts [cognitive computational neuroscience, (Kriegeskorte and Douglas 2018), see also (Gerum et al. 2020; Schilling et al. 2021a; Stoewer et al. 2023a, b; Surendra et al. 2023; Metzner et al. 2023; Schilling et al. 2022)].

Consequently, the interest in neuromorphic computing and especially spiking neural networks (SNNs) increased in recent years, as they offer a promising approach to bridge the gap between the performance achieved by current deep learning methods and the energy efficiency of biological neural networks (Eshraghian et al. 2021; Yamazaki et al. 2022; Xiao et al. 2022; Gerum et al. 2023).

The behaviour of biological neurons was first mathematically described by Hodgkin and Huxley (1952). Even though the Hodgkin–Huxley model accurately describes the underlying experimental observations, it is computationally complex and therefore often simplified. One such simplification is the FitzHugh–Nagumo neuron model, which uses a reduced set of differential equations (FitzHugh 1961; Izhikevich and FitzHugh 2006; Nagumo et al. 1962). However, simulating large networks based on these neuron models remains computationally expensive and often infeasible. Therefore, biological neurons are commonly approximated with phenomenological spiking neuron models (Gerstner and Kistler 2002). Spiking neurons produce identical action potentials (spikes) when certain threshold criteria are met for their internal states (Kandel et al. 2000). As a result, these neurons transmit information via an energy-efficient communication scheme that is binary and sparse.

One prevalent spiking neuron model is the leaky-integrate-and-fire (LIF) neuron. It is computationally efficient and therefore used in many spiking neural network studies (Yamazaki et al. 2022).

SNNs provide a more biologically-inspired approach to artificial neural networks compared to the standard deep neural networks used for pattern recognition. However, training SNNs remains challenging and is an active field of research (Xiao et al. 2022; Alonso et al. 2022; Apolinario and Roy 2023; Gerum and Schilling 2021; Gerum et al. 2023). Even though simple models like the LIF neuron provide a wide range of parameters that influence the dynamics of information processing, many recently proposed training methods, e.g. Xiao et al. (2022) and Apolinario and Roy (2023), still only optimize the synaptic weights. This is likely due to a lack of understanding of the effects of different parameters on the spiking dynamics of SNNs.

Recently though, multiple publications have set a starting point for evaluating how SNNs behave under different conditions and parameter combinations. In particular, special focus has been placed on the temporal parameters (resp. time constants) of LIF neurons. For example, Perez-Nieves et al. (2021) show that SNNs perform best when there is a certain heterogeneity in the time constants of LIF neurons and report a particular benefit for tasks with a lot of temporal structure in the data. Further studies carried out similar experiments, but the authors set their focus on running SNNs more efficiently on neuromorphic hardware (Fang et al. 2021; Quax et al. 2020; Yin et al. 2020). In Gerum and Schilling (2021), the authors proposed that a single LIF neuron can be run in two operation modes: a LIF neuron with a short membrane decay time can be regarded as a coincidence detector, whereas a LIF neuron with a long membrane decay time acts as an integrator neuron. However, it is still unclear whether these operation modes actually arise, and thus, whether they can be precisely tuned in populations of neurons (resp. SNNs).

Coincidence detection in particular seems to be a desirable operation mode, as it has been observed in many different sensory modalities and cognitive processes in studies of biological neural networks. It is known to be involved in e.g. memory formation (Bender et al. 2006; Fino et al. 2010) and the decoding of motor input or sensory stimuli (Xu et al. 2012; Roome and Kuhn 2020). Coincidence detection has also been found in the auditory (Franken et al. 2015) and visual (Ran et al. 2020) systems of mammals.

Detecting a coincidence refers to the process of extracting information from activity across different neurons that occurs within a short period of time. However, systems differ both in what kind of coincidence they are trying to detect and in the mechanisms by which they detect it. In this study, coincidence detection refers to a postsynaptic neuron being responsive to presynaptic activity that arrives within a short period of time.

A biological mechanism for detecting this kind of coincidence has been reported in e.g. the auditory system of the Mongolian gerbil (Franken et al. 2015), where it is crucial for sound localization. Intrinsic conductances of neurons in the medial superior olive—a brainstem nucleus of the auditory pathway—interact with preceding synaptic activity to generate an internal phase delay as part of the coincidence detection process. Both the recent input activity and low-voltage-activated Kv1 potassium channels alter the postsynaptic neuron’s membrane potential, which enables fine-tuned responses to different temporal input patterns. This biological mechanism is involved in spatial hearing, as coincidence detection makes it possible to resolve the small time difference of a sound arriving at the two ears. Further studies on the auditory system proposed that coincidence detection is also important for forming neural networks that are able to calculate auto-correlations of complex signals. Thus, Krauss et al. (2016, 2018), Schilling et al. (2021b), Schilling and Krauss (2022) and Schilling et al. (2023) propose that these auto-correlations of auditory signals are used by the auditory system to enhance sensory processing and to compensate for the effects of hearing loss.

Despite its central role in the auditory system, coincidence detection is important in various other modalities and brain regions as well. It also plays a role in e.g. cortical integration of sensory and motor input (Xu et al. 2012), sub-cortical processing of visual stimuli (Ran et al. 2020) and information processing in the cerebellum (Roome and Kuhn 2020).

Despite abundant evidence that this operation mode exists in biological neural networks, its potential is yet to be tapped by the majority of SNN studies. In this study, we therefore explore the connection between the membrane time constant and the proposed LIF operation modes in SNNs that are trained on four commonly used image classification datasets: MNIST, EMNIST/Letters, Fashion-MNIST and CIFAR-10. We propose two measures that allow us to determine a neuron’s operation mode with respect to the other neurons in a spiking neural network. Thus, the proposed measures can be used to better understand the contribution of single neurons to the dynamics of entire populations of neurons. Besides supporting the explainability of SNNs, we also demonstrate a clear correlation between the membrane decay time (inverse leak term) and the neuron’s spiking dynamics in SNNs optimized with a supervised surrogate-gradient-based training method. We find that the coincidence detection mechanisms observed in biology can be reproduced in networks of LIF neurons in a simplified manner. This makes tuning the operation modes (resp. membrane decay times) an interesting approach towards more biological plausibility in machine learning.

Methods

Computational resources

All simulations were performed on standard desktop PC hardware. The experiments were run on a modified version of the tf_spiking Python package (Gerum 2020b), which forms the backbone of our machine learning approach and is based on Keras (Chollet et al. 2015) and TensorFlow (Abadi et al. 2015). For further evaluations, we used NumPy (Harris et al. 2020) and Pandas (The pandas development team 2020). All visualizations were created with Matplotlib (Hunter 2007) and Pylustrator (Gerum 2020a). The experiments were conducted with a five-fold cross-validation on four different image classification datasets, namely the MNIST database of handwritten digits (Deng 2012), EMNIST/Letters (Cohen et al. 2017), Fashion-MNIST (Xiao et al. 2017) and CIFAR-10 (Krizhevsky et al. 2009). The image pixels are converted into spike trains using Poisson encoding.

Our fully connected network has one hidden layer of 128 LIF neurons and is trained in a supervised manner using the surrogate-gradient approach proposed in Gerum and Schilling (2021). The membrane decay times (resp. leak terms) are initialized either with identical values for all neurons in the hidden layer ("constant") or with 32 bins of four neurons each ("binned uniform"). The neurons of a bin are initialized with the same membrane decay time.

Deep learning with leaky-integrate-and-fire neurons

For our experiments, we build a feed-forward spiking neural network based on leaky-integrate-and-fire (LIF) neurons (Burkitt 2006) (see Fig. 1a). As shown in Gerum and Schilling (2021), LIF neurons can be mathematically described by the following equations:

$$V_{t_{n}} = \text{ReLU}\!\left[\, w_{\text{input}} \cdot x_{t_{n}} + (1 - w_{\text{leak}} \cdot \Delta t) \cdot V_{t_{n-1}} \cdot \Theta_{2}\!\left(V_{\text{thresh}} - V_{t_{n-1}}\right) \right] \tag{1}$$

$$y_{t_{n}} = \Theta_{1}\!\left(V_{t_{n}} - V_{\text{thresh}}\right) \tag{2}$$

$$t_{n} = t_{n-1} + \Delta t \tag{3}$$

with \(V_{t_{n}}, V_{\text{thresh}}, x_{t_{n}} \in \mathbb{R}\); \(w_{\text{leak}}, \Delta t, y_{t_{n}} \in \mathbb{R}^{+}\); \(n \in \mathbb{N}\).

We simulate the neuron for \(n = 1, \ldots, N\) discrete time steps with a temporal resolution of \(\Delta t = 5\) ms. The internal state \(V_{t_{n}}\) of the LIF neuron, also referred to as membrane potential, is computed for every time step \(t_{n}\). It is the sum of the inputs at this time, \(x_{t_{n}}\), weighted by trainable input weights \(w_\text{input}\), and the membrane potential of the previous time step, \(V_{t_{n-1}}\), weighted by the decay factor \((1 - w_\text{leak} \cdot \Delta t)\), whose leakage term \(w_\text{leak}\) prevents long temporal correlations. As this study investigates the influence of the leakage term on the network dynamics, \(w_\text{leak}\) is set to be non-trainable and therefore remains unchanged for all time steps. With the Heaviside step function \(\Theta _{i}\), we model the neuron to release a spike (\(\Theta _{1}\) in (2)) and to reset the membrane potential (\(\Theta _{2}\) in (1)) to its resting state if \(V_{t_{n}}\) surpasses the threshold \(V_\text{thresh}\) at the respective time step. If the threshold is not surpassed, \(V_{t_{n}}\) is multiplied by the decay factor \((1 - w_\text{leak} \cdot \Delta t)\) and fed back to the inner state via a recurrent connection. The output of a LIF neuron, \(y_{t_{n}}\), is 0 if no spike occurs at \(t_{n}\) and 1 otherwise. Without loss of generality, \(V_\text{thresh}\) is set to 1. As we work with spike trains as input signals, both \(x_{t_{n}}\) and \(y_{t_{n}}\) are implicitly binary, i.e. \(\{0, 1\}\), and independent of \(\Delta t\). We do not allow negative values for the inner state of the LIF neurons [cf. (1), (Gerum and Schilling 2021)].
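For illustration, Eqs. (1)–(3) can be implemented in a few lines of NumPy. The following is a minimal sketch; the function name and array layout are our own choices and do not reflect the internals of the tf_spiking package:

```python
import numpy as np

def simulate_lif(x, w_input, w_leak, dt=0.005, v_thresh=1.0):
    """Discrete LIF update of Eqs. (1)-(3) for a single neuron.

    x       : binary input spike trains, shape (n_steps, n_inputs)
    w_input : input weights, shape (n_inputs,)
    w_leak  : leak term (inverse decay time) in 1/s
    dt      : temporal resolution in s (here 5 ms)
    """
    n_steps = x.shape[0]
    v = np.zeros(n_steps)   # membrane potential V_{t_n}
    y = np.zeros(n_steps)   # output spikes y_{t_n}
    v_prev = 0.0
    for n in range(n_steps):
        # Theta_2: the previous potential only carries over if it stayed
        # below threshold, i.e. the potential is reset after a spike
        carry = v_prev if v_prev < v_thresh else 0.0
        # Eq. (1): weighted input plus decayed previous potential,
        # clipped at zero (negative inner states are not allowed)
        v[n] = max(0.0, w_input @ x[n] + (1.0 - w_leak * dt) * carry)
        # Eq. (2): a spike is released when the threshold is surpassed
        y[n] = 1.0 if v[n] >= v_thresh else 0.0
        v_prev = v[n]
    return v, y
```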

For training the SNN, we work with the surrogate gradient-based backpropagation through time approach proposed in Gerum and Schilling (2021). With this learning paradigm, supervised training by minimizing a loss function is possible and classification tasks can be solved. The loss function, in our case the mean squared error loss, is minimized by using a gradient descent algorithm and a step size (learning rate). For the optimization we use the Adam stochastic gradient descent method (Kingma and Ba 2017). Thus, a weight update can be calculated via the chain rule the same way as for multi-layer-perceptron-based artificial neural networks.
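To illustrate this learning paradigm, the following TensorFlow sketch shows a spike nonlinearity with a surrogate backward pass. The sigmoid-derivative surrogate used here is a common choice and only an assumption; the exact surrogate of Gerum and Schilling (2021) may differ:

```python
import tensorflow as tf

@tf.custom_gradient
def spike_function(v_minus_thresh):
    """Heaviside forward pass (Eq. (2)) with a surrogate backward pass."""
    # Forward: binary spike if the membrane potential surpasses threshold
    spikes = tf.cast(v_minus_thresh >= 0.0, tf.float32)

    def grad(upstream):
        # Backward: replace the undefined derivative of the step function
        # with the smooth derivative of a sigmoid (a common surrogate)
        s = tf.sigmoid(v_minus_thresh)
        return upstream * s * (1.0 - s)

    return spikes, grad
```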

As we work with image datasets but the LIF neurons have a time dimension, all image pixels are encoded as spike trains. In this study, we use Poisson rate coding (Zenke and Vogels 2021; Pfeiffer and Pfeil 2018; Lee et al. 2016), where every pixel value is translated into a probability of the corresponding input neuron to spike in each time step. After the encoding, the inputs are passed to a dense layer consisting of 128 LIF neurons. In order to obtain a final classification score consistent with the ground truth labels, the output layer sums up the incoming spikes from the hidden layer and maps them to the respective classes. We simulate the network for a duration of 500 ms with a temporal resolution of 5 ms, resulting in 100 discrete time steps. An overview of the network architecture is visualized in Fig. 1b.
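A minimal sketch of this encoding step (the function name and the normalization of pixel values to [0, 1] are our own choices):

```python
import numpy as np

def poisson_encode(image, n_steps=100, seed=None):
    """Poisson rate coding: each pixel value in [0, 1] is used as the
    per-time-step spike probability of one input neuron."""
    rng = np.random.default_rng(seed)
    p = image.reshape(-1).astype(float)   # one input neuron per pixel
    # independent Bernoulli draws per time step yield the spike train
    return (rng.random((n_steps, p.size)) < p).astype(np.uint8)

# e.g. a 28x28 MNIST image scaled to [0, 1] becomes a (100, 784)
# binary spike matrix covering the 500 ms simulation window
```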

Tuning the spiking behavior of LIF neurons via their decay times

The decay time is inversely proportional to the leak term (\(t_\text{decay} = 1 / w_\text{leak}\)) and influences both training dynamics and spiking behavior of the LIF neurons. For a high decay time (resp. low leakage), a LIF neuron simply sums up the input stimuli over time with little decay of the membrane potential and therefore operates as integrator. On the contrary, if the neuron’s decay time is low (resp. high leakage), it can only release output spikes for inputs with small time differences. In such a case, the LIF unit operates as coincidence detector. In Fig. 1c–e, we visualized the membrane potential of a single LIF neuron with a high (low) decay time and the resulting output behavior given an identical synthetic input spike train, respectively.
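The two operation modes can be reproduced directly with the update rule of Eq. (1). In the following sketch, the toy input pattern and the weight are arbitrary illustrative choices, not the stimuli of Fig. 1c:

```python
import numpy as np

dt = 0.005                                  # 5 ms time steps
x = np.zeros(100)
x[[5, 10, 15, 20, 25]] = 1                  # spikes spread over 100 ms
x[[60, 61, 62, 63]] = 1                     # a tight volley (20 ms)

def run(t_decay, w_in=0.35, v_thresh=1.0):
    # single-input version of Eq. (1), with w_leak = 1 / t_decay
    v, out = 0.0, []
    for xi in x:
        carry = v if v < v_thresh else 0.0  # reset after a spike
        v = max(0.0, w_in * xi + (1 - dt / t_decay) * carry)
        out.append(v >= v_thresh)
    return np.flatnonzero(out) * dt * 1000  # output spike times in ms

print(run(t_decay=0.480))  # integrator: also fires for the spread input
print(run(t_decay=0.030))  # coincidence detector: only the volley fires it
```

With the long decay time the neuron accumulates the temporally spread spikes and fires twice; with the short decay time only the volley drives the potential across threshold.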

Fig. 1

a: The leaky-integrate-and-fire neuron describes the relationship between input currents \(x_{i}(t)\) and the output current y(t). b: The feed-forward network architecture used in our study with an exemplary MNIST input image and the resulting classification. Whether or not a signal is time-dependent is indicated on the right. Given the input spike train (c), a neuron’s membrane potential behaves differently according to its decay time (d). If the decay time is high (480 ms), \(V_\text{m}\) stays above its resting state for some time and inputs can be integrated over multiple time steps. If it is low (30 ms), the spike threshold is only surpassed when either strong or abundant input activity rapidly stimulates the neuron. The respective output spike trains are visualized in e. The number of contributing input spikes (f) and the effective integration interval (g) are determined by backtracking input spikes in \(V_\text{m}\) after an output spike was elicited. During this process, weight as well as membrane decay effects are taken into account. By combining these measures (h), we can determine a neuron’s operation mode in the context of the total simulation time of the network

We refer to these operation modes as integrator and coincidence detector; however, neither term corresponds to a specific value of the decay time, but rather describes the neuron’s tendency of operation in the context of the network’s simulation time. Neurons with intermediate decay times (resp. an intermediate operation mode) can of course exist, and thus the proposed terms for the operation modes strongly depend on the context of the input stimuli.

As our learning rule depends on the membrane potential, the decay time also affects the training behavior of the neuron. An integrator neuron can memorize its inputs over a long duration, so the error gradient also has to be backpropagated over a longer duration than is the case for coincidence detector neurons. Tuning the decay time therefore allows modeling populations of neurons with different spiking behavior and memorization properties. For our experiments we consider decay times in the range [15, 480] ms. This interval roughly covers the simulation time of the neurons (500 ms) and excludes possible sampling artifacts given our choice of temporal resolution (5 ms). We split this range into 32 equidistant values with a 15 ms spacing and distribute the decay times uniformly throughout the network, with four neurons sharing a respective decay time. We refer to this initialization scheme as "binned uniform" (visualized in the inset in Fig. 2d).
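A sketch of this initialization (variable names are our own):

```python
import numpy as np

n_neurons, n_bins = 128, 32
# 32 equidistant decay times from 15 ms to 480 ms (15 ms spacing)
t_decay_bins = np.linspace(0.015, 0.480, n_bins)        # in seconds
# "binned uniform": four neurons share each decay time
t_decay = np.repeat(t_decay_bins, n_neurons // n_bins)
w_leak = 1.0 / t_decay                                  # inverse decay time
# the "constant" initialization would instead use a single value, e.g.
# w_leak = np.full(n_neurons, 1.0 / 0.240)
```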

Results

Coincidence detection and integration behavior in feed-forward spiking networks

In the following, we analyze how coincidence detector and integrator neurons (Gerum and Schilling 2021) perform and behave in a feed-forward neural network trained with a surrogate gradient algorithm. Specifically, we study the effect of different decay times on the spiking behavior in networks trained on four common datasets (MNIST, EMNIST/Letters, Fashion-MNIST and CIFAR-10). We investigate networks with constant decay times (i.e. equal for all LIF units of the network) and with binned uniform decay times (i.e. uniformly distributed with an equal number of neurons sharing a respective decay time). In both cases, the decay time is identical for all time steps and is not a trainable parameter. For better readability, we only report the results based on MNIST in this section; the visualizations for the other datasets can be found in Suppl. Fig. 1.

The operation mode of the neurons is quantified by the number of input spikes effectively contributing to the generation of an output spike (see Fig. 1f). A low number of contributing input spikes suggests that either the weights of these input spikes were very high, or that an input spike volley (several spikes coming from arbitrary input neurons with small inter-spike intervals) was present. As we are particularly interested in detecting coincidences, we additionally measure the average time interval in which the input spikes stimulate the LIF unit and cause it to spike. We call this the effective integration interval (see Fig. 1g).

To calculate this measure, we start with the time of the output spike and trace the membrane potential back to the first input spike that actively contributes to the generation of this output spike. During this backtracking, weight effects and membrane decay effects are taken into account. A long effective integration interval (w.r.t. the network’s simulation time) is present in integrator neurons due to little decay of the membrane potential. We expect coincidence detector neurons to have shorter integration intervals than integrator neurons and to require fewer input spikes.
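The backtracking algorithm is not spelled out in code here; the following sketch shows one plausible reading, in which an input spike counts as contributing if its weighted and decayed trace still exceeds a small cutoff at the time of the output spike. The cutoff `eps` and all names are our own assumptions:

```python
import numpy as np

def backtrack_measures(spike_in, w, t_out, t_reset, w_leak,
                       dt=0.005, eps=0.01):
    """Hypothetical sketch of the two backtracking measures.

    spike_in : binary input spikes, shape (n_steps, n_inputs)
    w        : input weights, shape (n_inputs,)
    t_out    : index of the output spike under analysis
    t_reset  : index of the last reset before t_out (start of the trace)

    Returns (number of contributing input spikes,
             effective integration interval in seconds).
    """
    decay = 1.0 - w_leak * dt
    contributing = []
    for t in range(t_reset, t_out + 1):
        for i in np.flatnonzero(spike_in[t]):
            # weight effect (w[i]) and membrane decay effect
            # (decay^(t_out - t)) determine the residual contribution
            # of this input spike at the time of the output spike
            if w[i] * decay ** (t_out - t) > eps:
                contributing.append(t)
    if not contributing:
        return 0, 0.0
    return len(contributing), (t_out - min(contributing)) * dt
```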

We compute both measures for every output spike of every neuron in the hidden layer and use them to determine the operation modes of the neurons: If both measures (effective integration interval and contributing input spikes) are low the neuron operates as coincidence detector. If both measures are high the neuron operates as integrator as indicated in Fig. 1h.

Prior to analyzing whether a low number of contributing input spikes correlates with a short effective integration interval in networks trained on real data, we investigated the decay time’s influence on both measures. Low decay times correspond to low measure scores in experiments using constant as well as binned uniformly distributed decay times, as reported in Fig. 2a–d.

Visualizing both measures in the "operation mode" scatter plot (introduced in Fig. 1h), we find that a low (high) number of contributing input spikes correlates with a short (long) effective integration interval (see Fig. 2e, f). However, we find that the decay time does not determine the exact behavior of the neuron but instead defines a range of operation. This range becomes smaller for low decay times and saturates for high decay times. These results imply that we can in fact influence the operation mode of a LIF neuron via its decay time (resp. leak term). Furthermore, we see that in a population of neurons trained on real data, integration and coincidence detection behavior emerge depending on the decay time. The networks did not simply adjust the weights to counter the effect of the decay time, but instead worked with neurons operating on different time scales.

Fig. 2

Both measures are influenced by the decay time. The two measures we introduced to determine the neuron’s operation mode are clearly influenced by the decay time in networks trained on MNIST. The results of the experiments using constant decay times are shown in a and b. Every point denotes an experiment exclusively using the respective decay time. c and d show the results of the experiment using binned uniformly distributed decay times. Here, every point denotes one bin of neurons with the respective decay time. The average numbers of contributing input spikes are visualized in plots a and c; the effective integration intervals are plotted in b and d. The decay time impacts both measures similarly in experiments with constant as well as binned uniform initialization. When visualizing the number of contributing input spikes and the respective effective integration interval w.r.t. a specific decay time as a scatter plot, a linearly-shaped distribution forms, which becomes steeper and more compact for lower decay times. This trend was observed to be identical in constant (e) and binned uniform (f) experiments. These results suggest that the decay time determines a neuron’s operational range rather than an exact operation mode. The colored lines show linear fits through the respective distributions

The distributions shown in Fig. 2e and f also become steeper for lower decay times when training on EMNIST/Letters, Fashion-MNIST and CIFAR-10 data (see Suppl. Fig. 1B). We therefore fitted lines through every distribution and the coordinate origin and computed their slopes. When plotting the slopes of these line fits over their respective decay times, we find similar curves for constant and binned uniform experiments and for all four datasets, as shown in Fig. 3a and b.
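For a line constrained through the coordinate origin, the least-squares slope has a closed form; a minimal sketch (the helper name and the axis assignment are our own assumptions):

```python
import numpy as np

def slope_through_origin(x, y):
    """Least-squares slope of a line through the origin:
    minimizing sum((y - a*x)^2) gives a = sum(x*y) / sum(x*x)."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    return (x * y).sum() / (x * x).sum()

# x: effective integration intervals, y: numbers of contributing input
# spikes of all output spikes sharing one decay time (or vice versa)
```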

An in-depth analysis of these slopes as a function of the decay times (Fig. 3) indicates that the resulting curves are shifted along the y-axis depending on the dataset.

We found that the offset along the y-axis is influenced by image brightness. We therefore adjusted the average brightness of MNIST, EMNIST/Letters and Fashion-MNIST images to approximately match (brightness difference < 0.045 in [0, 255] color space, see Fig. 3c, d). However, we also see that the curves from brightness-adjusted MNIST and unmodified EMNIST/Letters are similar, despite an average brightness difference of approx. 10 in [0, 255] color space. This suggests that not only image brightness, but also the structure of the data influences the slopes.
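The exact adjustment procedure is not specified here; one simple possibility is a multiplicative rescaling of the pixel intensities, sketched below (helper name and clipping behavior are assumptions):

```python
import numpy as np

def match_brightness(images, target_mean):
    """Scale all pixel values so the dataset's mean brightness matches
    target_mean, clipping to the valid [0, 255] range (a hypothetical
    sketch of a brightness adjustment, not the paper's exact method)."""
    scale = target_mean / images.mean()
    return np.clip(images.astype(float) * scale, 0, 255)
```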

Fig. 3

The slopes of the line fits in the scatter plots correlate with the decay time for both constant (a) and binned uniform (b) initialization. The impacts of different decay times are similar between the different datasets; however, the brightness of the input data produces an offset along the y-axis. After adjusting the average brightness of MNIST, EMNIST/Letters and Fashion-MNIST to approximately match, the curves are close in constant (c) and binned uniform (d) experiments. As they are not entirely aligned, the structure of the data also seems to affect the slopes. e and f: In order to compare the slopes between the different datasets, we subtracted the y-offset and fitted powerlaws to the curves from a and b, respectively. The powerlaw fits are presented in double logarithmic scale. For adjusted and non-adjusted data, the curves are closely aligned. In contrast, the powerlaw fits of different datasets only approximately match, suggesting an influence of the structure within the data. Remarkably, we observe the curves of MNIST and EMNIST/Letters to closely match in the binned uniform experiments, as is also the case for Fashion-MNIST and CIFAR-10. This suggests that the distribution of the pixel intensities shapes the slope of the curve

Additionally, when removing the y-offsets and fitting powerlaws to the resulting curves, we can compare the slopes for neural networks trained on the different datasets. The powerlaw fits are presented in double logarithmic scale in Fig. 3e, f. The curves of brightness-adjusted and non-adjusted data are very similar, indicating that image brightness influences the y-offset but has little impact on the shape of the curve. In general, the different powerlaw fits are not perfectly aligned, potentially due to an influence of the structure of the data. Furthermore, we observed that the fit functions for networks trained on the MNIST and EMNIST/Letters datasets are similar in the binned uniform experiments, as are the fit functions of the Fashion-MNIST and CIFAR-10 experiments. We therefore assume that the distribution of the pixel intensities shapes the slope of the curve, as MNIST and EMNIST/Letters are almost binary in intensity, whereas Fashion-MNIST and CIFAR-10 make more extensive use of the available brightness range.
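A powerlaw appears as a straight line in double logarithmic scale, so its exponent can be estimated with an ordinary linear fit. A minimal sketch, assuming positive, offset-corrected slope values:

```python
import numpy as np

def fit_powerlaw(t_decay, slopes):
    """Fit slopes ~ c * t_decay**k: in log-log space the powerlaw
    becomes the straight line log(s) = log(c) + k * log(t).
    Requires strictly positive inputs (offset already removed)."""
    k, log_c = np.polyfit(np.log(t_decay), np.log(slopes), 1)
    return np.exp(log_c), k   # prefactor c and exponent k
```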

Impact of decay times on model accuracy

Different decay times lead to changes in spiking dynamics. In the following, we show the influence of the different decay times on classification accuracy. To this end, we evaluated the five-fold cross-validated mean accuracy, macro F1-score and area under the receiver operating characteristic curve (AUC) for all experimental conditions. In Fig. 4a–c, the accuracies are reported for MNIST, Fashion-MNIST and EMNIST/Letters (for a detailed overview of accuracy, macro F1 and AUC scores on all datasets, see Suppl. Fig. 1C).
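A per-fold evaluation along these lines could look as follows, using scikit-learn; this is a sketch with our own function name, and it assumes the network outputs are normalized to per-class probabilities (the actual evaluation pipeline may differ):

```python
from sklearn.metrics import accuracy_score, f1_score, roc_auc_score

def evaluate_fold(y_true, y_prob):
    """Metrics for one cross-validation fold.

    y_true : integer class labels, shape (n_samples,)
    y_prob : per-class probabilities, shape (n_samples, n_classes),
             rows summing to one (e.g. softmax of the spike counts)
    """
    y_pred = y_prob.argmax(axis=1)
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "macro_f1": f1_score(y_true, y_pred, average="macro"),
        # one-vs-rest AUC, macro-averaged over classes
        "auc": roc_auc_score(y_true, y_prob,
                             multi_class="ovr", average="macro"),
    }
```

Means and standard deviations over the five folds are then aggregated from these per-fold dictionaries.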

The binned uniform initialization leads to performance similar to that of the best models with constant decay times.

In the next step, we investigated the influence of the decay time on overall classification accuracy by cumulatively ablating neurons, starting either with coincidence detectors (ablation at low decay times first) or with integrator neurons (ablation at high decay times first). The results for all datasets are reported in Fig. 4d–i. Deleting integrator neurons first leads to an instant accuracy drop for the MNIST (4d), Fashion-MNIST (4f) and EMNIST/Letters (4h) datasets. For CIFAR-10 (4i), no preference for either operation mode can be detected.
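The ablation procedure can be sketched as follows; `evaluate_accuracy` is a hypothetical callback that re-evaluates the trained network with the given hidden neurons silenced (no retraining):

```python
import numpy as np

def ablation_curve(evaluate_accuracy, t_decay, ascending=True, step=4):
    """Cumulatively silence hidden neurons sorted by decay time.

    ascending=True  removes coincidence detectors (low decay) first,
    ascending=False removes integrators (high decay) first.
    """
    order = np.argsort(t_decay)
    if not ascending:
        order = order[::-1]
    accuracies = []
    for k in range(0, len(order) + 1, step):
        silenced = order[:k]            # indices of ablated neurons
        accuracies.append(evaluate_accuracy(silenced))
    return accuracies
```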

Comparing the ablation curves of MNIST (4d) and brightness-adjusted MNIST (4e), we find that increasing the image brightness leads to a smaller difference between the ascending (green) and descending (blue) ablation curves. Consistently, a greater difference is observed when lowering the image brightness of Fashion-MNIST (see 4f, g). This suggests that the performance difference between integrator and coincidence detector neurons is linked to image brightness.

In summary, integrator neurons seem to be more important for classification performance. However, this result must be interpreted with caution due to the observed dependence on image brightness (resp. spike rate, given the Poisson spike encoding).

Fig. 4

Accuracy and cumulative decay-based ablation. The accuracy scores of networks trained on MNIST (a), EMNIST/Letters (b) and Fashion-MNIST (c) are visualized w.r.t. the decay time of experiments using constant initialization. In the binned uniform experiment, the decay times were distributed equidistantly over 32 bins in the range [15, 480] ms. Every dot shows the five-fold cross-validated mean accuracy and the respective standard deviation. The accuracy is similar between most decay times within every dataset; however, a tendency to decrease for low decay times can be noticed. The experiments using binned uniform initialization achieve approx. equal accuracy to the best models using constant decay times. The impact of different decay times on classification accuracy can be determined by cumulatively ablating neurons from the binned uniform experiments w.r.t. their decay times. We start with ablating coincidence detectors (ascending decay time) or integrators (descending decay time) first, respectively. The results for all datasets are visualized in d, f, h, i. For MNIST, EMNIST/Letters and Fashion-MNIST, ablating neurons with high decay times (desc. decay time), i.e. integrators, first results in a major loss of accuracy more rapidly than ablating coincidence detectors first (asc. decay time). Comparing the models trained on MNIST (d) and Fashion-MNIST (f) to their adjusted versions (e and g), respectively, the image brightness seems to influence the importance of the different operation modes. A higher brightness in MNIST images decreases the difference of the ascending and descending ablation curves, while reducing the brightness in Fashion-MNIST images increases the difference of the curves

Discussion

Summary

In the present study, we investigated whether the previously proposed operation modes of single LIF neurons do emerge in spiking neural networks that are trained on real data. We thus studied the influence of tuning the membrane decay time [i.e. membrane time constant (Perez-Nieves et al. 2021) or inverse leak term] on the operation modes of LIF neurons. Furthermore, we analyzed the resulting effects on spiking dynamics and image classification accuracy of SNNs. We found that the proposed operation modes do emerge in SNNs and that they can be tuned via the membrane decay time: Neurons with low decay times operate as coincidence detectors, whereas neurons with high decay times operate as integrators.

We performed experiments with four image datasets (MNIST, EMNIST/Letters, Fashion-MNIST and CIFAR-10), which were transformed to spike sequences by Poisson encoding (Zenke and Vogels 2021; Pfeiffer and Pfeil 2018; Lee et al. 2016). We deployed feed-forward SNNs with a single hidden layer and used the surrogate-gradient-based training method proposed by Gerum and Schilling (2021). In this study, we experimentally investigated the effect of different membrane decay times and considered two different ways of initializing them: constant decay times and uniformly distributed decay times, respectively.

In order to study the relationship between membrane decay times and operation modes of LIF neurons, we proposed two measures: the number of contributing input spikes and the effective integration interval.

We found the first measure, the number of contributing input spikes, to be low for coincidence detectors, whereas it was found to be high for integrator neurons. The second measure, the effective integration interval, was also found to be higher for integrator neurons.

We analyzed the distribution of the two measures across the SNNs with respect to particular membrane decay times and found the two measures to linearly correlate. Besides investigating the relationship between the measures and a neuron’s operation mode, we found that both measures are strongly influenced by the neuron’s membrane decay time (see Fig. 2a–d).

Therefore, we can conclude that the operation mode of a LIF neuron can be determined via the membrane decay time. A low decay time corresponds to a low number of contributing input spikes and a short effective integration interval; consequently, the LIF neuron operates as a coincidence detector. High decay times correspond to high numbers of contributing input spikes as well as long effective integration intervals; such neurons operate as integrators. However, saturation effects for high decay times were visible in both measures. This suggests a non-linear relationship between the membrane decay time and a neuron’s respective operation mode.

Our experiments give strong evidence that LIF neurons can be precisely tuned towards detecting coincidences, whereas integration behavior is not as accurately determined: our analyses show that the effective integration interval and the number of contributing input spikes are confined to a narrower range (i.e. more tightly clustered) for low decay times (see Fig. 2e, f). We can therefore conclude that the membrane decay time defines a neuron’s operational range rather than an exact operation mode. This operational range becomes smaller (resp. more precise) for lower decay times.

Additionally, we found that the correlation factor between the two measures as a function of the membrane decay time follows a powerlaw. Therefore, the decay time offers a different way of influencing the spiking dynamics compared to the synaptic weights: while the weights linearly influence the spiking behavior of a LIF layer, the membrane decay time influences the spiking dynamics in a non-linear way.

The powerlaw relation between the measures and the decay time was present in all experiments and all datasets, and we therefore argue that it is an intrinsic property of LIF neurons. However, we found it to slightly differ between datasets, and it is still not completely clear to what extent the powerlaw relation is influenced by the structure of the input data. We observed that the brightness of the input data linearly shifts the curve of the correlation factor but has little influence on its slope (see Fig. 3c–f). The observed similarities of the fit functions between MNIST and EMNIST/Letters and those between Fashion-MNIST and CIFAR-10, respectively, thus suggest that the exact shape of the powerlaw is indeed influenced by the structure of the input data, e.g. the distribution of the pixel intensities.

Besides showing the emergence of coincidence detectors and integrators in SNNs and analyzing the resulting effects on the spiking dynamics, we explored the impact of different operation modes on image classification accuracy. For that, we cumulatively ablated neurons according to their decay times. Neurons were ablated either in ascending or descending order, respectively (see Fig. 4d–i).

Ascending ablation refers to deleting coincidence detectors first, which forces the network to use neurons that operate as integrators.

Descending ablation refers to deleting integrator neurons first, which forces the network to use neurons that operate as coincidence detectors.

We found that ablating integrator neurons had a more severe effect on classification accuracy than ablating coincidence detectors, which indicates that integrator neurons are more important in our experiments.

Additionally, we found the difference between the ascending and descending ablation curves to be strongly dependent on image brightness. This is likely due to encoding the images via a Poisson process. A brighter pixel is represented by a higher spike probability of the respective input neuron. Consequently, the spike rate of such neurons is higher compared to neurons that encode darker pixels. Therefore, when the image brightness decreases, the spiking activity in the SNNs becomes sparser and, consequently, fewer spikes coincide. In order to achieve good classification performance in such a case, integrator neurons, i.e. long decay times, are required, as they are able to memorize the information carried by spikes over a longer duration.

Limitations of the study

It has to be noted, however, that the Poisson process encodes all the information of an image pixel in the spike rate of the respective input neuron. As a result, detecting a coincidence in our experiments does not provide more or different information, but less than simply integrating over an arbitrary number of spikes. Because of that, we found integrator neurons to be more important than coincidence detectors in terms of classification performance.

Even though this limits the results of our ablation study, we can conclude that when working with spike rates, tuning the membrane decay times can be neglected, and training the synaptic weights is sufficient to achieve good classification performance. However, Perez-Nieves et al. (2021) showed that tuning the time constants can result in improved network performance when information is encoded not only in the spike rate but in the spike timing as well. We therefore argue that, when working with such data, tuning the membrane decay times of LIF neurons should be taken into account. This can be achieved either by considering the membrane decay times as trainable parameters, as proposed by Gerum (2020b), or alternatively, by treating the distribution of decay times as a hyper-parameter, as we did in this study.

We thus want to emphasize that the timings of spikes are important when working with data from neuromorphic sensors like dynamic vision sensors or artificial cochleas (Eshraghian et al. 2021). We will therefore shift our focus to working with neuromorphic sensor data in the future.

Also, we only considered small feed-forward SNNs in this study due to the computational complexity induced by training SNNs [see also (Perez-Nieves et al. 2021)]. This limits our experimental evidence, as more complex effects could potentially emerge in larger—or even recurrent—spiking neural networks. Still, our experimental setup is a reasonable choice as there currently is no viable alternative to training SNNs in time-stepped simulation frameworks when good performance needs to be achieved (Eshraghian et al. 2021).

Discussion and future research directions

In summary, we demonstrated the rich dynamics of LIF-based spiking neural networks trained with surrogate gradient descent and provided evidence for the validity of defining two operation modes of LIF neurons: the integrator and the coincidence detector.

We show that the coincidence detection mechanisms that have been observed in biological neural networks by multiple studies can be reproduced in LIF neurons by tuning the membrane decay times.

A recent study already showed that heterogeneous time constants can improve the performance of LIF-based SNNs (Perez-Nieves et al. 2021). Much work has already been devoted to investigating the effect of weight matrix heterogeneity on network dynamics [e.g. (Krauss et al. 2019b; Yang et al. 2021; Krauss et al. 2019c, a)]. However, the exact temporal dynamics have only recently moved into the focus of AI research.

As the temporal dynamics of LIF neurons are influenced by multiple parameters (e.g. membrane time constant, spike rate adaptation, data encoding), our aim was to disentangle these parameters and to study the impact of different membrane decay times. With this study, we contribute towards better understanding the dynamics of SNNs by providing experimental evidence for the emergence of different neural operation modes and their dependence on the membrane decay time.

Because the timing of spikes is important when working with neuromorphic sensor data, we strongly encourage the neuromorphic community to consider tuning the operation mode of LIF neurons in future experiments and to consider the membrane decay time in new training methods.

Concluding remarks

To the best of our knowledge, this study is the first to investigate the integration and coincidence detection behavior of LIF neurons in spiking neural networks. We thereby contribute to decoding heterogeneity as a fundamental principle of brain dynamics and of efficient information processing in SNNs.

As already suggested in Jonas and Kording (2017), the best way to understand a complex system like the brain or artificial neural networks is to search for already known building blocks (i.e. integrators and coincidence detectors). In a nutshell, a mechanistic theory is necessary to make real progress in understanding cognition in biological and artificial neural networks (Jonas and Kording 2017; Schilling et al. 2023).