[1]\fnmJianguo \surZhang [2]\fnmLuziwei \surLeng

1]\orgdivDepartment of Computer Science and Engineering, \orgnameSouthern University of Science and Technology, \orgaddress\cityShenzhen, \countryChina

2]\orgdivACS Lab, \orgnameHuawei Technologies, \orgaddress\cityShenzhen, \countryChina

3]\orgdivCollege of Mathematics and Statistics, \orgnameChongqing University, \orgaddress\cityChongqing, \countryChina

4]\orgdivDepartment of Computer Science, \orgname City University of Hong Kong, \orgaddress\cityHong Kong, \countryChina

Evolutionary Spiking Neural Networks: A Survey

\fnmShuaijie \surShen 12132355@mail.sustech.edu.cn    \fnmRui \surZhang 12032455@mail.sustech.edu.cn    \fnmChao \surWang 15662672289@163.com    \fnmRenzhuo \surHuang 12332478@mail.sustech.edu.cn    \fnmAiersi \surTuerhong 20211385@stu.cqu.edu.cn    \fnmQinghai \surGuo guoqinghai@huawei.com    \fnmZhichao \surLu zhichao.lu@cityu.edu.hk    zhangjg@sustech.edu.cn    lengluziwei@huawei.com [ [ [ [
Abstract

Spiking neural networks (SNNs) are gaining increasing attention as potential computationally efficient alternatives to traditional artificial neural networks (ANNs). However, the unique information propagation mechanisms and the complexity of SNN neuron models pose challenges for adopting traditional methods developed for ANNs to SNNs. These challenges include both weight learning and architecture design. While surrogate gradient learning has shown some success in addressing the former challenge, the latter remains relatively unexplored. Recently, a novel paradigm utilizing evolutionary computation methods has emerged to tackle these challenges. This approach has resulted in the development of a variety of energy-efficient and high-performance SNNs across a wide range of machine learning benchmarks. In this paper, we present a survey of these works and initiate discussions on potential challenges ahead.

keywords:
Spiking Neural Networks, Evolutionary Algorithm, Neural Architecture Search

1 Introduction

Spiking neural network (SNN) [1] has attracted increasing attention as a potential alternative to the traditional artificial neural network (ANN), with appealing attributes such as sparse computation and temporal dynamics. Recently, the problem associated with training deep SNNs has been arguably addressed by surrogate gradient (SG) methods where soft relaxed functions were used to approximate the original non-existing gradient of binary spiking activation [2, 3, 4]. Based on these methods, SNNs have achieved high-level performances in various deep learning tasks such as image classification on CIFAR [5] and ImageNet [6] datasets [7, 8, 9]. However, these approaches often adopted architectures optimized for ANNs and directly applied them to SNNs, which are likely to be sub-optimal for spike-based computation, resulting in unsatisfactory performance. In particular, the performance degradation was more apparent in tasks where variations in network architectures are required, such as dense image prediction [10, 11, 12]. To close this gap, a steady stream of works has been proposed, including improving SNN training and configuration of spike-based operators [13, 14, 15]. However, these methods largely relied on handcrafted design via trial and error. Alternatively, the recently emerged evolutionary approaches leveraged search algorithms to optimize network architectures and have been demonstrated to be efficient in constructing high-performance SNNs. When coupled with evolutionary algorithms (EAs), SNNs undergo automated optimization, adjusting hyperparameters, operations and architecture through evolutionary principles, while conventional SNNs are often handcrafted and only the connection weights are trained. Compared with the handcrafted SNNs, SNNs with EAs offer the advantages of enhanced flexibility, allowing them to meet different requirements in, e.g. computation cost, biological plausibility and hardware compatibility, etc.

This work provides a review of recent works of evolutionary SNNs. Previous review works on SNN have covered topics such as biological-inspired strategies [16], learning rules [16, 17], energy efficiency [18], applications [19] and membrane computing theory [20]. Our work differs from these since it particularly discusses recent ideas of applying evolutionary algorithms to improve deep SNNs as an alternative to traditional handcrafted methods. We first introduce spiking neuron models which serve as fundamental building blocks in Section 2. In Section 3, we briefly introduce the learning algorithms for SNNs, which are critical for realizing functional networks and highly involved in evolutionary methods in SNN. In Section 4, we review recent works on evolutionary SNNs and provide a background introduction of evolutionary algorithms (EAs) in the beginning. Finally, we provide a comprehensive and quantitative comparison of recent works on benchmark image classification tasks and conclude with potential challenges of current evolutionary approaches for SNNs.

2 Spiking Neuron Models

In this section, we first introduce a general neuron model, Hodgkin–Huxley model [21], and then present some simplified spike neuron models commonly used in modern SNNs. Although a variety of spiking neuron models were proposed in prior works, only a few of them were adopted in modern deep SNNs due to computational efficiency and memory conservation.

2.1 Hodgkin–Huxley Model

Based on experiments conducted on the giant axon of a squid, the Hodgkin–Huxley (HH) model [21] describes how action potentials in neurons are initiated and propagated. The model relates the variation of the membrane potential to the concentration of K+ and Na+:

Cdudt=kIk(t)+I(t),𝐶d𝑢d𝑡subscript𝑘subscript𝐼𝑘𝑡𝐼𝑡C\frac{\mathop{}\!\mathrm{d}u}{\mathop{}\!\mathrm{d}t}=-\sum_{k}I_{k}(t)+I(t),italic_C divide start_ARG roman_d italic_u end_ARG start_ARG roman_d italic_t end_ARG = - ∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) + italic_I ( italic_t ) , (1)

where C𝐶Citalic_C is membrane capacitance, u𝑢uitalic_u is the membrane potential, I𝐼Iitalic_I is the current injected into cell, i.e. the synaptic input current, and kIksubscript𝑘subscript𝐼𝑘\sum_{k}I_{k}∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the sum of the ionic currents which pass through the cell membrane. kIksubscript𝑘subscript𝐼𝑘\sum_{k}I_{k}∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is related to three types of channel which can be characterized by the conductance of the ion channels and the leakage channel, and is formulated by:

kIk=gNam3h(uENa)+gKn4(uEK)+gL(uEL),subscript𝑘subscript𝐼𝑘subscript𝑔𝑁𝑎superscript𝑚3𝑢subscript𝐸𝑁𝑎subscript𝑔𝐾superscript𝑛4𝑢subscript𝐸𝐾subscript𝑔𝐿𝑢subscript𝐸𝐿\sum_{k}I_{k}=g_{Na}m^{3}h(u-E_{Na})+g_{K}n^{4}(u-E_{K})+g_{L}(u-E_{L}),∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_g start_POSTSUBSCRIPT italic_N italic_a end_POSTSUBSCRIPT italic_m start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_h ( italic_u - italic_E start_POSTSUBSCRIPT italic_N italic_a end_POSTSUBSCRIPT ) + italic_g start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ( italic_u - italic_E start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ) + italic_g start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ( italic_u - italic_E start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ) , (2)

where gNasubscript𝑔𝑁𝑎g_{Na}italic_g start_POSTSUBSCRIPT italic_N italic_a end_POSTSUBSCRIPT, gKsubscript𝑔𝐾g_{K}italic_g start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT, and gLsubscript𝑔𝐿g_{L}italic_g start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT are the maximum conductance of Na+ channel, K+ channel, and leakage channel, respectively. ENasubscript𝐸𝑁𝑎E_{Na}italic_E start_POSTSUBSCRIPT italic_N italic_a end_POSTSUBSCRIPT, EKsubscript𝐸𝐾E_{K}italic_E start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT and ELsubscript𝐸𝐿E_{L}italic_E start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT are the reversal potential. m𝑚mitalic_m, hhitalic_h and n𝑛nitalic_n are voltage-dependent ”gating” variables to model the probability that a channel is open at a given moment in time.

Because of the demanding computation and complex simulation [22] [23], HH model is not practical for large-scale tasks or deployment on ASIC, therefore, some simplified models are proposed to reduce the demand for computational resources.

2.2 Leaky Integrate and Fire Model

The leaky integrate-and-fire (LIF) model is an extremely simplified model that only remains the key points of voltage evolution and simplifies the detail of evolution. The voltage of leaky integrate-and-fire model evolves according to

τddtu=(uurest)+RI,𝜏dd𝑡𝑢𝑢subscript𝑢𝑟𝑒𝑠𝑡𝑅𝐼\tau\frac{\mathop{}\!\mathrm{d}}{\mathop{}\!\mathrm{d}t}u=-(u-u_{rest})+RI,italic_τ divide start_ARG roman_d end_ARG start_ARG roman_d italic_t end_ARG italic_u = - ( italic_u - italic_u start_POSTSUBSCRIPT italic_r italic_e italic_s italic_t end_POSTSUBSCRIPT ) + italic_R italic_I , (3)

where τ𝜏\tauitalic_τ is the time constant, urestsubscript𝑢𝑟𝑒𝑠𝑡u_{rest}italic_u start_POSTSUBSCRIPT italic_r italic_e italic_s italic_t end_POSTSUBSCRIPT is the membrane potential of the neuron at rest, R𝑅Ritalic_R is a voltage-dependent resistance and I𝐼Iitalic_I is the input current. When the voltage reaches a formal threshold, a spike is generated and then the voltage is instantly reset to a new value, as following describes

u:ϑϑ,uvr,:𝑢formulae-sequencesuperscriptitalic-ϑitalic-ϑ𝑢subscript𝑣𝑟u:\vartheta^{-}\to\vartheta,\,u\leftarrow v_{r},italic_u : italic_ϑ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT → italic_ϑ , italic_u ← italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , (4)

where ϑitalic-ϑ\varthetaitalic_ϑ is the threshold which once reaches from below a spike is generated, and vrsubscript𝑣𝑟v_{r}italic_v start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT is the reset voltage. In modern SNNs, a discrete version of LIF neuron model is often adopted, described by:

ut=ut11τ(ut1urest)+xt+1,superscriptsubscript𝑢𝑡subscript𝑢𝑡11𝜏subscript𝑢𝑡1subscript𝑢𝑟𝑒𝑠𝑡subscript𝑥𝑡1u_{t}^{\prime}=u_{t-1}-\frac{1}{\tau}(u_{t-1}-u_{rest})+x_{t+1},italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_u start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG ( italic_u start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT italic_r italic_e italic_s italic_t end_POSTSUBSCRIPT ) + italic_x start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT , (5)

where utsuperscriptsubscript𝑢𝑡u_{t}^{\prime}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is the temporary voltage because the neuron will immediately determine whether to fire a spike based on the voltage.

st=𝕀utϑ,subscript𝑠𝑡subscript𝕀superscriptsubscript𝑢𝑡italic-ϑs_{t}=\mathbb{I}_{u_{t}^{\prime}\geq\vartheta},italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = blackboard_I start_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≥ italic_ϑ end_POSTSUBSCRIPT , (6)

where when utϑsuperscriptsubscript𝑢𝑡italic-ϑu_{t}^{\prime}\geq\varthetaitalic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≥ italic_ϑ, st=1subscript𝑠𝑡1s_{t}=1italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 1, otherwise, st=0subscript𝑠𝑡0s_{t}=0italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 0. And then the final value of voltage is determined based on stsubscript𝑠𝑡s_{t}italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT.

ut=(1st)ut+stur,subscript𝑢𝑡1subscript𝑠𝑡superscriptsubscript𝑢𝑡subscript𝑠𝑡subscript𝑢𝑟u_{t}=(1-s_{t})u_{t}^{\prime}+s_{t}u_{r},italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ( 1 - italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , (7)

where the voltage is reset to a new value ursubscript𝑢𝑟u_{r}italic_u start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT if the neuron generates a spike, which is also known as ”hard reset”, while deducing threshold from utsuperscriptsubscript𝑢𝑡u_{t}^{\prime}italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is known as ”soft reset”, which is widely used in ANN-SNN conversion to pursue approximate equivalence.

LIF model, as its name suggests, its membrane voltage decays over time. But in some cases, a non-leaky model, also known as IF model will be used.

When coupled with the discrete LIF model, SNNs can be conceptualized as ANNs with multiple time-steps. In this framework, the activation of neuron in each time-step is influenced by the status of the activation in the previous time-step, reflecting the temporal dynamics of neural activity. In many cases, the final output of SNNs is computed as the mean of all outputs over all time-steps, and the loss function is computed using the final output. In certain cases, the loss function is also computed using the output from each time-step individually, and then aggregated to obtain an overall loss [24].

3 Training of Spiking Neural Network

In this section, we briefly introduce the training methods of SNN which are highly involved in evolutionary SNN approaches. Towards training a brain-inspired neural network, learning methods inspired from biological nervous system are adopted, e.g., Spike-Timing-Dependent Plasticity (STDP), Anti-Hebbian STDP (aSTDP) and their Variants. For deep learning tasks, the training of SNNs can be mainly divided into two branches, i.e. ANN-to-SNN conversion and surrogate gradient training methods. The former is motivated by the success of ANNs, aiming to convert neurons with floating-value activation to spiking activation, based on weight transformation techniques. The later directly trains SNNs by approximating gradient-decent-based training methods with surrogate gradient functions.

3.1 Spike-Timing-Dependent Plasticity

Spike-Timing-Dependent Plasticity, a form of Hebbian learning [25], is an unsupervised learning method that adjusts the strength of connections (weight) between neurons based on the relative spike timing between them [26, 27]. More specifically, when the presynaptic neuron fires a spike before a postsynaptic neuron firing, the strength of the connection (synaptic weight) between them is increased, otherwise, it is decreased, which are known as long-term potential (LTP) [28] and depression (LTD) [29], respectively. However, there are many neuronal systems not following Hebbian rule, which is known as anti-Hebbian STDP (aSTDP) that modifies the weight in a reverse way [30]. Besides, some STDP variants like Mirrored STDP (mSTDP) [31], Probabilistic STDP [32] and Reward Modulated STDP (R-STDP) [33] are introduced for specific purposes. Although these training methods are biologically inspired and easy to implement, they are yet to be demonstrated as competitive as back-propagation training [34] for hard deep learning tasks [35].

3.2 ANN-SNN Conversion

The difficulty of training SNNs and the high performances of ANNs in recent years have motivated researchers to convert the pre-trained ANNs to SNNs [36, 37, 38, 39, 40, 41]. Due to the intrinsic difference of information representation, SNNs obtained by early conversion-based methods often took dozens or even hundreds of steps to achieve competitive performance with the pre-trained ANNs [42, 43, 44]. To this end, researchers recently proposed more efficient conversion methods including quantized activation functions by which the converted SNNs can achieve almost the same performance as ANNs in very few time steps [45, 46, 47].

3.3 Surrogate Gradient Learning Algorithm

The success of ANNs relies heavily on back-propagation training [34], which also prompts researchers to using similar methods in SNNs. However, the intrinsic non-differentiability of the spike generation function (equation 6) makes back-propagation impossible in training SNNs. To solve the problem, many works propose surrogate gradient (SG) functions to replace the non-differential term in back-propagation [2, 3, 48]. More specifically, the Dirac delta function, the real derivative of spike generation function, is replaced by a function of approximate shape, e.g. piece-wise quadratic function [49, 3, 50] and sigmoid function [50, 51]. However, the SG method also introduces noise because of the mismatched gradient during the training process, making it difficult for SNNs to surpass ANNs. To this end, recent works proposed solutions such as adaptive SGs [13, 14, 52], membrane potential normalization [53, 54] or improving spike-based operators [15], etc. Although SNNs have high forward inference speed and theoretically low energy consumption due to sparse computation, they require a relative high amount of memory and computation during training compared to ANNs because of the extra temporal dimension. To alleviate these problems, recent works reduced the amount of calculation in back-propagation by taking advantage of the sparsity of SNNs [55, 56, 57].

4 Evolutionary Spiking Neural Networks

In this section, we first make a briefly introduction on EAs as background knowledge and focus on methods that have been applied to SNNs. We then present and analyse recent works of evolutionary SNNs. In the end, we summarize and provide a comprehensive comparison of recent deep evolutionary SNNs on benchmark image classification tasks.

4.1 Evolutionary Algorithms

Drawing inspiration from the biological process of evolution, EAs are effectively utilized in fields such as optimization, learning, and design. These algorithms operate on the principle of a population-based generate-and-test method. The generation phase involves the mutation and/or recombination of individuals within a population, emulating natural evolutionary processes. Subsequently, in the testing phase, the algorithm selects the next generation by choosing from both the parents and their offspring, based on their performance or fitness. In this continuous process of generating and testing, the evolutionary algorithm iteratively refines the population until a satisfying solution is identified, adhering to predefined halting criteria.

A simple evolutionary algorithm is described in Algorithm 1.

Algorithm 1 Evolutionary Algorithm
1:Initialize the population P={x1,x2,,xn}𝑃superscript𝑥1superscript𝑥2superscript𝑥𝑛P=\{x^{1},x^{2},...,x^{n}\}italic_P = { italic_x start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , … , italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT } and calculate their fitness;
2:while The termination criteria is not met do
3:     Select parents Pparentssubscript𝑃𝑝𝑎𝑟𝑒𝑛𝑡𝑠P_{parents}italic_P start_POSTSUBSCRIPT italic_p italic_a italic_r italic_e italic_n italic_t italic_s end_POSTSUBSCRIPT from the population P𝑃Pitalic_P;
4:     Reproduce children Pchildsubscript𝑃𝑐𝑖𝑙𝑑P_{child}italic_P start_POSTSUBSCRIPT italic_c italic_h italic_i italic_l italic_d end_POSTSUBSCRIPT from Pparentssubscript𝑃𝑝𝑎𝑟𝑒𝑛𝑡𝑠P_{parents}italic_P start_POSTSUBSCRIPT italic_p italic_a italic_r italic_e italic_n italic_t italic_s end_POSTSUBSCRIPT by mutation and crossover;
5:     Evaluate fitness of Pchildsubscript𝑃𝑐𝑖𝑙𝑑P_{child}italic_P start_POSTSUBSCRIPT italic_c italic_h italic_i italic_l italic_d end_POSTSUBSCRIPT;
6:     Let individuals in P𝑃Pitalic_P and Pchildsubscript𝑃𝑐𝑖𝑙𝑑P_{child}italic_P start_POSTSUBSCRIPT italic_c italic_h italic_i italic_l italic_d end_POSTSUBSCRIPT compete based on fitness, then select survived individuals as new population Psuperscript𝑃P^{\prime}italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, and assign Psuperscript𝑃P^{\prime}italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT to P𝑃Pitalic_P;
7:end while
8:return final population P𝑃Pitalic_P

The population P𝑃Pitalic_P, contains a set of individuals x1,x2,,xnsuperscript𝑥1superscript𝑥2superscript𝑥𝑛x^{1},x^{2},...,x^{n}italic_x start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , … , italic_x start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT with n𝑛nitalic_n denoting the total number of individuals in the population. Each individual is represented by a string termed chromosome or genotype. Pparentssubscript𝑃𝑝𝑎𝑟𝑒𝑛𝑡𝑠P_{parents}italic_P start_POSTSUBSCRIPT italic_p italic_a italic_r italic_e italic_n italic_t italic_s end_POSTSUBSCRIPT represents the chosen parents from the current population P𝑃Pitalic_P. Usually, individuals with higher fitness are more likely to be selected. Pchildsubscript𝑃𝑐𝑖𝑙𝑑P_{child}italic_P start_POSTSUBSCRIPT italic_c italic_h italic_i italic_l italic_d end_POSTSUBSCRIPT are generated from Pparentssubscript𝑃𝑝𝑎𝑟𝑒𝑛𝑡𝑠P_{parents}italic_P start_POSTSUBSCRIPT italic_p italic_a italic_r italic_e italic_n italic_t italic_s end_POSTSUBSCRIPT using mutation and crossover operators, which are key methods in evolutionary algorithms to create new individuals. The fitness is a measure of how well an individual (Pchildsubscript𝑃𝑐𝑖𝑙𝑑P_{child}italic_P start_POSTSUBSCRIPT italic_c italic_h italic_i italic_l italic_d end_POSTSUBSCRIPT) adapts to the environment. This loop continues until a termination criterion is met, e.g. reaching a maximum number of iterations or finding a sufficiently good solution.

Essentially, EAs are not just one algorithm, but a whole family of different algorithms, each with its own history, methods and strategies. Examples include Genetic Algorithms (GAs)[58] that use natural selection concepts, Evolutionary Programming (EP)[59] which focuses on evolving behaviors, Evolution Strategies (ES)[60, 61] for optimizing real values, Genetic Programming (GP)[62, 63] that evolves computer programs, Differential Evolution (DE)[64] known for its straightforward optimization approach, and Particle Swarm Optimization (PSO)[65] inspired by the social behavior of birds and fish. Each of these represents a unique approach within the broad scope of EAs, leading to their widespread application in various specialized fields.

4.2 Evolutionary SNNs

The inherent complexity of SNNs naturally provides diverse optimization objectives. Early works have applied various EAs, such as GA, DE, PSO and harmony search algorithms on SNNs, optimizing hyperparameters including network topology, spiking neurons, synaptic weights and delay [66, 67, 68, 69, 70, 71], etc. However, limited by the hardware computing power, only shallow SNNs and small scale datasets were adopted in experiments, with tasks ranging from classification to robotic control. Boosted by the hardware and software advancements of deep learning, more recent evolutionary SNN works have extended to deeper networks and competed with both handcrafted SNNs and benchmark ANNs. We categorize these works into neural architecture search (NAS) and non-NAS methods.

4.2.1 NAS methods

AutoSNN [72] proposed a spike-aware neural architecture search (NAS) framework and applied to image classification on CIFAR and Tiny-ImageNet datasets. The work defines a two-level search space on backbone architecture and candidate block sets. It further analyzes the effects of the architecture components on the accuracy and number of spikes, which suggests using max pooling over global average pooling as down-sampling layers in SNNs. To reduce search cost induced by training and evaluating candidate architectures, the work adopts a one-shot weight-sharing approach [73, 74] using evolutionary algorithm. The work also proposes a spike-aware fitness function to penalize architectures with more spikes. Based on the fitness function, an evolutionary algorithm is performed on the trained supernet. New populations are generated using mutation and crossover by the parent population. Finally, the best architecture with the highest fitness value is selected, achieving an energy-efficient SNN with low spike rate and competitive performance. However, the algorithm is relatively complex and takes a long time to train the supernet. Additionally, the single-path sampling makes the training of spiking blocks unstable.

Concurrent to AutoSNN, [75] proposed SNASNet to find optimal architectures using NAS without training the SNN, which typically requires significantly longer training time compared to ANNs. Motivated by the observation that network architecture with a high representation power at initialization is likely to achieve higher post-training accuracy [76, 77], SNASNet finds an SNN-friendly architecture by selecting the architecture that can represent diverse spike activation patterns across different data samples without training. To this end, the work proposes Sparsity-Aware Hamming Distance (SAHD) for addressing sparsity variation of LIF neurons. In addition, to explore temporal information represented by spikes, the work searches for both the forward as well as backward connections between layers, and achieves higher performance with backward connections.

Following the weight sharing approach, SpikeDHS [52] adopted a gradient-based differentiable architecture search (DARTS) [78, 79] method to search SNNs. The supernet is trained end-to-end in a continuous relaxation search space where both the cell and layer structure are optimized alternately with the model weights. In addition, the work ensures multiplication-free inference (MFI) [80] which is essential for SNNs to achieve low power computing, by defining the cell operation with spike-based computation. To solve the noise problem induced by fixed SGs, in the retraining phase, a differentiable surrogate gradient search (DGS) method is proposed which evolves the SG function by adjusting its shape periodically. The searched SNNs are applied to static image classification on CIFAR and ImageNet datasets, as well as challenging dynamic event-based deep stereo tasks on the MVSEC dataset [81], both achieving high performances meanwhile with much lower energy consumption than ANNs. The searched SNN encoder topology is also applied to event-based object detection [82], demonstrating the versatility and efficiency of the method. A limitation of the work is that since the searched candidates are mixed by a weighted average, when difference between the coefficients is not significant, the decoded architecture could be suboptimal, which is observed in classification tasks. However, for tasks which require critical structure variation, such as dense prediction, this limitation is not notable, which is also demonstrated in extended works based on SpikeDHS [83].

Compared to previous NAS works on SNNs, [84] considered more biological principles in the search space and proposed the brain-inspired neural circuits evolution (NeuEvo) strategy, which involves feedback connection, excitatory and inhibitory neurons [85] and local learning. The work adopts an alternative bi-level optimization approach on synaptic weights and structure coefficients similar to previous works [78, 52]. It further exploits the local spiking behavior of neurons to evolve neural circuits with STDP and updates the synaptic weights in combination with gradient-based global error signals. The evolved SNNs are applied to static and event-based image classification, and reinforcement learning tasks, achieving competitive performances to ANNs meanwhile with highly sparse network activation.

For the purpose of a better accuracy-computational cost trade-off, [86] adopted a single-path NAS [87] approach, encoding all candidate architectures in a branchless spiking supernet, which requires much less computation and thus significantly reduces the search time. Rather than training independent kernels with different kernel sizes, the framework trains a branchless superkernel inspired by [87] whose actual receptive field is controlled by trainable parameters. In addition, synaptic operation (SynOps)-aware optimization is proposed to find a computationally efficient subspace of the supernet.

4.2.2 Non-NAS methods

To reduce the computation cost of the search process, [88] proposed spatio-temporal topology sampling (STTS) algorithm for SNN which samples both complex spatial topology and temporal topology by incorporating the synaptic delay. The work adopts a multi-stage downsampling structure and spiking convolution node design similar to SpikeDHS [52]. Differently, light-weight depthwise separable convolution [89] is used to reduce model parameters. Furthermore, the topology of the network is not optimized by gradient-based learning rules, but sampled by random graph models for spatial topology and from a pre-defined distribution for temporal topology. The sampled architecture is directly used as the final architecture without training and evaluation, thus drastically reducing the computation cost compared with other NAS methods. However, as mentioned in the work, there is still a lack of theoretical guarantee of the random sampling method, although empirical results show that the variance between different sampled architectures is low.

[90] proposed the evolving connectivity (EC) framework to train recurrent SNNs (RSNNs), which employs natural evolution strategies (NES) [91] for optimizing parameterized connection probability distributions to replace the weight-tuning process. Inspired from weight agnostic network [92], the EC framework circumvents the need for gradients and features hardware-friendly characteristics, including sparse binary connections and accelerated training. The evolved SNNs are evaluated on a series of robotic locomotion tasks including a complex 17-DoF humanoid task, where it achieves comparable performance with ANNs and outperformed gradient-trained RSNNs.

4.2.3 Discussion

Finally, we provide results of above works on benchmark image classification tasks including CIFAR10, CIFAR100, CIFAR10-DVS, Tiny-ImageNet and ImageNet datasets in Table 1. For a comprehensive comparison, we adopt evaluation metrics including number of model parameters (NoP), simulation time steps (T), accuracy, number of spikes (NoS), theoretical power consumption and synaptic operations (SynOps). Note that the theoretical power consumption is calculated by operation numbers of the SNN during the inference of one image, we refer to [88] for a nice elaboration of the relation between SynOps and theoretical power consumption. We also include recent state-of-the-art works of directly handcrafted SNNs as a comparison with evolutionary approaches.

On CIFAR dataset series, evolutionary SNNs are highly competitive to directly handcrafted SNNs. For convolution networks, the precision of SpikeDHS(n4) [52] surpasses the handcrafted Dspike [13] on CIFAR10 and CIFAR100 under similar network size. With DGS method, SpikeDHSD(n3c5) is even competitive to the Spike-driven Transformer [93] which adopts a transformer architecture. In terms of accuracy, NeuEvo [84] currently achieves the highest values among evolutionary SNNs on CIFAR10, CIFAR100 and CIFAR10-DVS datasets. On the large scale ImageNet dataset, among convolution networks, STTS [88] demonstrates the highest accuracy within evolutionary SNNs meanwhile significantly outperforming the handcrafted Dspike [13] under similar network size. However, the very recent work of Meta-SpikeFormer [15] using optimized spike-based attention operations achieves significantly higher accuracy than STTS meanwhile with much smaller network size, though with higher theoretical power consumption. This indicates the limitation of evolutionary SNN approaches based on pure convolution architectures.

Table 1: Performance comparison of evolutionary SNNs and state-of-the-art handcrafted SNNs on image classification. All values are taken from literatures. Values by estimation are denoted by *. Values not available are denoted by -. Handcrafted SNNs are denoted by bold prefix H.
Dataset Model
NoP
T
Accuracy
(%)
NoS
Power
(mJ)
SynOps
CIFAR10 AutoSNN (C=128) [72] 21M 8 93.15 310K 6.3 732M
SNASNet-BW [75] 49.93M 8 94.12±0.25 - - -
SpikeDHS-CLA(n4) [52] 12M 6 94.34±0.06 788K 5.5 923M
SpikeDHS-CLA(n3) [52] 14M 6 95.35±0.05 752K - -
SpikeDHS-CLAD(n3c5) [52] 14M 6 95.50±0.03 720K - -
STTS(TANet-Tiny) [88] 7M 4 95.10±0.09 - 1.2 -
ESNN (λ𝜆\lambdaitalic_λ=0) [86] 23.47M 3 94.64 - 1.2 412M
ESNN (λ𝜆\lambdaitalic_λ=0.1) [86] 16.18M 3 94.27 - 0.8 270M
NeuEvo [84] - - 96.43±0.26 500K - -
(H) Dspike [13] 11M 6 94.25 - - -
(H) Spike-driven Transformer [93] - 4 95.6 - - -
CIFAR100 AutoSNN(C=64) [72] 5M 8 69.16 326K - 785M
SNASNet-BW [75] 20.71M 5 73.04±0.36 - - 1293M
SpikeDHS-CLA(n4) [52] 12M 6 75.70±0.14 962K - 1056M
SpikeDHS-CLA(n3) [52] 14M 6 76.15±0.20 - - -
SpikeDHS-CLAD(n3s1) [52] 14M 6 76.25±0.10 - - -
STTS(TANet-Tiny) [88] 7M 4 76.33±0.32 - - -
ESNN (λ𝜆\lambdaitalic_λ=0) [86] 27.55M 3 74.78 - - 517M
ESNN (λ𝜆\lambdaitalic_λ=0.1) [86] 17.32M 3 73.21 - - 348M
NeuEvo [84] - - 77.72±0.32 - - -
(H) Dspike [13] 11M 6 74.24±0.10 - - -
(H) Spike-driven Transformer [93] - 4 78.4 - - -
CIFAR10-DVS AutoSNN(C=16) [72] 0.42M 20 72.50 1269K - 1221M
ESNN (λ𝜆\lambdaitalic_λ=0) [86] 3.53M 10 78.40 - - 1998M
ESNN (λ𝜆\lambdaitalic_λ=0.1) [86] 2.52M 10 75.80 - - 1215M
NeuEvo [84] - - 84.17±0.23 - - -
(H) Dspike [13] 11M 10 75.4±0.10 - - -
(H) Spike-driven Transformer [93] - 16 80.0 - - -
Tiny-ImageNet AutoSNN(C=64) [72] 14.83M 8 46.79 680K - 903M
SNASNet-BW [75] 74.62M 5 54.60±0.48 - - 1454M
ESNN (λ𝜆\lambdaitalic_λ=0) [86] 6.91M 3 58.59 - - 1061M
ESNN (λ𝜆\lambdaitalic_λ=0.1) [86] 5.14M 3 58.10 - - 666M
ImageNet SpikeDHS [52] 58M 6 67.96 - - -
SpikeDHSD [52] 58M 6 68.64 - - -
STTS(TANet-Regular) [88] 25M 4 70.79±0.43 - 8.2 -
NeuEvo [84] - - 68.74 - - -
(H) Dspike [13] 22M 6 68.19 - - -
(H) Meta-SpikeFormer [15] 15.1M 4 74.1 - 16.7 -
(H) Meta-SpikeFormer [15] 55.4M 4 79.7 - 52.4 -

5 Summary

This paper provides a survey of emerging works of evolutionary SNNs, with a focus on recent approaches applied for deep SNNs. In general, current evolutionary methods enable SNNs to achieve high accuracy in tasks including static and event-based image classification, event-based vision tasks and reinforcement learning tasks in robotics, meanwhile with low energy consumption, evaluated by number of spikes, theoretical powers or SynOps. Despite recent achievements, limitations exit for current evolutionary approaches for SNNs. We conclude these challenges as following:

  1. 1.

    Evolution cost: Several evolutionary SNN works [72, 52, 84] integrate gradient-based training into the architectural optimization process. Considering the training of SNN typically requires significantly longer time than ANN, this strategy could lead to huge computation cost when scaling up. This motivates efficient SNN evolutionary methods towards better accuracy-computational cost trade-off. Recent works have improved it using methods including initialization optimization, random sampling and branchless method [75, 88, 86].

  2. 2.

    Operation space: Current evolutionary SNN works mainly define their operation space on convolution, which largely limit their performances when competing with SNNs with attention mechanisms. This motivates a more flexible and diverse operation search space design compatible with spike-based operation, such as MLP [80] and attention [94].

  3. 3.

    Co-evolution: Current works mainly focus on architecture optimization. Biological nervous systems utilize highly diverse neurons, network topology, learning rules and targets for optimization, to form intelligence. The integrated evolution of multiple elements such as architecture, neuron dynamics and learning algorithms could potentially lead to more efficient and powerful bio-inspired SNNs.

Alternative to improving the performance of SNNs on benchmark datasets, a more fundamental impact of EAs to SNNs is that their bio-mimetic nature offers a bottom-up path for the automated design of SNNs. The success of deep learning has motivated ANN-like SNNs with simplified LIF neuron model. However, biological neurons and neural circuits are featured with abundant dynamics and diverse interaction rules of local components, which resulting in a huge degree of freedom hard for handcrafted methods. Embedded with these principles, network combined with EAs could evolve through competition and mutation, towards intelligent systems with biological plausibility and answer underlying questions, such as why spikes?

Declarations

  • Funding: This work was supported by the National Key Research and Development Program of China (2021YFF1200800) and the Science and Technology Innovation (STI) 2030-Major Project (Brain Science and Brain-Like Intelligence Technology) under Grant 2022ZD0208700.

  • Conflict of interest/Competing interests: The authors declare that they have no conflict of interest.

  • Ethics approval and consent to participate: Not applicable

  • Consent for publication: All authors consent for publication.

  • Data availability: Not applicable

  • Materials availability: Not applicable

  • Code availability: Not applicable

  • Author contribution: Luziwei Leng, Shuaijie Shen and Zhichao Lu concieved the idea of the paper. Shuaijie Shen, Rui Zhang, Chao Wang, Renzhuo Huang, Aiersi Tuerhong, Qinghai Guo, Zhichao Lu and Luziwei Leng participated in the writing of the paper. Luziwei Leng and Jiangguo Zhang supervised the project. All authors reviewed the manuscript.

References

  • \bibcommenthead
  • Maass [1997] Maass, W.: Networks of spiking neurons: The third generation of neural network models. Neural Networks 10, 1659–1671 (1997) https://doi.org/10.1016/S0893-6080(97)00011-7
  • Bohte et al. [2000] Bohte, S.M., Kok, J.N., La Poutré, J.A.: SpikeProp: Backpropagation for Networks of Spiking Neurons. Paper presented at the 8th European Symposium on Artificial Neural Networks, Bruges, Belgium, 26-28 April 2000 (2000)
  • Wu et al. [2018] Wu, Y., Deng, L., Li, G., Shi, L.: Spatio-temporal backpropagation for training high-performance spiking neural networks. Frontiers in Neuroscience 12, 331 (2018)
  • Neftci et al. [2019] Neftci, E.O., Mostafa, H., Zenke, F.: Surrogate gradient learning in spiking neural networks: Bringing the power of gradient-based optimization to spiking neural networks. IEEE Signal Processing Magazine 36(6), 51–63 (2019)
  • Krizhevsky [2009] Krizhevsky, A.: Learning multiple layers of features from tiny images (2009)
  • Deng et al. [2009] Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a Large-Scale Hierarchical Image Database. Paper presented at the 37th IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009 (2009)
  • Wu et al. [2019] Wu, Y., Deng, L., Li, G., Zhu, J., Xie, Y., Shi, L.: Direct Training for Spiking Neural Networks: Faster, Larger, Better. Paper presented at the 33rd AAAI Conference on Artificial Intelligence, Honolulu, Hawaii, USA, 27 January–1 February 2019 (2019)
  • Zheng et al. [2021] Zheng, H., Wu, Y., Deng, L., Hu, Y., Li, G.: Going Deeper With Directly-Trained Larger Spiking Neural Networks. Paper presented at the 35th AAAI Conference on Artificial Intelligence, virtually, 2–9 February 2021 (2021)
  • Zhou et al. [2023] Zhou, Z., Zhu, Y., He, C., Wang, Y., Yan, S., Tian, Y., Yuan, L.: Spikformer: When Spiking Neural Network Meets Transformer. Paper presented at the 11th International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023 (2023)
  • Zhu et al. [2022] Zhu, L., Wang, X., Chang, Y., Li, J., Huang, T., Tian, Y.: Event-based Video Reconstruction via Potential-assisted Spiking Neural Network. Paper presented at the 40th IEEE Conference on Computer Vision and Pattern Recognition, New Orleans, Louisiana, 21–24 June 2022 (2022)
  • Hagenaars et al. [2021] Hagenaars, J., Paredes-Vallés, F., De Croon, G.: Self-supervised Learning of Event-based Optical Flow with Spiking Neural Networks. Paper presented at the 35th Conference on Neural Information Processing Systems, virtually, 7–10 December 2021 (2021)
  • Kim et al. [2022] Kim, Y., Chough, J., Panda, P.: Beyond classification: Directly training spiking neural networks for semantic segmentation. Neuromorphic Computing and Engineering 2, 044015 (2022) https://doi.org/10.1088/2634-4386/ac9b86
  • Li et al. [2021] Li, Y., Guo, Y., Zhang, S., Deng, S., Hai, Y., Gu, S.: Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks. Paper presented at the 35th Conference on Neural Information Processing Systems, virtually, 7–10 December 2021 (2021)
  • Lian et al. [2023] Lian, S., Shen, J., Liu, Q., Wang, Z., Yan, R., Tang, H.: Learnable Surrogate Gradient for Direct Training Spiking Neural Networks. Paper presented at the 32nd International Joint Conference on Artificial Intelligence, Macao, S.A.R, 19–25 August 2023 (2023)
  • Yao et al. [2023] Yao, M., Hu, J., Hu, T., Xu, Y., Zhou, Z., Tian, Y., Bo, X., Li, G.: Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips. Paper presented at the 11th International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023 (2023)
  • Nunes et al. [2022] Nunes, J.D., Carvalho, M., Carneiro, D., Cardoso, J.S.: Spiking neural networks: A survey. IEEE Access 10, 60738–60764 (2022) https://doi.org/10.1109/ACCESS.2022.3179968
  • Wang et al. [2022] Wang, S., Cheng, T.H., Lim, M.H.: A hierarchical taxonomic survey of spiking neural networks. Memetic Computing 14(3), 335–354 (2022)
  • Malcolm and Casco-Rodriguez [2023] Malcolm, K., Casco-Rodriguez, J.: A Comprehensive Review of Spiking Neural Networks: Interpretation, Optimization, Efficiency, and Best Practices. Preprint at https://arxiv.org/abs/2303.10780 (2023)
  • Yamazaki et al. [2022] Yamazaki, K., Vo-Ho, V.-K., Bulsara, D., Le, N.: Spiking neural networks and their applications: A review. Brain Sciences 12, 863 (2022) https://doi.org/10.3390/brainsci12070863
  • Paul et al. [2024] Paul, P., Sosik, P., Ciencialova, L.: A Survey on Learning Models of Spiking Neural Membrane Systems and Spiking Neural Networks. Preprint at https://arxiv.org/abs/2403.18609 (2024)
  • Hodgkin and Huxley [1952] Hodgkin, A.L., Huxley, A.F.: A quantitative description of membrane current and its application to conduction and excitation in nerve. The Journal of physiology 117, 500 (1952) https://doi.org/10.1113/jphysiol.1952.sp004764
  • Meunier and Segev [2002] Meunier, C., Segev, I.: Playing the devil’s advocate: Is the hodgkin–huxley model useful. Trends in Neurosciences 25, 558–563 (2002) https://doi.org/10.1016/s0166-2236(02)02278-6
  • Strassberg and DeFelice [1993] Strassberg, A.F., DeFelice, L.J.: Limitations of the hodgkin-huxley formalism: Effects of single channel kinetics on transmembrane voltage dynamics. Neural Computation 5, 843–855 (1993) https://doi.org/10.1162/neco.1993.5.6.843
  • Deng et al. [2022] Deng, S., Li, Y., Zhang, S., Gu, S.: Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting. Paper presented at the 10th International Conference on Learning Representations, virtually, 25–29 April 2022 (2022)
  • Hebb [2002] Hebb, D.O.: The Organization of Behavior: A Neuropsychological Theory. Psychology Press, New York (2002)
  • Bi and Poo [1998] Bi, G.-q., Poo, M.-m.: Synaptic modifications in cultured hippocampal neurons: dependence on spike timing, synaptic strength, and postsynaptic cell type. Journal of Neuroscience 18, 10464–10472 (1998) https://doi.org/10.1523/JNEUROSCI.18-24-10464.1998
  • Song et al. [2000] Song, S., Miller, K.D., Abbott, L.F.: Competitive hebbian learning through spike-timing-dependent synaptic plasticity. Nature Neuroscience 3, 919–926 (2000) https://doi.org/10.1038/78829
  • Bliss and Gardner-Medwin [1973] Bliss, T.V., Gardner-Medwin, A.R.: Long-lasting potentiation of synaptic transmission in the dentate area of the unanaesthetized rabbit following stimulation of the perforant path. The Journal of Physiology 232, 357 (1973) https://doi.org/10.1113/jphysiol.1973.sp010274
  • Masquelier et al. [2008] Masquelier, T., Guyonneau, R., Thorpe, S.J.: Spike timing dependent plasticity finds the start of repeating patterns in continuous spike trains. PLOS ONE 3, 1–9 (2008) https://doi.org/10.1371/journal.pone.0001377
  • Bell et al. [1997] Bell, C.C., Han, V.Z., Sugawara, Y., Grant, K.: Synaptic plasticity in a cerebellum-like structure depends on temporal order. Nature 387, 278–281 (1997) https://doi.org/10.1038/387278a0
  • Burbank [2015] Burbank, K.S.: Mirrored stdp implements autoencoder learning in a network of spiking neurons. PLoS Computational Biology 11, 1004566 (2015) https://doi.org/10.1371/journal.pcbi.1004566
  • Tavanaei et al. [2016] Tavanaei, A., Masquelier, T., Maida, A.S.: Acquisition of visual features through probabilistic spike-timing-dependent plasticity. Paper presented at the 30th International Joint Conference on Neural Network, Vancouver, Canada, 24-29 July 2016 (2016)
  • Izhikevich [2007] Izhikevich, E.M.: Solving the distal reward problem through linkage of stdp and dopamine signalings. Cerebral Cortex 17, 2443–2452 (2007) https://doi.org/10.1093/cercor/bhl152
  • Rumelhart et al. [1986] Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back-propagating errors. Nature 323(6088), 533–536 (1986)
  • Journé et al. [2023] Journé, A., Rodriguez, H.G., Guo, Q., Moraitis, T.: Hebbian deep learning without feedback. Paper presented at the 11th International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023 (2023)
  • Leng [2014] Leng, L.: Deep learning architectures for neuromorphic hardware. PhD thesis, Master thesis, Ruprecht-Karls-Universität Heidelberg, 2014. HD-KIP 14-26 (2014)
  • Cao et al. [2015] Cao, Y., Chen, Y., Khosla, D.: Spiking deep convolutional neural networks for energy-efficient object recognition. International Journal of Computer Vision 113, 54–66 (2015) https://doi.org/10.1007/s11263-014-0788-3
  • Leng et al. [2016] Leng, L., Petrovici, M.A., Martel, R., Bytschok, I., Breitwieser, O., Bill, J., Schemmel, J., Meier, K.: Spiking neural networks as superior generative and discriminative models. Cosyne Abstracts, Salt Lake City USA 2 (2016)
  • Diehl et al. [2015] Diehl, P.U., Neil, D., Binas, J., Cook, M., Liu, S.-C., Pfeiffer, M.: Fast-classifying, high-accuracy spiking deep networks through weight and threshold balancing. Paper presented at the 29th International Joint Conference on Neural Networks, Killarney, Ireland, 12-17 July 2015 (2015)
  • Rueckauer et al. [2017] Rueckauer, B., Lungu, I.-A., Hu, Y., Pfeiffer, M., Liu, S.-C.: Conversion of continuous-valued deep networks to efficient event-driven networks for image classification. Frontiers in Neurosciencen 11, 682 (2017) https://doi.org/10.3389/fnins.2017.00682
  • Leng et al. [2018] Leng, L., Martel, R., Breitwieser, O., Bytschok, I., Senn, W., Schemmel, J., Meier, K., Petrovici, M.A.: Spiking neurons with short-term synaptic plasticity form superior generative networks. Scientific Reports 8, 10651 (2018) https://doi.org/10.1038/s41598-018-28999-2
  • Sengupta et al. [2019] Sengupta, A., Ye, Y., Wang, R., Liu, C., Roy, K.: Going deeper in spiking neural networks: Vgg and residual architectures. Frontiers in Neurosciencets 13, 95 (2019) https://doi.org/10.3389/fnins.2019.00095
  • Han et al. [2020] Han, B., Srinivasan, G., Roy, K.: RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper High-Accuracy and Low-Latency Spiking Neural Network. Paper presented at the 38th IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, Utah, 18-22 June 2018 (2020)
  • Li et al. [2021] Li, Y., Deng, S., Dong, X., Gong, R., Gu, S.: A free lunch from ANN: Towards efficient, accurate spiking neural networks calibration. Paper presented at the 38th International Conference on Machine Learning, virtually, 18–24 July 2021 (2021)
  • Bu et al. [2022] Bu, T., Fang, W., Ding, J., Dai, P., Yu, Z., Huang, T.: Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks. Paper presented at the 10th International Conference on Learning Representations, virtually, 25–29 April 2022 (2022)
  • Jiang et al. [2023] Jiang, H., Anumasa, S., De Masi, G., Xiong, H., Gu, B.: A Unified Optimization Framework of ANN-SNN Conversion: Towards Optimal Mapping from Activation Values to Firing Rates. Paper presented at the 40th International Conference on Machine Learning, Honolulu, Hawaii, 23–29 July 2023 (2023)
  • Wang et al. [2023] Wang, B., Cao, J., Chen, J., Feng, S., Wang, Y.: A New ANN-SNN Conversion Method with High Accuracy, Low Latency and Good Robustness. Paper presented at the 32nd International Joint Conference on Artificial Intelligence, Macao, S.A.R, 19–25 August 2023 (2023)
  • Mostafa [2017] Mostafa, H.: Supervised learning based on temporal coding in spiking neural networks. IEEE Transactions on Neural Networks and Learning Systems 29, 3227–3235 (2017) https://doi.org/10.1109/TNNLS.2017.2726060
  • Esser et al. [2016] Esser, S.K., Merolla, P.A., Arthur, J.V., Cassidy, A.S., Appuswamy, R., Andreopoulos, A., Berg, D.J., McKinstry, J.L., Melano, T., Barch, D.R., Nolfo, C., Datta, P., Amir, A., Taba, B., Flickner, M.D., Modha, D.S.: Convolutional networks for fast, energy-efficient neuromorphic computing. Proceedings of the National Academy of Sciences 113, 11441–11446 (2016) https://doi.org/10.1073/pnas.1604850113
  • Bellec et al. [2018] Bellec, G., Salaj, D., Subramoney, A., Legenstein, R., Maass, W.: Long short-term memory and learning-to-learn in networks of spiking neurons. Paper presented at the 32nd Conference on Neural Information Processing Systems, Montreal, Canada, 2–8 December 2018 (2018)
  • Woźniak et al. [2020] Woźniak, S., Pantazi, A., Bohnstingl, T., Eleftheriou, E.: Deep learning incorporating biologically inspired neural dynamics and in-memory computing. Nature Machine Intelligence 2(6), 325–336 (2020)
  • Che et al. [2022] Che, K., Leng, L., Zhang, K., Zhang, J., Meng, Q., Cheng, J., Guo, Q., Liao, J.: Differentiable hierarchical and surrogate gradient search for spiking neural networks. Paper presented at the 36th Conference on Neural Information Processing Systems, New Orleans, Louisiana, 28 November-9 December 2022 (2022)
  • Guo et al. [2023] Guo, Y., Zhang, Y., Chen, Y., Peng, W., Liu, X., Zhang, L., Huang, X., Ma, Z.: Membrane Potential Batch Normalization for Spiking Neural Networks. Paper presented at the 19th IEEE International Conference on Computer Vision, Paris, France, 2–6 October 2023 (2023)
  • Guo et al. [2022] Guo, Y., Tong, X., Chen, Y., Zhang, L., Liu, X., Ma, Z., Huang, X.: Recdis-snn: Rectifying membrane potential distribution for directly training spiking neural networks. Paper presented at the 40th IEEE Conference on Computer Vision and Pattern Recognition, New Orleans, Louisiana, 21–24 June 2022 (2022)
  • Perez-Nieves and Goodman [2021] Perez-Nieves, N., Goodman, D.: Sparse spiking gradient descent. Paper presented at the 35th Conference on Neural Information Processing Systems, virtually, 7–10 December 2021 (2021)
  • Xiao et al. [2022] Xiao, M., Meng, Q., Zhang, Z., He, D., Lin, Z.: Online training through time for spiking neural networks. Paper presented at the 36th Conference on Neural Information Processing Systems, New Orleans, Louisiana, 28 November–9 December 2022 (2022)
  • Meng et al. [2023] Meng, Q., Xiao, M., Yan, S., Wang, Y., Lin, Z., Luo, Z.-Q.: Towards Memory- and Time-Efficient Backpropagation for Training Spiking Neural Networksks. Paper presented at the 19th IEEE International Conference on Computer Vision, Paris, France, 2–6 October 2023 (2023)
  • Holland [1992] Holland, J.H.: Genetic algorithms. Scientific American 267(1), 66–73 (1992)
  • Fogel [1962] Fogel, L.J.: Autonomous automata. Industrial Research 4, 14–19 (1962)
  • Rechenberg [1965] Rechenberg, I.: Cybernetic solution path of an experimental problem. Royal Aircraft Establishment Library Translation 1122 (1965)
  • Schwefel [1965] Schwefel, H.-P.: Kybernetische evolution als strategie der exprimentellen forschung in der strömungstechnik. PhD thesis (January 1965)
  • De Garis [1990] De Garis, H.: Genetic programming: Building artificial nervous systems using genetically programmed neural network modules. In: Machine Learning Proceedings 1990, pp. 132–139. Morgan Kaufmann, Burlington (1990)
  • Koza [1990] Koza, J.R.: Genetic Programming: A Paradigm for Genetically Breeding Populations of Computer Programs to Solve Problems vol. 34. Stanford University, Department of Computer Science Stanford, CA, California (1990)
  • Storn [1996] Storn, R.: On the Usage of Differential Evolution for Function Optimization. Paper presented at the Proceedings of North American Fuzzy Information Processing, Berkeley, CA, USA, 9-22 June 1996 (1996)
  • Kennedy and Eberhart [1995] Kennedy, J., Eberhart, R.: Particle Swarm Optimization. Paper presented at the Proceedings of International Conference on Neural Networks (1995)
  • Pavlidis et al. [2005] Pavlidis, N., Tasoulis, O., Plagianakos, V.P., Nikiforidis, G., Vrahatis, M.: Spiking neural network training using evolutionary algorithms. Paper presented at the 19th International Joint Conference on Neural Networks,Montreal, Canada, 31 July–4 August 2005 (2005)
  • Batllori et al. [2011] Batllori, R., Laramee, C.B., Land, W., Schaffer, J.D.: Evolving spiking neural networks for robot control. Procedia Computer Science 6, 329–334 (2011) https://doi.org/10.1016/j.procs.2011.08.060
  • Vázquez and Garro [2011] Vázquez, R.A., Garro, B.A.: Training Spiking Neurons by Means of Particle Swarm Optimization. Paper presented at the 2nd International Conference on Advances in Swarm Intelligence, Chongqing, China, 12-15 June 2011 (2011)
  • Saleh et al. [2014] Saleh, A.Y., Hameed, H., Najib, M., Salleh, M.: A novel hybrid algorithm of differential evolution with evolving spiking neural network for pre-synaptic neurons optimization. International Journal of Advances in Soft Computing and Its Applications 6(1), 1–16 (2014)
  • Schaffer [2015] Schaffer, J.D.: Evolving Spiking Neural Networks: A Novel Growth Algorithm Corrects the Teacher. Paper presented at the IEEE Symposium on Computational Intelligence for Security and Defense Applications, Verona, NY, USA, 26–28 May 2015 (2015)
  • Yusuf et al. [2017] Yusuf, Z.M., Hamed, H.N.A., Yusuf, L.M., Isa, M.A.: Evolving spiking neural network (ESNN) and harmony search algorithm (HSA) for parameter optimization. Paper presented at the 6th International Conference on Electrical Engineering and Informatics, Langkawi Island, Kedah, Malaysia, 25-27 November 2017 (2017)
  • Na et al. [2022] Na, B., Mok, J., Park, S., Lee, D., Choe, H., Yoon, S.: AutoSNN: Towards Energy-Efficient Spiking Neural Networks. Paper presented at the 39th International Conference on Machine Learning, Baltimore, USA, 7–23 July 2022 (2022)
  • Pham et al. [2018] Pham, H., Guan, M.Y., Zoph, B., Le, Q.V., Dean, J.: Efficient neural architecture search via parameter sharing. Paper presented at the 35th International Conference on Machine Learning, Stockholm, SWEDEN, 10–15 July 2018 (2018)
  • Cai et al. [2020] Cai, H., Gan, C., Wang, T., Zhang, Z., Han, S.: Once-for-all: Train one network and specialize it for efficient deployment. Paper presented at the 8th International Conference on Learning Representations, vitually, 26 April–1 May 2020 (2020)
  • Kim et al. [2022] Kim, Y., Li, Y., Park, H., Venkatesha, Y., Panda, P.: Neural architecture search for spiking neural networks. Paper presented at the 17th European Conference on Computer Vision, Montreal, Canada, 23-27 October 2022 (2022)
  • Mellor et al. [2022] Mellor, J., Turner, J., Storkey, A., Crowley, E.J.: Neural architecture search without training. Paper presented at the 17th European Conference on Computer Vision, Montreal, Canada, 23-27 October 2022 (2022)
  • Chen et al. [2021] Chen, W., Gong, X., Wang, Z.: Neural architecture search on imagenet in four gpu hours: A theoretically inspired perspective. Paper presented at the 9th International Conference on Learning Representations, virtually, 3–7 May 2021 (2021)
  • Liu et al. [2019a] Liu, H., Simonyan, K., Yang, Y.: DARTS: Differentiable Architecture Search. Paper presented at the 7th International Conference on Learning Representations, New Orleans, USA, 5–9 May 2019 (2019)
  • Liu et al. [2019b] Liu, C., Chen, L.-C., Schroff, F., Adam, H., Hua, W., Yuille, A.L., Fei-Fei, L.: Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentationh. Paper presented at the 37th IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA, 16–20 June 2019 (2019)
  • Li et al. [2023] Li, B., Leng, L., Cheng, R., Shen, S., Zhang, K., Zhang, J., Liao, J.: Efficient Deep Spiking Multi-Layer Perceptrons with Multiplication-Free Inference. Preprint at https://arxiv.org/abs/2306.12465 (2023)
  • Zhu et al. [2018] Zhu, A.Z., Thakur, D., Özaslan, T., Pfrommer, B., Kumar, V., Daniilidis, K.: The multivehicle stereo event camera dataset: An event camera dataset for 3d perception. IEEE Robotics and Automation Letters 3(3), 2032–2039 (2018)
  • Zhang et al. [2023a] Zhang, H., Leng, L., Che, K., Liu, Q., Cheng, J., Guo, Q., Liao, J., Cheng, R.: Automotive Object Detection via Learning Sparse Events by Temporal Dynamics of Spiking Neurons. Preprint at https://arxiv.org/abs/2307.12900 (2023)
  • Zhang et al. [2023b] Zhang, R., Leng, L., Che, K., Zhang, H., Cheng, J., Guo, Q., Liao, J., Cheng, R.: Accurate and efficient event-based semantic segmentation using adaptive spiking encoder-decoder networks. Preprint at https://arxiv.org/abs/2304.118570 (2023)
  • Shen et al. [2023] Shen, G., Zhao, D., Dong, Y., Zeng, Y.: Brain-inspired neural circuit evolution for spiking neural networks. Proceedings of the National Academy of Sciences 120, 2218173120 (2023) https://doi.org/10.1073/pnas.2218173120
  • Suzuki and Bekkers [2012] Suzuki, N., Bekkers, J.M.: Microcircuits mediating feedforward and feedback synaptic inhibition in the piriform cortex. Journal of Neuroscience 32, 919–931 (2012) https://doi.org/10.1523/JNEUROSCI.4112-11.2012
  • Yan et al. [2024] Yan, J., Liu, Q., Zhang, M., Feng, L., Ma, D., Li, H., Pan, G.: Efficient spiking neural network design via neural architecture search. Neural Networks, 106172 (2024) https://doi.org/10.1016/j.neunet.2024.106172
  • Stamoulis et al. [2019] Stamoulis, D., Ding, R., Wang, D., Lymberopoulos, D., Priyantha, B., Liu, J., Marculescu, D.: Single-path nas: Designing hardware-efficient convnets in less than 4 hours. Paper presented at the Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Würzburg, Germany, 16–20 September 2019 (2019)
  • Yan et al. [2024] Yan, S., Meng, Q., Xiao, M., Wang, Y., Lin, Z.: Sampling complex topology structures for spiking neural networks. Neural Networks 172, 106121 (2024) https://doi.org/10.1016/j.neunet.2024.106121
  • Chollet [2017] Chollet, F.: Xception: Deep learning with depthwise separable convolutions. Paper presented at the 35th IEEE/CVF conference on computer vision and pattern recognitions, Honolulu, Hawaii, 21–26 July 2017 (2017)
  • Wang et al. [2023] Wang, G., Sun, Y., Cheng, S., Song, S.: Evolving Connectivity for Recurrent Spiking Neural Networks. Paper presented at the 37th Conference on Neural Information Processing Systems, New Orleans, Louisiana, 10–16 December 2023 (2023)
  • Wierstra et al. [2014] Wierstra, D., Schaul, T., Glasmachers, T., Sun, Y., Peters, J., Schmidhuber, J.: Natural evolution strategies. Journal of Machine Learning Research 15(27), 949–980 (2014)
  • Gaier and Ha [2019] Gaier, A., Ha, D.: Weight agnostic neural networks. Paper presented at the 33rd Conference on Neural Information Processing Systems, Vancouver, Candada, 8–14 December 2019 (2019)
  • Yao et al. [2024] Yao, M., Hu, J., Zhou, Z., Yuan, L., Tian, Y., Xu, B., Li, G.: Spike-driven transformers. Paper presented at the 38th Conference on Neural Information Processing Systems, Vancouver, Canada, 9–15 December 2024 (2024)
  • Che et al. [2023] Che, K., Zhou, Z., Ma, Z., Fang, W., Chen, Y., Shen, S., Yuan, L., Tian, Y.: Auto-Spikformer: Spikformer Architecture Search. Preprint at https://arxiv.org/abs/2306.00807 (2023)