One-Hot Multi-Level LIF Spiking Neural Networks for Enhanced Accuracy-Latency Tradeoff
We propose a one-hot multi-level leaky integrate-and-fire (M-LIF) neuron to reduce the number of timesteps T during SNN inference while improving accuracy and maintaining the low-spike rates of traditional SNNs.
Abstract
Reviews and Discussion
This paper introduces a one-hot Multi-Level Leaky Integrate-and-Fire (M-LIF) neuron model, which expands neuron outputs beyond binary spikes by using multiple binary-weighted spike lanes. The M-LIF model enhances the accuracy-energy tradeoff by enabling higher accuracy with fewer timesteps compared to conventional SNNs. Experimental results demonstrate that M-LIF SNNs achieve higher accuracy on static datasets like ImageNet and significantly reduce latency on dynamic datasets like DVS-CIFAR10 while maintaining energy efficiency.
Strengths
- Significance: The paper tackles a crucial challenge in SNNs by developing a method to reduce the number of timesteps while maintaining high accuracy, addressing the need for more energy-efficient SNNs.
- Clarity: The paper is exceptionally well-written and organized.
- Validation: The authors perform comprehensive experiments on both static and dynamic datasets.
Weaknesses
- Novelty: The proposed M-LIF neuron model closely resembles existing concepts such as multi-spike, burst spike, and multi-threshold neurons. While the approach of restricting outputs to powers of two introduces some variation, it does not fundamentally differentiate the model from prior research, potentially limiting its originality. Moreover, the authors do not discuss any of these methods.
- Performance: Although the paper presents extensive experiments, the results on datasets like CIFAR10 and CIFAR100 under equivalent timesteps (e.g., T=1, S=3 compared to traditional SNNs with T=4) show suboptimal performance. This raises concerns about the effectiveness of the proposed method in improving accuracy.
- Lack of Hardware Implementation Discussion: The paper does not address how the proposed M-LIF model can be supported by existing or future hardware architectures. It remains unclear how the model can be practically deployed in neuromorphic hardware.
Questions
- Could you elaborate on the fundamental differences between your proposed M-LIF neuron model and existing models such as multi-spike, burst spike, and multi-threshold neurons?
- In your energy calculations, do you factor in the weights associated with different spike lanes? For instance, if your M-LIF model operates with T=1 and S=3, making it equivalent to a conventional SNN with T=4, should the energy consumption not scale proportionally by the number of spikes (i.e., multiply by 4)?
- How does your proposed M-LIF model align with current or emerging hardware architectures? Could you provide insights into the feasibility and potential challenges of implementing M-LIF neurons in hardware to achieve the claimed energy efficiency improvements?
Q4: In your energy calculations, do you factor in the weights associated with different spike lanes? For instance, if your M-LIF model operates with T=1 and S=3, making it equivalent to a conventional SNN with T=4, should the energy consumption not scale proportionally by the number of spikes (i.e., multiply by 4)?
A4: Thank you for raising this question. We appreciate the opportunity to clarify our energy calculations.
In one-hot M-LIF SNNs, the weights are indeed shared across spike lanes. One-hot M-LIF does not introduce overhead in that regard.
With respect to energy scaling proportionally, a (T=1, S=3) one-hot M-LIF SNN would be equivalent to a (T=4) traditional SNN in terms of overall spike activity only in the worst case, where every neuron fires at every timestep. However, energy consumption is not determined solely by the number of timesteps; it also depends on the firing rate, which is shaped by the learned threshold and leakage parameters of the neuron model during training. Therefore, energy consumption does not scale proportionally with the maximum number of spikes alone but depends on the firing rates observed during inference. Our energy calculations account for these firing rates, yielding a more accurate estimate of computational energy that reflects actual operating conditions and gives a better picture of the efficiency improvements offered by the one-hot M-LIF model.
We hope this explanation addresses your concern and makes the relationship between timesteps, firing rates, and energy consumption in our model clearer. Thank you again for your insightful question.
References
[1] Xiao et al., "Fast and accurate classification with a multi-spike learning algorithm for spiking neurons." In IJCAI 2019.
[2] Miao et al., "A supervised multi-spike learning algorithm for spiking neural networks." In IJCNN 2018.
[3] Wang et al., "MT-SNN: Enhance Spiking Neural Network with Multiple Thresholds." In ArXiv 2023.
[4] Feng et al., "Multi-Level Firing with Spiking DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking Neural Networks." In IJCAI 2022.
[5] Wang et al., “Bursting Spikes: Efficient and High-performance SNNs for Event-based Vision.” In arXiV 2023.
[6] Li et al., “Efficient and Accurate Conversion of Spiking Neural Network with Burst Spikes”. In IJCAI 2022.
[7] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[8] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[9] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[10] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Q3: Lack of Hardware Implementation Discussion: The paper does not address how the proposed M-LIF model can be supported by existing or future hardware architectures. It remains unclear how the model can be practically deployed in neuromorphic hardware. How does your proposed M-LIF model align with current or emerging hardware architectures? Could you provide insights into the feasibility and potential challenges of implementing M-LIF neurons in hardware to achieve the claimed energy efficiency improvements?
A3: Thank you for your insightful feedback. We appreciate the opportunity to elaborate on the hardware implementation aspects of the proposed M-LIF model.
The M-LIF model is designed to be adaptable to many existing hardware architectures. For example, previous work has shown the feasibility of leveraging systolic arrays for performing SNN inference using a small number of timesteps [8-9]. Adapting these methods to one-hot M-LIF SNNs would involve storing the exponent s of the spike lane output 2^s, where s ∈ {0, …, S−1} and S is the number of spike lanes, instead of a traditional single-bit activation.
Our experiments indicate significant benefits from using up to four spike lanes (S=4), with accuracy improvements saturating beyond that point (as detailed in the ablation study in response to Q2 for Reviewer egRh). Consequently, storage requirements would increase by only 1 or 2 additional bits per activation compared to traditional SNNs. This minimal memory overhead allows us to achieve higher accuracy while also minimizing the number of timesteps, with which the number of high-energy memory loads of FP32 weights and membrane potentials scales linearly.
During computations, instead of masking the weight, we perform an INT8 addition on the exponent, as shown in Appendix A.1. This addition is significantly cheaper than the multiplication required in ANNs. For a hardware implementation of one-hot M-LIF thresholding, a simple priority encoder can be used to return the first non-zero bit, scanning from the most significant bit of the membrane potential after accumulation.
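To make this datapath concrete, below is a minimal Python sketch (our own illustration rather than code from the paper; the fixed-point scaling and function names are assumptions) of the priority-encoder thresholding and the compact exponent storage described above.

```python
def priority_encode(v_mem_fixed, v_th_fixed, num_lanes):
    """Hypothetical priority encoder: membrane potential and threshold are
    integers in the same fixed-point scale. Returns the spike-lane exponent
    s in 0 .. num_lanes-1, or -1 if the neuron stays silent."""
    if v_mem_fixed < v_th_fixed:
        return -1
    # Most significant set bit of (v_mem / v_th), i.e. how many threshold
    # doublings the membrane potential has crossed.
    s = (v_mem_fixed // v_th_fixed).bit_length() - 1
    return min(s, num_lanes - 1)

# With S = 4 lanes, the exponent plus a "no spike" state fits in 3 bits,
# i.e. only 1-2 bits more per activation than a binary spike.
lane = priority_encode(v_mem_fixed=5 * 256, v_th_fixed=256, num_lanes=4)  # -> 2, i.e. output 2**2
```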
Moreover, it is worth noting that current neuromorphic hardware such as Loihi 2 supports multi-level SNNs, as mentioned by Reviewer NG6F. Therefore, one-hot M-LIF SNNs can indeed be mapped onto such neuromorphic hardware, ensuring their practical applicability and alignment with existing and emerging hardware architectures.
References
[1] Xiao et al., "Fast and accurate classification with a multi-spike learning algorithm for spiking neurons." In IJCAI 2019.
[2] Miao et al., "A supervised multi-spike learning algorithm for spiking neural networks." In IJCNN 2018.
[3] Wang et al., "MT-SNN: Enhance Spiking Neural Network with Multiple Thresholds." In ArXiv 2023.
[4] Feng et al., "Multi-Level Firing with Spiking DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking Neural Networks." In IJCAI 2022.
[5] Wang et al., “Bursting Spikes: Efficient and High-performance SNNs for Event-based Vision.” In arXiV 2023.
[6] Li et al., “Efficient and Accurate Conversion of Spiking Neural Network with Burst Spikes”. In IJCAI 2022.
[7] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[8] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[9] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[10] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Q2: Performance: Although the paper presents extensive experiments, the results on datasets like CIFAR10 and CIFAR100 under equivalent timesteps (e.g., T=1, S=3 compared to traditional SNNs with T=4) show suboptimal performance. This raises concerns about the effectiveness of the proposed method in improving accuracy.
A2: Thank you for your valuable observation. We acknowledge that for the Transformer-2-512 architecture on CIFAR10 and CIFAR100, the performance of the one-hot M-LIF method (T=1, S=3) is comparable to that of the traditional SNN (T=4). However, our primary goal with one-hot M-LIF was to reduce the number of timesteps during SNN inference while maintaining accuracy and preserving the low spike rates characteristic of traditional SNNs.
By reducing the timesteps, we aim to decrease the energy overhead associated with multi-timestep processing. The need for multi-timestep processing poses a significant challenge for widespread SNN deployment, as it often results in increased memory storage and access requirements, which can exceed the computational costs. Therefore, the (T=1, S=3) one-hot M-LIF SNN offers a more favorable tradeoff than the (T=4) SNN because it achieves comparable accuracy while avoiding the memory access overhead that scales linearly with timesteps.
For further illustration, please refer to our response to Q4 for Reviewer bytk, where we provide an example of the impact of multi-timestep processing on the total energy consumption of VGG16 for ImageNet, based on the memory energy model proposed in [7]. Additionally, for the larger and more challenging ImageNet dataset, evaluations of the spike-driven Transformer-8-512 show that the one-hot M-LIF SNN outperforms the traditional SNN in terms of accuracy.
Lastly, our approach demonstrates improved performance for dynamic vision classification tasks on DVS-CIFAR10, as shown in Table 3 of our submission. It surpasses traditional SNNs in accuracy and preserves accuracy better than traditional SNNs in low-latency, memory-efficient regimes.
References
[1] Xiao et al., "Fast and accurate classification with a multi-spike learning algorithm for spiking neurons." In IJCAI 2019.
[2] Miao et al., "A supervised multi-spike learning algorithm for spiking neural networks." In IJCNN 2018.
[3] Wang et al., "MT-SNN: Enhance Spiking Neural Network with Multiple Thresholds." In ArXiv 2023.
[4] Feng et al., "Multi-Level Firing with Spiking DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking Neural Networks." In IJCAI 2022.
[5] Wang et al., “Bursting Spikes: Efficient and High-performance SNNs for Event-based Vision.” In arXiV 2023.
[6] Li et al., “Efficient and Accurate Conversion of Spiking Neural Network with Burst Spikes”. In IJCAI 2022.
[7] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[8] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[9] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[10] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Q1: Novelty: The proposed M-LIF neuron model closely resembles existing concepts such as multi-spike, burst spike, and multi-threshold neurons. While the approach of restricting outputs to powers of two introduces some variation, it does not fundamentally differentiate the model from prior research, potentially limiting its originality. Moreover, the authors do not discuss any of these methods. Could you elaborate on the fundamental differences between your proposed M-LIF neuron model and existing models such as multi-spike, burst spike, and multi-threshold neurons?
A1: Thank you for your thoughtful question. We appreciate the opportunity to further clarify the novelty of our proposed one-hot M-LIF neuron model and its fundamental differences from existing approaches like multi-spike, burst-spike, and multi-threshold neurons. We have now added a paragraph in Section 2.5 of our submission to better contextualize these methods relative to our work.
Multi-spike [1,2] and multi-threshold [3,4] neurons increase activation precision by introducing uniform activation quantization into SNNs. This leads to higher computational costs, particularly in terms of energy consumption, as it breaks the multiplication-free nature of traditional SNNs and often requires multiplications or multiple successive additions to handle the increased precision. Burst-spike models [5,6] work by increasing the spike rate within a single timestep, which also enhances precision. However, burst-spike methods often do not scale down to unit-timestep processing, as demonstrated in [5], where ImageNet results are only reported for multiple timesteps. Memory energy scales linearly with the number of timesteps, and the cost of a single memory access is often higher than that of a single compute operation [10]. In contrast, our one-hot M-LIF neuron reduces the timestep requirement to a single timestep (T=1), which significantly cuts down on the associated memory access overhead.
Our one-hot M-LIF neuron utilizes a single threshold (and its powers of two) to ensure that only one spike lane fires per timestep. This restriction is learned during training, making it a unique feature not present in the multi-spike and multi-threshold approaches. The power-of-two encoding enables scaling weights efficiently with a single INT8 addition to their exponent values during membrane potential updates, avoiding the multiplications or complex additions typical of the models in [1,2,3,4]. Please also refer to the answer to Q1 for Reviewer NG6F for a comparison between one-hot M-LIF SNNs and multi-threshold [3,4] SNNs.
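As a simplified illustration of these dynamics (a sketch under our own assumptions about the leak and reset conventions, not the paper's exact formulation), one timestep of a one-hot M-LIF neuron can be written as follows:

```python
import math

def one_hot_mlif_step(v_mem, input_current, leak=0.9, v_th=1.0, num_lanes=3):
    """One hypothetical timestep: leaky integration followed by at most one
    power-of-two spike, selected via the single learned threshold v_th."""
    v_mem = leak * v_mem + input_current
    spike = 0.0
    if v_mem >= v_th:
        # Largest s (capped by the number of lanes) with v_mem >= v_th * 2**s.
        s = min(int(math.log2(v_mem / v_th)), num_lanes - 1)
        spike = 2.0 ** s              # output in {1, 2, 4, ...}; exactly one lane fires
        v_mem -= spike * v_th         # assumed soft reset by the emitted level
    return v_mem, spike

v, out = one_hot_mlif_step(v_mem=0.0, input_current=2.6)  # -> out == 2.0 (lane s = 1)
```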
The unique features of our approach are also demonstrated in our experimental results. Unlike burst-spike [5,6] models that require multi-timestep processing, our model achieves high accuracy with unit-timestep processing, which is particularly beneficial for both energy efficiency and scalability. We provide comparisons against unit-timestep SNNs in Table 1 of our manuscript for static image classification. Finally, our method is scalable to more complex tasks like ImageNet classification and spike-driven transformer architectures.
References
[1] Xiao et al., "Fast and accurate classification with a multi-spike learning algorithm for spiking neurons." In IJCAI 2019.
[2] Miao et al., "A supervised multi-spike learning algorithm for spiking neural networks." In IJCNN 2018.
[3] Wang et al., "MT-SNN: Enhance Spiking Neural Network with Multiple Thresholds." In ArXiv 2023.
[4] Feng et al., "Multi-Level Firing with Spiking DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking Neural Networks." In IJCAI 2022.
[5] Wang et al., “Bursting Spikes: Efficient and High-performance SNNs for Event-based Vision.” In arXiV 2023.
[6] Li et al., “Efficient and Accurate Conversion of Spiking Neural Network with Burst Spikes”. In IJCAI 2022.
[7] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[8] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[9] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[10] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
I appreciate the authors' detailed response and the effort put into addressing the concerns raised. However, after careful consideration and review of other reviewers' opinions, I maintain my original score.
The manuscript introduces a one-hot multi-level leaky integrate-and-fire (M-LIF) neuron model that represents the output of spiking neural networks (SNNs) as one-hot binary vectors. The focus is on training ultra-low latency spiking models with high accuracy, and the results show that the proposed model can outperform iso-architecture LIF models in terms of accuracy.
Strengths
The manuscript presents a simple, intuitive approach that is clearly described and achieves high accuracy with just a single timestep on large static image classification datasets, such as ImageNet. This approach offers an interesting solution for developing ultra-low latency SNN models.
Weaknesses
The manuscript lacks comparisons with other methods that use graded (weighted) spikes. The idea of utilizing multibit outputs (one-hot encoded or otherwise) is not novel, as it has been previously explored, for instance in [1], and is a feature available in neuromorphic hardware like Loihi 2 [2], as well as in commonly used SNN libraries like snnTorch [3]. I recommend that the authors provide a discussion highlighting the unique features of the proposed model in comparison to these existing methods and tools.
Additionally, while the manuscript emphasizes the energy efficiency of the proposed spiking models over other SNN models and artificial neural networks (ANNs), these energy comparisons lack context without specifying the hardware architecture used. Moreover, as noted by the authors, memory access often dominates energy consumption, which is omitted from these comparisons. Given that memory-related energy can be orders of magnitude higher than compute energy, and considering that SNNs may require more memory access than ANNs due to membrane potential updates, the energy comparisons may be unrealistic. I suggest the authors replace these energy comparisons with more objective metrics, such as the number of multiply-accumulate (MAC) operations, memory usage, and average firing rate.
[1] Ponghiran, W., & Roy, K. (2022). Spiking Neural Networks with Improved Inherent Recurrence Dynamics for Sequential Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), 8001-8008. https://doi.org/10.1609/aaai.v36i7.20771
[2] M. Davies et al., "Advancing Neuromorphic Computing With Loihi: A Survey of Results and Outlook," in Proceedings of the IEEE, vol. 109, no. 5, pp. 911-934, May 2021, doi: 10.1109/JPROC.2021.3067593.
Questions
- What unique features distinguish the proposed model from existing models in the literature, as well as from available neuromorphic hardware and open-source SNN projects? How does the proposed model perform in comparison to these alternatives?
- Regarding the energy efficiency claims in the weaknesses section, could the authors elaborate on the assumptions made for the energy comparisons? How might these comparisons change if memory access energy is included?
Q3: What unique features distinguish the proposed model from existing models in the literature, as well as from available neuromorphic hardware and open-source SNN projects? How does the proposed model perform in comparison to these alternatives?
A3: Thank you for your thoughtful question. We appreciate the opportunity to further clarify the unique features of our proposed model.
Our one-hot M-LIF neuron utilizes one-hot (power-of-two) encoding of activations, meaning that only one of the spike lanes fires per timestep. This one-hot constraint is applied and learned during training, which sets our model apart from previous binary activation SNNs [4], [5], [6], [8], which do not incorporate such a learned one-hot activation scheme. This encoding approach has several advantages, as it ensures activation precision without the need for multiplication operations. Instead, the neuron’s output corresponds to powers of two, which can be efficiently used to modify the exponents of FP32 weights with a single INT8 addition during membrane potential updates, offering computational efficiency without sacrificing accuracy.
Additionally, our work includes a comprehensive evaluation across different architectures, including both convolutional neural networks and more complex, spike-driven transformer models [5]. This is in contrast to previous works that focused on specific networks or architectures.
Our model is distinct from the work of authors like those in [7], who introduced uniform activation quantization to offload multi-timestep processing and improve activation precision per timestep. However, this quantization approach introduces additional computational overhead and breaks the multiplication-free property of SNNs. Table 1 of our manuscript compares our one-hot M-LIF SNN to binary-activated SNNs that aim to minimize the number of timesteps while preserving accuracy. We also provide a comparison to log-quantized ANNs in Table 2, showing that our model, using unit-timestep processing, achieves competitive accuracy and energy efficiency. To clarify the performance differences with other methods, we have included the following comparison to [7] below:
| Neuron Model | Dataset | Architecture | T | Accuracy (%) | Comp. Energy (mJ) |
|---|---|---|---|---|---|
| LIF [1] | CIFAR100 | VGG-9 | 1 | 72.63 | 190 |
| Parallel-MT (2-bit uniformly quantized activation) [1] | CIFAR100 | VGG-9 | 1 | 73.89 | 440 |
| Cascade-MT (4-bit uniformly quantized activation) [1] | CIFAR100 | VGG-9 | 1 | 74.80 | 740 |
| LIF [3] | CIFAR100 | Spike-driven Transformer-2-512 | 1 | 75.8 | 0.221 |
| One-hot M-LIF (3 spike lanes, activations ∈ {0, 1, 2, 4}) | CIFAR100 | Spike-driven Transformer-2-512 | 1 | 78.2 | 0.478 |
As seen in this table, our one-hot M-LIF model (with 3 spike lanes) achieves 78.2% accuracy while using only 0.478 mJ of computational energy, a marked improvement over the Parallel-MT and Cascade-MT models, which achieve 73.89% and 74.80% accuracy, respectively, while consuming much higher computational energy.
IJCAI22 [10] also proposes introducing uniform activation quantization into SNNs but only evaluates on small datasets (CIFAR10, DVS-CIFAR10). We see that our one-hot M-LIF model significantly outperforms their results for both CIFAR10 and DVS-CIFAR10 datasets.
| Neuron Model | Dataset | Architecture | T | Accuracy (%) |
|---|---|---|---|---|
| MLF [2] | CIFAR10 | MLF(K=3) + spikingDS-ResNet | 4 | 94.25 |
| One-hot M-LIF (3 spike lanes, activations ∈ {0, 1, 2, 4}) | CIFAR10 | Spike-driven Transformer-2-512 | 1 | 95.4 |
| MLF [2] | DVS-CIFAR10 | MLF(K=3) + spikingDS-ResNet | 10 | 70.36 |
| One-hot M-LIF (4 spike lanes, activations ∈ {0, 1, 2, 4, 8}) | DVS-CIFAR10 | VGGSNN | 10 | 84.7 |
Our one-hot M-LIF SNN achieves higher accuracy on CIFAR10 (95.4% vs. 94.25%) and over 14% higher accuracy on DVS-CIFAR10 (84.7% vs. 70.36%) when compared to [10], showcasing superior performance across both static and dynamic datasets.
Finally, we also note that AAAI22 [1] focuses exclusively on speech recognition tasks, whereas our experimental results are centered on static and dynamic image classification, providing broader applicability and relevance across a variety of tasks.
Please see references in first comment
Thank you for your detailed response.
Based on your reply, I understand that the key distinction of the proposed method compared to previous approaches lies in constraining the multibit representation of spikes to powers of two. The motivation for this choice is the improved efficiency of hardware implementations, as multiplication by powers of two can be executed more efficiently. While this approach appears to yield good accuracy, the paper lacks a detailed discussion of why this is the case. For instance, would the model achieve similar results if values other than powers of two were used, such as powers of b (where b is an arbitrary value), or even a different set of numbers not based on any specific power? A thorough discussion and analysis of why your approach outperforms prior methods, beyond simply presenting the results, would strengthen the paper.
Moreover, one of the claims is that the model can perform additions instead of multiplications, as in standard LIF neurons, supported by the explanation in Figure 4. However, in commercial hardware, the operation shown in Figure 4 would likely still be executed as a floating-point (FP) multiplication rather than an INT8 addition. I recommend including a reference to support the claim that commercial hardware performs multiplication of a floating-point number and a power of two as described, to substantiate this assertion.
Regarding the energy efficiency of the proposed method, I appreciate the inclusion of memory-related energy consumption. However, the discussion remains insufficient. I suggest considering the methodology presented in [1] for a more robust comparison of energy consumption between ANNs and SNNs. Alternatively, the authors may wish to soften their claims about the energy efficiency of SNNs compared to ANNs. Under the current assumptions, the energy consumption estimates are unrealistic, making claims such as "20x lower energy consumption than ANNs" inaccurate. Instead, I recommend focusing on comparisons among SNN models.
Additionally, it would be important to report the energy consumption (in terms of MACs and firing rates) and accuracy at iso-performance and iso-energy points when comparing SNN models. This analysis would provide a more comprehensive evaluation of the proposed method's efficiency.
[1] Zhanglu Yan, Zhenyu Bai, Weng-Fai Wong (2024). “Reconsidering the energy efficiency of spiking neural networks.” https://arxiv.org/abs/2409.08290
Q7: Additionally, it would be important to report the energy consumption (in terms of MACs and firing rates) and accuracy at iso-performance and iso-energy points when comparing SNN models. This analysis would provide a more comprehensive evaluation of the proposed method's efficiency.
A7: Thank you for your suggestion to report energy consumption at iso-performance and iso-energy points. We agree that this type of analysis can provide valuable insights. However, we would like to clarify the methodology employed in our work to evaluate energy consumption and accuracy.
In our approach, firing rates are not predetermined during training. Instead, the architecture, number of timesteps, and—for one-hot M-LIF SNNs—the number of spike lanes are fixed prior to training. The per-layer firing rates are derived during inference over the test set, consistent with methodologies established in prior works. Because firing rates are not fixed beforehand, it is inherently challenging to determine exact iso-energy or iso-performance points. However, this does not diminish the significance of the results we present. The comparisons across Tables 1, 2, and 3 demonstrate that one-hot M-LIF SNNs achieve better energy-accuracy tradeoffs and often highlight points where our method offers clear advantages in efficiency.
To illustrate this efficiency, we draw attention to specific results in Table 3:
- A traditional SNN (S=1, T=10) from ICLR22 [7] consumes more compute energy yet achieves lower accuracy than a one-hot M-LIF SNN (S=4, T=10).
- Similarly, at T=3, the one-hot M-LIF SNN (S=4) achieves markedly higher accuracy than the traditional SNN (S=1) from ICLR22 [7], again at lower compute energy.
These examples, among many others provided across Tables 1, 2, and 3, highlight the proposed method’s superior computational efficiency. While we cannot compute strict iso-energy or iso-performance points due to the nature of firing rate estimation, the results clearly showcase how one-hot M-LIF SNNs achieve higher accuracy at lower compute energy levels compared to traditional SNNs.
References
[1] Yan, et al., “Reconsidering the energy efficiency of spiking neural networks.” In arXiV 2024.
[2] Miyashita et al., “Convolutional Neural Networks using Logarithmic Data Representation.” In arXiV 2016.
[3] Przewlocka-Rus et al., “Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks.” In tinyML 2022.
[4] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[5] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[6] Intel® 64 and IA-32 Architectures Software Developer’s Manual, Volume 2A, Section 3-437. (2024). Intel Corporation. Available at: https://cdrdv2.intel.com/v1/dl/getContent/671200.
[7] Deng et. al, “Temporal efficient training of spiking neural network via gradient re-weighting.” In ICLR 2022.
[8] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[9] Yao, et al., “Spike-driven transformer.” In NeurIPS 2023.
[10] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[11] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
Q6: Regarding the energy efficiency of the proposed method, I appreciate the inclusion of memory-related energy consumption. However, the discussion remains insufficient. I suggest considering the methodology presented in [1] for a more robust comparison of energy consumption between ANNs and SNNs. Alternatively, the authors may wish to soften their claims about the energy efficiency of SNNs compared to ANNs. Under the current assumptions, the energy consumption estimates are unrealistic, making claims such as "20x lower energy consumption than ANNs" inaccurate. Instead, I recommend focusing on comparisons among SNN models.
A6: Thank you for referencing [1]. We appreciate the suggestion to refine our energy efficiency discussion, but we would like to clarify several key aspects:
First, the initial reviewer comment requested a discussion on how including memory-related energy consumption might impact our comparisons, without referencing [1] or any specific methodology. In response, we adopted a memory energy modeling approach based on prior work published at ICLR24 [11]. We conducted an iso-architecture comparison across multiple prior works and included the results for VGG16 on ImageNet in the rebuttal and the revised manuscript (Appendix A.5). This expanded discussion addresses the inclusion of memory access energy.
Second, the methodology in [1], while interesting, represents concurrent work submitted to arXiv only a month prior to the ICLR25 submission deadline (August 29, 2024). Our manuscript focuses on comparisons with prior, peer-reviewed works widely accepted by the community, such as ECCV22 [8], NeurIPS23 [9], and TNNLS23 [10]. These works employ well-established methodologies for estimating computational energy expenditure, which our manuscript also adopts.
Regarding the claim of "20× lower energy consumption than ANNs," we emphasize that this pertains specifically to computational energy expenditure and is consistently labeled as such throughout our manuscript (Tables 1, 2, and 3 under “Comp. Energy,” and in Sections 4 and 6). For example, Section 6 explicitly describes the gain as an enhancement of the computational efficiency. For further clarity, we direct the reviewer to Section 4 (page 7), where we explicitly state:
“It is known that memory access energy can be significantly higher than compute energy (Horowitz, 2014; Han et al., 2015), and the number of memory accesses scales linearly with the number of timesteps in SNNs (Chowdhury et al., 2022). However, estimating memory energy improvements would depend on hardware architecture and system configuration. Therefore, as noted in Chowdhury et al. (2022), we are restricting our attention to the computational energy benefits, defined in Equation 11 (Chowdhury et al., 2022), of one-hot M-LIF SNNs and conventional SNNs over ANNs. As a result, we consider this to be an optimistic energy gain estimate when T > 1. Note that when T = 1, memory requirements are identical for both SNNs and ANNs.”
In summary, our computational energy estimates are realistic, grounded in prior work, and clearly labeled to prevent misinterpretation. While we appreciate the reference to [1], incorporating it fully would require significantly broader adjustments that go beyond the current scope of our manuscript. Nevertheless, we thank the reviewer for the suggestion.
References
[1] Yan, et al., “Reconsidering the energy efficiency of spiking neural networks.” In arXiV 2024.
[2] Miyashita et al., “Convolutional Neural Networks using Logarithmic Data Representation.” In arXiV 2016.
[3] Przewlocka-Rus et al., “Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks.” In tinyML 2022.
[4] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[5] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[6] Intel® 64 and IA-32 Architectures Software Developer’s Manual, Volume 2A, Section 3-437. (2024). Intel Corporation. Available at: https://cdrdv2.intel.com/v1/dl/getContent/671200.
[7] Deng et. al, “Temporal efficient training of spiking neural network via gradient re-weighting.” In ICLR 2022.
[8] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[9] Yao, et al., “Spike-driven transformer.” In NeurIPS 2023.
[10] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[11] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
Q5: Moreover, one of the claims is that the model can perform additions instead of multiplications, as in standard LIF neurons, supported by the explanation in Figure 4. However, in commercial hardware, the operation shown in Figure 4 would likely still be executed as a floating-point (FP) multiplication rather than an INT8 addition. I recommend including a reference to support the claim that commercial hardware performs multiplication of a floating-point number and a power of two as described, to substantiate this assertion.
A5: Thank you for raising this point. We appreciate your suggestion and have elaborated on this point with an additional reference in Appendix A.2 of the revised manuscript. For commercial hardware, it is worth noting that modern instruction set architectures, such as x86, include specific support for efficiently scaling floating-point numbers. For example, the x87 floating-point unit (FPU) provides specialized instructions like fscale [6], which are designed to scale floating-point numbers by powers of two, rather than performing a general floating-point multiplication as implied. This indeed suggests that the operation described in Figure 4 aligns with existing commercial hardware capabilities.
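As a self-contained, hardware-agnostic check of this property (our own demonstration, not taken from the paper or from [6]), the snippet below shows that multiplying an FP32 value by a power of two changes only the 8-bit exponent field of its IEEE-754 encoding, which is exactly what the INT8 addition in Figure 4 exploits:

```python
import struct

def fp32_fields(x):
    """Return (sign, exponent, mantissa) of the IEEE-754 single-precision encoding of x."""
    bits = struct.unpack('>I', struct.pack('>f', x))[0]
    return bits >> 31, (bits >> 23) & 0xFF, bits & 0x7FFFFF

w = 0.3125                       # an arbitrary normal (non-denormal) FP32 weight
for s in range(3):               # scale by 2**s for s = 0, 1, 2
    sign, exp, mant = fp32_fields(w * (2 ** s))
    print(s, exp, hex(mant))     # exponent field grows by exactly s; sign and mantissa unchanged
```

(The shortcut holds for normal FP32 values; denormals and overflow would need special handling in a real design.)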
Our proposed approach is also designed to be adaptable to various hardware architectures, including custom ASICs or FPGAs, where the operation depicted in Figure 4 can indeed be implemented efficiently. As shown in prior work [4-5], leveraging systolic arrays for SNN inference with a small number of timesteps is feasible, and these methods can be extended to one-hot M-LIF SNNs. Specifically, this would involve storing the exponent s of the spike lane output 2^s, where s ∈ {0, …, S−1} and S represents the number of spike lanes, rather than traditional single-bit activations. Our experiments indicate that using up to four spike lanes (S=4) provides significant accuracy improvements, with diminishing returns beyond this point (as detailed in our ablation study, referenced in response to Q2 for Reviewer egRh). This approach introduces only a minimal memory overhead—1 to 2 additional bits per activation—compared to traditional SNNs, enabling both higher accuracy and reduced timesteps. The reduction in timesteps, in turn, proportionally reduces the high-energy memory loads of FP32 weights and membrane potentials, contributing to overall efficiency.
References
[1] Yan, et al., “Reconsidering the energy efficiency of spiking neural networks.” In arXiV 2024.
[2] Miyashita et al., “Convolutional Neural Networks using Logarithmic Data Representation.” In arXiV 2016.
[3] Przewlocka-Rus et al., “Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks.” In tinyML 2022.
[4] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[5] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[6] Intel® 64 and IA-32 Architectures Software Developer’s Manual, Volume 2A, Section 3-437. (2024). Intel Corporation. Available at: https://cdrdv2.intel.com/v1/dl/getContent/671200.
[7] Deng et. al, “Temporal efficient training of spiking neural network via gradient re-weighting.” In ICLR 2022.
[8] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[9] Yao, et al., “Spike-driven transformer.” In NeurIPS 2023.
[10] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[11] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
Q4: While this approach appears to yield good accuracy, the paper lacks a detailed discussion of why this is the case. For instance, would the model achieve similar results if values other than powers of two were used, such as powers of b (where b is an arbitrary value), or even a different set of numbers not based on any specific power? A thorough discussion and analysis of why your approach outperforms prior methods, beyond simply presenting the results, would strengthen the paper.
A4: Thank you for your thoughtful question. The choice of using powers of two (base 2) is primarily motivated by the substantial efficiency benefits for hardware implementation, as detailed in Section 4 of our manuscript. The base-2 representation aligns seamlessly with commercial hardware, which employs binary number representations such as the IEEE-754 standard for FP32.
In contrast, using an arbitrary base other than two (e.g., base 3) introduces a non-integer scaling factor for the base-2 exponents used in FP32, leading to overhead in hardware processing. Powers of two allow for a straightforward INT8 addition on the exponents during computation, as demonstrated in Appendix A.2 and Figure 4, which is significantly more efficient than the full-precision FP32 multiplications required in an ANN. Additionally, the hardware implementation of one-hot M-LIF thresholding benefits from this choice, as it enables the use of a simple priority encoder to efficiently determine the most significant non-zero bit of the accumulated membrane potential. This simplicity is lost when arbitrary bases are used.
The use of powers of two has been studied and adopted in prior ANN works [2-3] precisely because of its compatibility with hardware efficiency and binary number system representations. Regarding the observed improvements in accuracy compared to binary-activated SNNs, our approach expands the range of activations, which allows one-hot M-LIF SNNs to encode and learn more effectively within a single timestep. While this is an intuitive advantage, our experimental results spanning a comprehensive set of evaluations serve to empirically validate this design choice. We hope this explanation addresses your concerns.
References
[1] Yan, et al., “Reconsidering the energy efficiency of spiking neural networks.” In arXiV 2024.
[2] Miyashita et al., “Convolutional Neural Networks using Logarithmic Data Representation.” In arXiV 2016.
[3] Przewlocka-Rus et al., “Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks.” In tinyML 2022.
[4] Lee et al., "Reconfigurable Dataflow Optimization for Spatiotemporal Spiking Neural Computation on Systolic Array Accelerators." In ICCD 2020.
[5] Lee et al., "Parallel Time Batching: Systolic-Array Acceleration of Sparse Spiking Neural Computation." In HPCA 2022.
[6] Intel® 64 and IA-32 Architectures Software Developer’s Manual, Volume 2A, Section 3-437. (2024). Intel Corporation. Available at: https://cdrdv2.intel.com/v1/dl/getContent/671200.
[7] Deng et. al, “Temporal efficient training of spiking neural network via gradient re-weighting.” In ICLR 2022.
[8] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[9] Yao, et al., “Spike-driven transformer.” In NeurIPS 2023.
[10] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[11] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
Q2: Regarding the energy efficiency claims in the weaknesses section, could the authors elaborate on the assumptions made for the energy comparisons? How might these comparisons change if memory access energy is included?
A2: Thank you for your insightful comments. We appreciate the importance of contextualizing energy comparisons and recognize that memory access can significantly impact energy consumption, particularly in SNNs. We will clarify the assumptions behind our energy comparisons. Below is a detailed response to your concerns.
Energy Comparisons and Assumptions
As noted in Section 4, energy modeling for SNNs has been performed using the methodology adopted from prior works in the field [4-8]. In the absence of a specified hardware architecture, we focus on computational energy, a metric that is commonly used in SNN papers. Specifically, we calculate computational energy based on the number of multiply-accumulate (MAC) or add-accumulate (AC) operations, the spike rates per layer, and the unit MAC/AC energies for a 45nm technology node taken from [9]. Therefore, the computational energies in Tables 1, 2, and 3 reflect both the number of MAC/AC operations and the average firing rates per layer, providing a reasonable estimate of the energy consumed by non-zero operations.
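For transparency, this calculation can be summarized by the sketch below (a paraphrase of the standard methodology; the layer statistics and unit energies shown are illustrative placeholders rather than values from our paper or tables):

```python
def ann_compute_energy(macs_per_layer, e_mac):
    """ANN compute energy: every MAC is executed."""
    return sum(macs_per_layer) * e_mac

def snn_compute_energy(acs_per_layer, firing_rates, timesteps, e_ac):
    """SNN compute energy: accumulate operations only fire at the measured
    per-layer spike rates, repeated over the number of timesteps."""
    return timesteps * sum(ops * r for ops, r in zip(acs_per_layer, firing_rates)) * e_ac

# Illustrative placeholders only:
E_MAC, E_AC = 4.6e-12, 0.9e-12      # 45nm unit energies in joules, in the style of [9]
layer_ops = [1.8e9, 9.2e8, 4.6e8]   # hypothetical per-layer MAC/AC counts
rates = [0.12, 0.08, 0.05]          # hypothetical measured average firing rates
print(ann_compute_energy(layer_ops, E_MAC),
      snn_compute_energy(layer_ops, rates, timesteps=1, e_ac=E_AC))
```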
Memory Access Energy
We acknowledge in our manuscript that memory access energy can be orders of magnitude higher than compute energy, particularly when dealing with multi-timestep SNNs. As you rightly pointed out, memory access energy plays a crucial role in energy consumption, especially since SNNs require more memory accesses than ANNs due to multi-timestep processing. This is indeed important when comparing SNNs and ANNs and motivates the goal of our unit-timestep one-hot M-LIF approach. By reducing the number of timesteps during SNN inference while improving accuracy and maintaining the low-spike rates of traditional SNNs, we can decrease the multi-timestep processing energy overhead. In our manuscript, we compare unit-timestep one-hot M-LIF SNNs with ANNs in Tables 1 and 2. There, the memory access patterns for ANNs are identical to those in unit-timestep one-hot M-LIF SNNs with the latter benefitting from a smaller memory footprint due to the lower bit-width activations.
Impact of Memory Access Energy
Although defining a precise memory architecture requires detailed assumptions regarding weight, input, and partial output reuse, we have included an analysis based on the memory energy model provided in Appendix A.8.2 of [8] for convolutional neural networks. This model allows us to estimate the impact of memory energy by incorporating per-layer spike rates, which we obtained from multiple prior works [4], [6], [8].
For our analysis, we focused on VGG16 trained on ImageNet, as prior works [4], [6], [8] provide detailed per-layer spike rates, and we used the same 45nm technology for energy modeling, allowing for consistent energy comparisons with previous works [4] and [6]. The table below illustrates the impact of memory access energy on the overall energy comparison.
| Method | S | T | Accuracy (%) | Comp. Energy (mJ) | Mem. Energy (mJ) | Total Energy (mJ) |
|---|---|---|---|---|---|---|
| ANN | / | / | 72.56 | 71.2 | 781 | 852.2 |
| Diet-SNN [7] | 1 | 5 | 69 | 6.09 | 58.8 | 64.89 |
| Temporal Pruning [5] | 1 | 1 | 69 | 2.89 | 15.5 | 18.39 |
| BANN ICLR24 [4] | 1 | 1 | 68 | 3.4 | 17.1 | 20.5 |
| One-hot M-LIF SNN | 3 | 1 | 71.05 | 3.73 | 22.1 | 25.83 |
As shown above, one-hot M-LIF SNNs consume substantially less total energy than ANNs while maintaining higher accuracy than prior unit-timestep SNN works (71.05% vs. 69% [5] and 68% [4]). This table also highlights the advantage of unit-timestep processing: we achieve higher accuracy than Diet-SNN [7] while consuming far less total energy, the majority of which, in their case, stems from the memory energy overhead of multi-timestep (T=5) processing.
We hope this provides a clearer and more contextualized view of our energy comparisons. Finally, we have added this analysis for memory access energy impact in Appendix A.5 of our manuscript where we also include a graph illustrating a comparison of per layer spike rates for VGG16 ImageNet against [4], [6], [8].
Q1: The manuscript lacks comparisons with other methods that use graded (weighted) spikes. The idea of utilizing multibit outputs (one-hot encoded or otherwise) is not novel, as it has been previously explored, for instance in [1], and is a feature available in neuromorphic hardware like Loihi 2 [2], as well as in commonly used SNN libraries like snnTorch [3]. I recommend that the authors provide a discussion highlighting the unique features of the proposed model in comparison to these existing methods and tools.
A1: Thank you for your thoughtful feedback and for bringing relevant works to our attention. We appreciate the opportunity to clarify the distinction between our approach and existing methods that use graded (weighted) spikes, as well as to highlight the unique features of our proposed model. We have added a discussion in Section 2.5 that compares our work with the methods mentioned.
In AAAI22 (Figure 4) [1], the authors essentially propose uniform activation quantization as a way to alleviate the burden of multi-timestep processing while preserving better activation precision per timestep. However, this approach breaks the multiplication-free property of traditional SNNs by introducing uniform quantization, which leads to an increase in spike rates. This is a key difference from our approach, as explained in Section 4 of our manuscript. Our one-hot M-LIF neuron utilizes a one-hot (power-of-two) encoding of activations, ensuring that only one of the S spike lanes is activated per timestep. This constraint is applied and learned during training, unlike in [1], where no similar restriction is enforced. The outputs from our neurons correspond to powers of two, enabling efficient computation. Specifically, these powers-of-two outputs allow us to modify the exponents of FP32 weights through a single INT8 addition during membrane potential updates, making our approach energy-efficient and maintaining the multiplication-free property of SNNs. In contrast, methods such as those in [1] that use uniformly quantized activations (e.g., 3 or 4 bits) still require multiplications, which can be less efficient in terms of both computation and energy consumption.
While works [2] and [3] support multi-bit SNN operations with broader functionality compared to our proposed one-hot encoding, we would like to emphasize the unique computational efficiency offered by one-hot M-LIF neurons. The output range of one-hot M-LIF neurons represents a subset of uniformly quantized activations, so our approach can provide distinct efficiency benefits, particularly when hardware is specifically designed to leverage these properties, as discussed in our paper. Nonetheless, the one-hot M-LIF SNNs can be effectively mapped onto existing neuromorphic hardware architectures, thereby demonstrating both practical applicability and strong alignment with current technological advancements.
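As a small illustration of this subset relationship (a hypothetical helper; the level sets match the examples in our tables, e.g., activations in {0, 1, 2, 4} for S = 3):

```python
def one_hot_levels(num_lanes):
    """Output levels of a one-hot M-LIF neuron with S = num_lanes spike lanes."""
    return [0] + [2 ** s for s in range(num_lanes)]

print(one_hot_levels(3))   # [0, 1, 2, 4] -- a sparse subset of the uniform 3-bit levels 0..7
print(one_hot_levels(4))   # [0, 1, 2, 4, 8]
```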
We hope this clarifies the unique aspects of our model in comparison to existing approaches and tools.
References
[1] Ponghiran et al., “Spiking Neural Networks with Improved Inherent Recurrence Dynamics for Sequential Learning.”. In AAAI 2022.
[2] M. Davies et al., "Advancing Neuromorphic Computing With Loihi: A Survey of Results and Outlook," in Proceedings of the IEEE 2021.
[3] https://snntorch.readthedocs.io/en/latest/index.html
[4] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[5] Yao, et al., “Spike-driven transformer.” In NeurIPS 2023.
[6] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[7] Wang et al., "MT-SNN: Enhance Spiking Neural Network with Multiple Thresholds." In ArXiv 2023.
[8] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[9] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
[10] Feng et al., "Multi-Level Firing with Spiking DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking Neural Networks." In IJCAI 2022.
This paper proposes a novel one-hot multi-level leaky integrate-and-fire (M-LIF) neuron model, which represents activations as a set of one-hot, binary-weighted spike lanes. Experimental results on various datasets demonstrate that the M-LIF model achieves a better trade-off between accuracy and energy efficiency compared to other methods.
Strengths
- The paper is well organized and uses formulas and figures to aid understanding.
- The proposed spike lane concept is innovative, as it enhances the information in activation maps while maintaining the energy efficiency characteristic of SNNs. The authors validate the effectiveness of the proposed model through experiments on both static and dynamic datasets. Overall, the M-LIF model achieves the best trade-off between accuracy and energy efficiency.
- The paper also compares the M-LIF model with LQ-ANNs and demonstrates through experiments that M-LIF generally performs better.
Weaknesses
- Given that the proposed spike lanes share similarities with quantized (S-bit activation, S is the number of spike lanes) models, the authors should include additional comparisons, both in terms of accuracy and FLOPs/energy consumption, with state-of-the-art quantization approaches such as [1][2][3]. This would provide a more comprehensive and fair evaluation of the method's performance.
- Although the authors provide some results for models with varying numbers of spike lanes, an additional ablation study would be beneficial. Specifically, the authors could conduct experiments with a fixed number of time steps, varying the number of spike lanes (S) from 1 to an upper bound (e.g., 8), and plot the accuracy versus S. This would help identify the point of saturation. Additionally, a complementary study could be performed by fixing the number of spike lanes and varying the number of time steps from 1 to an upper bound (e.g., 10). Such an analysis would offer a clearer comparison of the impact of spike lanes versus the time steps on model performance.
[1] Jaehyeon Moon et al, Instance-Aware Group Quantization for Vision Transformers, CVPR 2024
[2] Yefei He et al, BiViT: Extremely Compressed Binary Vision Transformers, ICCV 2023
[3] Yanjing Li et al, Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer, NeurIPS, 2022
Questions
For the DVS-CIFAR10 dataset, as the authors add an additional multi-level input layer encoding, could the authors provide a latency comparison between the M-LIF model and other existing models?
Q3: For the DVS-CIFAR10 dataset, as the authors add an additional multi-level input layer encoding, could the authors provide a latency comparison between the M-LIF model and other existing models?
A3: Thank you for your insightful question regarding the latency comparison between our one-hot M-LIF SNN model and existing models, specifically in relation to the DVS-CIFAR10 dataset.
Both traditional SNNs and our one-hot M-LIF SNNs require a pre-processing step, performed once per inference, to convert the asynchronous event stream into frames prior to running model inference. This step is necessary for both models to enable effective processing of the input data. In our approach, we use a multi-level input layer encoding that allows us to reduce the number of timesteps required for inference. Specifically, we are able to halve the number of timesteps, which effectively halves the overall latency. With this reduction, our model with multiple spike lanes achieves slightly higher accuracy than traditional SNNs run for twice as many timesteps, as reported in Table 3 of our submission. It is important to note that the pre-processing overhead introduced by the multi-level input layer encoding is minimal compared to the latency incurred by multi-timestep inference. As a result, the overall latency of our model remains competitive, with the benefit of achieving higher accuracy at a lower number of timesteps.
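Purely for illustration (the pairwise frame aggregation and nearest-level quantization below are our own assumptions and may differ from the exact encoding used in the paper), a multi-level input layer can halve the number of input frames along these lines:

```python
import numpy as np

def multi_level_input_encoding(event_frames, num_lanes=4):
    """Hypothetical encoding: merge adjacent event frames pairwise (halving T)
    and snap the accumulated event counts to the nearest one-hot level."""
    levels = np.array([0] + [2 ** s for s in range(num_lanes)])
    merged = event_frames[0::2] + event_frames[1::2]            # shape (T/2, H, W)
    idx = np.abs(merged[..., None] - levels).argmin(axis=-1)    # nearest level per pixel
    return levels[idx]

frames = np.random.randint(0, 2, size=(10, 48, 48))   # 10 binary event frames (toy input)
encoded = multi_level_input_encoding(frames)           # 5 multi-level input frames
```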
In summary, while the additional multi-level input layer encoding introduces a minor overhead, its impact on latency is negligible when compared to the significant latency savings gained from reducing the number of timesteps. Therefore, our one-hot M-LIF SNN model offers a more efficient solution, achieving comparable or slightly better performance with reduced computational delay.
References
[1] Jaehyeon Moon et al., “Instance-Aware Group Quantization for Vision Transformers.” In CVPR 2024
[2] Yefei He et al., “BiViT: Extremely Compressed Binary Vision Transformers.” In ICCV 2023
[3] Yanjing Li et al., “Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer.” In NeurIPS, 2022
[4] Yao, et al., “Spike-driven transformer.” In NeurIPS 2023.
[5] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[6] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
I appreciate the authors' detailed response, particularly the additional experiments involving the increased value of S. I agree that the paper presents good empirical results. However, the concept of quantizing activations using one-hot (power-of-two) encoding closely aligns with existing work on quantization. While I acknowledge the additional efforts, such as the modifications to the surrogate gradient, to integrate this approach with SNNs, I believe the overall novelty of the work may not fully meet the standards typically expected at ICLR. I am sorry, but I decided to maintain my original score.
Q2: Although the authors provide some results for models with varying numbers of spike lanes, an additional ablation study would be beneficial. Specifically, the authors could conduct experiments with a fixed number of time steps, varying the number of spike lanes (S) from 1 to an upper bound (e.g., 8), and plot the accuracy versus S. This would help identify the point of saturation. Additionally, a complementary study could be performed by fixing the number of spike lanes and varying the number of time steps from 1 to an upper bound (e.g., 10). Such an analysis would offer a clearer comparison of the impact of spike lanes versus the time steps on model performance
A2: Thank you for your valuable suggestion regarding the ablation study. We appreciate your interest in further understanding the effects of varying spike lanes and time steps on model performance.
In response to your suggestion, we have conducted an ablation study where we varied the number of spike lanes (S) while keeping the number of timesteps fixed at T = 3 for VGGSNN on DVS-CIFAR10, and we provide the results below. Our findings indicate that as the number of spike lanes increases from S = 1 to S = 4, there is a significant improvement in accuracy, after which performance saturates:
| T | S | Accuracy (%) |
|---|---|---|
| 3 | 1 | 74.7 |
| 3 | 2 | 78.6 |
| 3 | 3 | 79.8 |
| 3 | 4 | 82.5 |
| 3 | 5 | 82.5 |
Additionally, when we increase the number of timesteps to T = 5, we observe a similar saturation in accuracy as the number of spike lanes grows. We also note that the accuracy improvement due to spike lanes becomes less pronounced, but the overall accuracy ceiling improves, as shown below:
| T | S | Accuracy (%) |
|---|---|---|
| 5 | 1 | 78.0 |
| 5 | 2 | 81.5 |
| 5 | 3 | 83.0 |
| 5 | 4 | 83.3 |
| 5 | 5 | 82.9 |
From these results, we conclude that both timesteps and spike lanes contribute to better model performance. We also note that multi-timestep processing introduces significant energy overhead, particularly with respect to memory access. As noted in previous works [5], memory access energy for each timestep can be an order of magnitude higher than the energy required for FP32 additions [6]. This increase in memory energy makes traditional SNNs less efficient than ANNs when the performance gains from additional timesteps do not outweigh the energy costs. Nevertheless, we include additional results below to show one-hot M-LIF SNN accuracy when T > 1, showing improved accuracy (but at the energy cost of multi-timestep processing).
| Dataset | Architecture | S | T | Accuracy (%) |
|---|---|---|---|---|
| CIFAR10 | ResNet20 | 2 | 1 | 92.61 |
| CIFAR10 | ResNet20 | 2 | 2 | 93.10 |
| CIFAR10 | ResNet20 | 2 | 3 | 93.68 |
| CIFAR100 | VGG16 | 2 | 1 | 71.23 |
| CIFAR100 | VGG16 | 2 | 2 | 72.68 |
| CIFAR100 | VGG16 | 3 | 1 | 72.59 |
| CIFAR100 | VGG16 | 3 | 2 | 73.12 |
We hope this additional analysis addresses your request and further clarifies the trade-offs between spike lanes and timesteps. We have also added this ablation study to Appendix A.4 of our manuscript.
References
[1] Jaehyeon Moon et al., “Instance-Aware Group Quantization for Vision Transformers.” In CVPR 2024
[2] Yefei He et al., “BiViT: Extremely Compressed Binary Vision Transformers.” In ICCV 2023
[3] Yanjing Li et al., “Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer.” In NeurIPS, 2022
[4] Yao et al., “Spike-driven transformer.” In NeurIPS 2023.
[5] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[6] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Q1: Given that the proposed spike lanes share similarities with quantized (S-bit activation, S is the number of spike lanes) models, the authors should include additional comparisons, both in terms of accuracy and FLOPs/energy consumption, with state-of-the-art quantization approaches such as [1][2][3]. This would provide a more comprehensive and fair evaluation of the method's performance
A1: Thank you for your thoughtful suggestion and for highlighting works [1-3]. We agree that a comprehensive comparison is essential for a robust evaluation of the method. However, it is important to clarify the distinctions between our work and [1-3].
Papers [1] and [3] employ uniform quantization techniques, whereas our approach utilizes one-hot encoding, a critical difference that provides additional hardware benefits and preserves the multiplication-free property of spiking neural networks (SNNs). This distinction is significant because while uniform quantization introduces multiplications—which increase computational complexity and require additional hardware resources—our method avoids this, offering a more efficient solution for SNNs.
Paper [2] focuses on quantizing weights but does not address activation quantization, which is a core aspect of our work. Our approach centers on the quantization of activations through one-hot (power-of-two) encoding, which necessitates modifications to both the neuron model (one-hot M-LIF) and the surrogate gradients used during training. This allows us to achieve energy-efficient, high-accuracy performance in low-latency, memory-efficient regimes (i.e., unit-timestep processing), without sacrificing the fundamental property of SNNs that distinguishes them from traditional artificial neural networks (ANNs)—the ability to perform computations without multiplications.
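To make the multiplication-free property concrete, the sketch below contrasts a standard multiply-accumulate with the equivalent computation when every activation is either zero or a single power of two (at most one lane firing). The function names and example values are hypothetical, chosen only to illustrate the arithmetic; this is not our implementation.

```python
# Illustrative only: with activations restricted to {0, 1, 2, 4, ...}
# (at most one spike lane firing), each weight-activation product reduces
# to scaling the weight's exponent, so no multiplier is required.
import math

def mac_with_multiplies(weights, activations):
    return sum(w * a for w, a in zip(weights, activations))

def mac_multiplication_free(weights, activations):
    acc = 0.0
    for w, a in zip(weights, activations):
        if a == 0:                    # no lane fired -> no work for this synapse
            continue
        k = int(math.log2(a))         # lane index encoded by the activation value
        acc += math.ldexp(w, k)       # w * 2**k via an exponent addition
    return acc

weights = [0.25, -1.5, 0.8, 2.0]
acts = [1, 0, 4, 2]                   # one-hot power-of-two activations
assert mac_with_multiplies(weights, acts) == mac_multiplication_free(weights, acts)
```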
Furthermore, we believe that comparing our method directly with the works you mentioned—[1], [2], and [3]—is not entirely aligned with the scope of our paper. These works focus on optimizing quantization for vision transformer architectures, specifically minimizing the impact of quantization on the attention mechanism. In contrast, our work is focused on optimizing SNNs, particularly through the novel application of one-hot encoding to activations, which enhances both the accuracy and energy efficiency of SNNs.
In our evaluation, we compare our approach against state-of-the-art spike-driven transformer architectures [4], demonstrating significant improvements in performance, especially for large and challenging datasets such as ImageNet. Additionally, we have conducted a direct comparison between unit-timestep one-hot M-LIF SNNs and log-quantized artificial neural networks (LQ-ANNs). Our results show that one-hot M-LIF SNNs either match or outperform LQ-ANNs in both accuracy and inference energy efficiency. Specifically, for CIFAR-10, the performance and energy efficiency are comparable, while for CIFAR-100, our method is up to 54% more energy-efficient than LQ-ANNs, with similar accuracy. These findings, particularly the superior scalability of one-hot M-LIF SNNs on large datasets such as ImageNet, underline the strengths of our approach and the distinct contributions we make to the field of SNNs.
We hope this explanation clarifies the distinctions between our work and the approaches you suggested and highlights the specific contributions of our research to the field of spiking neural networks.
References
[1] Jaehyeon Moon et al., “Instance-Aware Group Quantization for Vision Transformers.” In CVPR 2024
[2] Yefei He et al., “BiViT: Extremely Compressed Binary Vision Transformers.” In ICCV 2023
[3] Yanjing Li et al., “Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer.” In NeurIPS, 2022
[4] Yao et al., “Spike-driven transformer.” In NeurIPS 2023.
[5] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[6] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
This paper introduces the one-hot multi-level leaky integrate-and-fire (M-LIF) neuron, aimed at balancing accuracy and energy efficiency in spiking neural networks (SNNs).
Strengths
S1. It is interesting to explore a new dimension (beyond the time step) for achieving an accuracy-latency trade-off.
S2. The experiments are quite comprehensive, and the source code is valuable.
Weaknesses
W1. The novelty of employing multi-level thresholds in SNNs appears limited, as similar concepts have been introduced in related works [1,2]. What distinguishes M-LIF from these existing approaches?
[1] Wang et al., "MT-SNN: Enhance Spiking Neural Network with Multiple Thresholds." ArXiv 2023.
[2] Feng et al., "Multi-Level Firing with Spiking DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking Neural Networks." IJCAI 2022.
W2. How does M-LIF differ from quantization networks? Specifically, does an M-LIF with S=K operate equivalently to a K-bit quantized neural network (QNN) with quantized activation values?
W3. Table 1 shows that M-LIF significantly increases computational energy consumption, which poses a drawback for M-LIF. For instance, on CIFAR10, although the M-LIF Transformer with S=3 and T=4 achieves the highest accuracy (only 0.3% higher than the Spike-driven Transformer with S=1 and T=4), its computational energy usage is considerably higher. This trend is similarly observed on CIFAR100 and ImageNet.
Questions
The questions I am most concerned about are raised in the Weakness section, particularly W1 and W2.
Q2: How does M-LIF differ from quantization networks? Specifically, does an M-LIF with S=K operate equivalently to a K-bit quantized neural network (QNN) with quantized activation values?
A2: Thank you for this insightful question. While K-bit quantized neural networks represent activations using discrete values, they do not naturally extend to neuromorphic tasks, such as dynamic vision image classification. In contrast, a one-hot M-LIF spiking neural network (SNN) with S = K employs power-of-two values to represent activations, while also extending to neuromorphic tasks, as demonstrated by our experiments on DVS-CIFAR10.
In our paper, we elaborate on these distinctions and similarities in Sections 2.4 and 3.2, where we discuss the relationship between log-quantized artificial neural networks (LQ-ANNs) and unit-timestep M-LIF SNNs. Additionally, Section 5.1.3 presents our evaluation comparisons against log-quantized ANNs. The key differences between LQ-ANNs and one-hot M-LIF SNNs are as follows: (1) during inference, neuron dynamics in M-LIF differ when T > 1 due to their extensibility to neuromorphic tasks, and (2) during training, we learn threshold and leakage parameters for static tasks and employ back-propagation through time when T > 1.
Q3: Table 1 shows that M-LIF significantly increases computational energy consumption, which poses a drawback for M-LIF. For instance, on CIFAR10, although the M-LIF Transformer with S=3 and T=4 achieves the highest accuracy (only 0.3% higher than the Spike-driven Transformer with S=1 and T=4), its computational energy usage is considerably higher. This trend is similarly observed on CIFAR100 and ImageNet.
A3: Thank you for bringing attention to the computational energy consumption in one-hot M-LIF SNNs compared to traditional SNNs. Your observation is valid concerning the increased computational energy of the spike-driven transformer evaluated on CIFAR-10 when comparing the (S=3, T=4) and (S=1, T=4) scenarios. Here, we acknowledge that while the one-hot constraint of the M-LIF SNN guarantees an upper bound on the number of spikes (ensuring that it will not exceed the worst-case scenario of a traditional SNN), there is no explicit guarantee of lower computational energy consumption compared to a traditional SNN with the same architecture and number of timesteps.
However, it is important to contextualize this increase in computational energy within the broader trade-off between computational and memory access energy. As noted, the memory access energy is typically much higher than the computational energy. For example, accessing an 8kB cache in 45nm technology consumes approximately 10 pJ per 32-bit access, compared to just 0.9 pJ for an addition operation [4]. Multi-timestep processing in traditional SNNs incurs a significant increase in memory energy overhead, scaling linearly with the number of timesteps. In contrast, our one-hot M-LIF approach mitigates this overhead by reducing the reliance on multi-timestep processing, making it a more favorable solution from a holistic energy perspective.
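As a rough, back-of-the-envelope illustration of this trade-off, the sketch below uses the 45nm figures quoted above (0.9 pJ per FP32 addition, roughly 10 pJ per 32-bit memory access). The operation counts, spike rate, and simple linear model are placeholder assumptions, not the energy model used in our paper.

```python
# Back-of-the-envelope sketch of compute vs. memory energy. The constants
# follow the 45nm figures cited in this thread; all other numbers are
# placeholder assumptions for illustration.
E_ADD_PJ = 0.9    # FP32 addition
E_MEM_PJ = 10.0   # 32-bit memory access

def total_energy_pj(timesteps, synaptic_ops, spike_rate, mem_accesses_per_step):
    # Compute energy scales with the spikes that actually fire; memory energy
    # scales roughly linearly with the number of timesteps.
    compute = timesteps * spike_rate * synaptic_ops * E_ADD_PJ
    memory = timesteps * mem_accesses_per_step * E_MEM_PJ
    return compute + memory

# Hypothetical network: halving the timesteps mostly saves memory energy.
for T in (4, 2, 1):
    e = total_energy_pj(T, synaptic_ops=1e9, spike_rate=0.15, mem_accesses_per_step=1e8)
    print(f"T={T}: {e / 1e9:.2f} mJ")
```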
Additionally, the observed increase in computational energy is not universally present across all datasets and architectures. For instance, for ResNet-20 on CIFAR10, our approach demonstrates lower computational energy consumption compared to higher-timestep SNNs (excluding memory access energy). Similarly, for the spike-driven transformer on ImageNet, our configuration achieves higher accuracy while consuming less computational energy than its higher-timestep counterpart. These examples underscore the broader benefits of our approach, particularly in scenarios where memory energy plays a dominant role in overall energy consumption. We are happy to discuss any further clarifications regarding these findings and trade-offs.
References
[1] Wang et al., "MT-SNN: Enhance Spiking Neural Network with Multiple Thresholds." In ArXiv 2023.
[2] Feng et al., "Multi-Level Firing with Spiking DS-ResNet: Enabling Better and Deeper Directly-Trained Spiking Neural Networks." In IJCAI 2022.
[3] Yao et al., “Spike-driven transformer.” In NeurIPS 2023.
[4] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Q1: The novelty of employing multi-level thresholds in SNNs appears limited, as similar concepts have been introduced in related works [1,2]. What distinguishes M-LIF from these existing approaches?
A1: Thank you for highlighting the prior works on multi-level thresholds in SNNs. We have now added a paragraph in Section 2.5 of our submission to better contextualize these methods relative to our work. Our proposed one-hot M-LIF neuron model addresses critical computational challenges inherent in approaches [1,2]. We will outline these distinctions below.
In arXiv23 [1], the authors introduce parallel and cascade multi-threshold (MT) neuron models to enhance activation precision per timestep and mitigate the burden of multi-timestep processing. The parallel-MT model uses multiple threshold values, summing spikes from all firing lanes at each timestep, while the cascade-MT model processes them sequentially. These approaches utilize uniform activation quantization, which breaks the multiplication-free characteristic of SNNs and significantly raises spike rates. In contrast, our one-hot M-LIF neuron maintains a single threshold value, leveraging its power-of-two multiples to fire only one spike lane per timestep. This one-hot constraint is incorporated and learned during training—a crucial distinction not found in [1]. The resulting neuron outputs, represented as powers-of-two, allow for efficient FP32 weight exponent updates via a single INT8 addition, whereas [1]’s uniformly quantized outputs (e.g., 4-bit) require multiplications. Furthermore, our evaluations span both convolutional neural networks and more complex, high-performing spike-driven transformer architectures. Unlike [1], we also compare with log-quantized ANNs, which exhibit similarities to our one-hot M-LIF SNNs when operating with a unit timestep. The authors of [1] limit their evaluations to VGG-9 on CIFAR100, using numbers from a 45nm technology for energy estimates. A direct comparison demonstrates that our approach achieves superior accuracy gains with less computational energy overhead:
| Neuron Model | Dataset | Architecture | T | Accuracy (%) | Comp. Energy (mJ) |
|---|---|---|---|---|---|
| LIF [1] | CIFAR100 | VGG-9 | 1 | 72.63 | 190 |
| Parallel-MT (2-bit uniformly quantized activation) [1] | CIFAR100 | VGG-9 | 1 | 73.89 | 440 |
| Cascade-MT (4-bit uniformly quantized activation) [1] | CIFAR100 | VGG-9 | 1 | 74.80 | 740 |
| LIF [3] | CIFAR100 | Spike-driven Transformer-2-512 | 1 | 75.8 | 0.221 |
| One-hot M-LIF (3 spike lanes, activations ∈ {0,1,2,4}) | CIFAR100 | Spike-driven Transformer-2-512 | 1 | 78.2 | 0.478 |
IJCAI22 [2] similarly introduces uniform activation quantization, compromising the multiplication-free property of SNNs. Unlike our work, it does not impose constraints on input/output encoding during training, which is critical for one-hot behavior that enables preserving activation precision per timestep while maintaining computational simplicity. [2] employs individual thresholds that are not multiples of each other, resulting in more parameters. From an evaluation perspective, [2] focuses on residual networks without reporting results on larger datasets like ImageNet or on SOTA architectures, such as spike-driven transformers. [2] also does not offer comprehensive energy analyses to estimate the overhead of their approach. Below, we provide a comparison illustrating our method's superior performance on CIFAR10 and DVS-CIFAR10 datasets:
| Neuron Model | Dataset | Architecture | T | Accuracy (%) |
|---|---|---|---|---|
| MLF [2] | CIFAR10 | MLF(K=3) + spikingDS-ResNet | 4 | 94.25 |
| One-hot M-LIF (3 spike lanes, activations ∈ {0,1,2,4}) | CIFAR10 | Spike-driven Transformer-2-512 | 1 | 95.4 |
| MLF [2] | DVS-CIFAR10 | MLF(K=3) + spikingDS-ResNet | 10 | 70.36 |
| One-hot M-LIF (4 spike lanes, activations ∈ {0,1,2,4,8}) | DVS-CIFAR10 | VGGSNN | 10 | 84.7 |
I agree with the other reviewers that the novelty of this paper is limited. I greatly appreciate the authors' efforts in their rebuttal. However, I have decided to maintain my original score.
This paper proposes a one-hot multi-level leaky integrate-and-fire (M-LIF) neuron model to optimize the accuracy-latency trade-off of deep spiking neural networks. The proposed M-LIF model represents the inputs and outputs of hidden layers as a set of one-hot binary-weighted spike lanes. Experimental results demonstrate superior accuracy in image classification results with only one time step.
Strengths
- The proposed one-time-step SNNs can significantly improve energy efficiency by avoiding repeated memory accesses of the membrane potential and by reducing the number of spikes.
- The accuracy obtained by the proposed method on ImageNet is good.
Weaknesses
- While the paper has moderately strong empirical results, the proposed method is not much different from bit-serial activation quantized neural networks [1], where the spike lanes can be regarded as bits sequentially arriving and accumulating at each neuron. The overhead of the time steps is thus shifted to the spike lanes.
- I understand there is one-hot encoding in the spike lanes; however, this contribution alone may not be sufficient to warrant acceptance at ICLR. This is also similar to power-of-two quantization [2], which has been extensively explored in several quantization works.
- I see some empirical comparisons of this method with log-quantized ANNs, and different modes of operation during training. However, during inference, the operations are almost similar. Even during training, I am not sure if the differences significantly affect the accuracies.
- Several prior SNN works [3-4] have explored the interplay between the ANN activation bit-width and SNN time steps (which forms the foundation of this work); however, they have been ignored in this work.
- I see only one comparison with one-time-step SNNs in Table 1. It would be better to add more comparisons, including the results from [4].
[1] Lam et al., "Precision Batching: Bitserial Decomposition for Efficient Neural Network Inference on GPUs." In PACT 2021.
[2] https://openreview.net/pdf?id=BkgXT24tDS
[3] "Are Conventional SNNs Really Efficient? A Perspective from Network Quantization", CVPR 2024
[4] "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?", ICLR 2024
Questions
Please respond to the weaknesses mentioned above.
Q3: I see some empirical comparisons of this method with log-quantized ANNs, and different modes of operations during training. However, during inference, the operations are almost similar. Even during training, I am not sure if the differences significantly affect the accuracy metrics
A3: Thank you for your thoughtful feedback regarding the comparison between log-quantized ANNs (LQ-ANNs) and our unit-timestep one-hot M-LIF SNNs. While layer computation during inference may appear structurally similar, one critical distinction lies in the degree of sparsity induced by the thresholds and weights learned during training. These differences in sparsity have a direct impact on computational energy, aligning with our empirical observations.

The variations in thresholds and weights between LQ-ANNs and M-LIF SNNs arise from their distinct training processes. One-hot M-LIF SNNs are trained in a single-phase approach, whereas LQ-ANN training is typically conducted in two phases per epoch: the first phase records a percentile value for each layer's input activation distribution using the full training dataset and full-precision inference, and the second phase employs a straight-through estimator to approximate the gradient with respect to the quantized activations.

While there are certain similarities between one-hot M-LIF SNNs with a unit timestep (T = 1) and LQ-ANNs, key differences remain. As detailed in Equations (9-10) in Section 3.2 of our submission, these differences include variations in the firing thresholds and in the final output value ranges. Moreover, one-hot M-LIF SNNs can be extended to sequential processing (i.e., T > 1). Another significant distinction is the use of surrogate gradients in one-hot M-LIF SNNs during training: we adapted these surrogate gradients from traditional SNN research to fit the specific characteristics of the one-hot M-LIF neuron model, which further differentiates our approach and enhances the effectiveness and accuracy of our model relative to LQ-ANNs.
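For illustration, the snippet below sketches a generic single-threshold spike function with a rectangular surrogate gradient, i.e., the kind of machinery contrasted here with the straight-through estimator of LQ-ANNs. The threshold and width values are placeholders, and the surrogate we actually adapt for the one-hot M-LIF neuron differs in its details.

```python
# Generic surrogate-gradient spike function (illustrative, not the exact
# surrogate used for the one-hot M-LIF neuron).
import torch

class SurrogateSpike(torch.autograd.Function):
    @staticmethod
    def forward(ctx, membrane, threshold, width):
        ctx.save_for_backward(membrane)
        ctx.threshold, ctx.width = threshold, width
        return (membrane >= threshold).float()   # hard spike in the forward pass

    @staticmethod
    def backward(ctx, grad_output):
        (membrane,) = ctx.saved_tensors
        # Rectangular surrogate: pass gradients only near the threshold.
        near = (torch.abs(membrane - ctx.threshold) < ctx.width).float()
        return grad_output * near / (2 * ctx.width), None, None

u = torch.randn(8, requires_grad=True)            # membrane potentials
spikes = SurrogateSpike.apply(u, 1.0, 0.5)
spikes.sum().backward()                           # gradients flow through the surrogate
print(spikes, u.grad)
```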
References:
[1] Lam et al. "Precision Batching: Bitserial Decomposition for Efficient Neural Network Inference on GPUs." In PACT 2021.
[2] Yuhang Li et al., “Additive powers-of-two quantization: An efficient non-uniform discretization for neural networks.” In ICLR 2020.
[3] Shen et al., "Are Conventional SNNs Really Efficient? A Perspective from Network Quantization." In CVPR 2024.
[4] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[5] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[6] Yao et al., “Spike-driven transformer.” In NeurIPS 2023.
[7] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[8] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Q1: While the paper has moderately strong empirical results, the proposed method is not much different from bit-serial activation quantized neural networks [1], where the spike lanes can be regarded as bits sequentially arriving and accumulating at each neuron. The overhead of the time steps is thus shifted to the spike lanes
A1: Thank you for highlighting this relevant aspect of quantized activation artificial neural networks (ANNs). We appreciate your reference to PACT21 [1], which indeed provides valuable insights into accelerating quantized neural networks using a bit-serial approach for weights and activations on GPUs. However, it is important to note that PACT21 focuses on accelerating quantized ANNs without delving into techniques for improving quantization-aware training. More specifically, it does not explore this learning aspect with spiking neural networks (SNNs).
Our work centers on enhancing the accuracy and efficiency of SNNs, which are inherently distinct from ANNs due to their use of the leaky integrate-and-fire (LIF) neuron model and their applicability to neuromorphic tasks. Our primary contribution lies in the one-hot (power-of-two) encoding of activations during both training and inference of SNNs, necessitating modifications to the neuron model (M-LIF) and the surrogate gradients employed during training. The M-LIF model achieves higher accuracy than conventional SNNs in low-latency, memory-efficient regimes (unit timestep) while preserving the multiplication-free nature of SNNs. This stands in contrast to alternative multi-threshold approaches that utilize uniform activation quantization, leading to increased spike rates and the reintroduction of multiplications. Additionally, unlike the bit-serial processing approach highlighted in [1], the spike lanes in M-LIF SNNs can be processed in parallel with a single addition to the weight exponent, as detailed in Section 4 of our submission. This parallelization is possible due to the predetermined one-hot nature of the activations, ensuring that only one "bit-position" (spike lane) is non-zero at any given timestep.
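As a purely illustrative aside, the snippet below shows why scaling an FP32 weight by a power of two amounts to a small addition on its 8-bit exponent field. It demonstrates only the arithmetic identity behind the exponent update; it is not a description of our hardware mapping, and exponent overflow handling is omitted.

```python
# Illustration of the arithmetic identity only: adding the lane index k to
# the 8-bit exponent field of an IEEE-754 float equals multiplying by 2**k.
# Overflow/underflow of the exponent field is ignored in this sketch.
import struct

def scale_by_power_of_two(w: float, k: int) -> float:
    bits = struct.unpack("<I", struct.pack("<f", w))[0]
    exponent = (bits >> 23) & 0xFF                      # 8-bit exponent field
    bits = (bits & ~(0xFF << 23)) | (((exponent + k) & 0xFF) << 23)
    return struct.unpack("<f", struct.pack("<I", bits))[0]

print(scale_by_power_of_two(0.75, 2))   # 3.0  (== 0.75 * 2**2)
print(scale_by_power_of_two(-1.5, 1))   # -3.0 (== -1.5 * 2**1)
```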
References:
[1] Lam et al. "Precision Batching: Bitserial Decomposition for Efficient Neural Network Inference on GPUs." In PACT 2021.
[2] Yuhang Li et al., “Additive powers-of-two quantization: An efficient non-uniform discretization for neural networks.” In ICLR 2020.
[3] Shen et al., "Are Conventional SNNs Really Efficient? A Perspective from Network Quantization." In CVPR 2024.
[4] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[5] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[6] Yao et al., “Spike-driven transformer.” In NeurIPS 2023.
[7] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[8] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Q5: I see only one comparison with one-time-step SNNs in Table 1. It would be better to add more comparisons, including the results from [4].
A5: Thank you for highlighting the need for additional comparisons. In response, we have updated our submission to include comparisons with the results from [4] for iso-architecture SNNs in Table 1. It is worth noting that Table 2 in ICLR24 [4] provides only one comparison with a unit-timestep SNN, specifically referring to an older (2021) arXiv version of what later became ECCV22 [5]. In contrast, our paper compares against the most recent and up-to-date reference, which outperforms the results reported in ICLR24 [4] for the VGG16 model on ImageNet. Finally, our paper includes an evaluation of spike-driven transformers [6], the state-of-the-art architecture for SNNs, which is absent from the evaluations presented in ICLR24 [4]. By focusing on up-to-date and state-of-the-art comparisons, we aim to present a comprehensive and relevant evaluation of our proposed approach.
References:
[1] Lam et al. "Precision Batching: Bitserial Decomposition for Efficient Neural Network Inference on GPUs." In PACT 2021.
[2] Yuhang Li et al., “Additive powers-of-two quantization: An efficient non-uniform discretization for neural networks.” In ICLR 2020.
[3] Shen et al., "Are Conventional SNNs Really Efficient? A Perspective from Network Quantization." In CVPR 2024.
[4] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[5] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[6] Yao et al., “Spike-driven transformer.” In NeurIPS 2023.
[7] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[8] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
Thanks for your detailed response. However, I still think the idea of the paper is similar to recently proposed quantization works, as I mentioned above, and there are hardly any novel insights to support acceptance at a venue like ICLR. Apologies for not being able to increase the score.
Q4: Several prior SNN works [3-4] have explored the interplay between the ANN activation bit-width and SNN time steps (that forms the foundation of this work), however, they have been ignored in this work.
A4: Thank you for pointing out these prior works. We appreciate the opportunity to clarify how our approach builds upon and differs from the methods discussed in [3-4]. We have now added a paragraph in Section 2.5 of our submission to better contextualize these methods relative to our work.
The authors of [4] primarily focus on binary-activation ANNs, which are equivalent to unit-timestep SNNs [5] and which we have compared to in Table 1 of our submission. In contrast, [3] introduces uniform activation quantization in SNNs to preserve higher activation precision per timestep while offloading the burden of multi-timestep processing. However, this approach compromises the multiplication-free property of SNNs. Our proposed one-hot M-LIF neuron model diverges from these approaches by employing a one-hot (power-of-two) encoding of activations, which allows only one of the spike lanes to fire per timestep. This one-hot constraint is trained and maintained throughout training—a critical distinction not present in [3-4]. Moreover, the output activations of our neuron correspond to powers-of-two and are efficiently used to update the membrane potential by modifying FP32 weight exponents using a single INT8 addition, as opposed to requiring multiplications like the 4-bit uniformly quantized activations described in [3]. Regarding evaluations, CVPR24 [3] emphasizes joint quantization of both activations and weights for distinct neural network architectures from those in our study, making direct comparisons challenging. Weight quantization, as explored in [3], is orthogonal to our work and could complement one-hot M-LIF SNNs for potential further energy optimization. While ICLR24 [4] does not demonstrate the applicability of its method on spike-driven transformer architectures [6], which is the state-of-the-art SNN architecture backbone, we were able to draw comparisons using VGG16 on ImageNet. ICLR24 [4] uses a comparable computational energy model to the one used in our study and also provides memory energy metrics and spike rates, facilitating a comparative analysis. Since our submission already compares results using a 45nm technology baseline [8], we recalculated the energy consumption metrics of VGG16 from [4] using the same 45nm values from [8]. Additionally, one of ICLR24 [4]'s key contributions is reducing timesteps down to 1 without iterative temporal pruning, as employed in ECCV22 [5]. Similarly, our method achieves unit-timestep SNN training without temporal pruning while also offering a superior overall accuracy-energy tradeoff compared to standard ANNs as shown in the table below. One-hot M-LIF SNNs achieve energy efficiency that is lower than ANNs, while maintaining higher accuracy (up to ) than prior unit-timestep SNN works. Furthermore, we highlight the advantage of using unit-timestep processing in this table since we achieve higher accuracy compared to [7] while consuming less total energy, the majority of which stems from the memory energy overhead of multi-timestep () processing.
| Method | S | T | Accuracy (%) | Comp. Energy (mJ) | Mem. Energy (mJ) | Total Energy (mJ) |
|---|---|---|---|---|---|---|
| ANN | / | / | 72.56 | 71.2 | 781 | 852.2 |
| Diet-SNN [7] | 1 | 5 | 69 | 6.09 | 58.8 | 64.89 |
| Temporal Pruning [5] | 1 | 1 | 69 | 2.89 | 15.5 | 18.39 |
| BANN ICLR24 [4] | 1 | 1 | 68 | 3.4 | 17.1 | 20.5 |
| One-hot M-LIF SNN | 3 | 1 | 71.05 | 3.73 | 22.1 | 25.83 |
See references in previous comments
Q2: I understand there is one-hot encoding in the spike lanes, however, this contribution alone may not be sufficient to warrant acceptance in ICLR. This is also similar to power-of-two quantization [2] (https://openreview.net/pdf?id=BkgXT24tDS), that has been extensively explored in several quantization works
A2: Thank you for referencing ICLR20 [2]. We appreciate the opportunity to clarify the differences between our proposed approach and additive power-of-two quantization as described in the cited work. In ICLR20 [2], quantization levels are represented as sums of powers-of-two, which consequently introduces additional add operations during multiplication with inputs at these quantization levels. In contrast, our proposed one-hot M-LIF SNNs ensure only single (one-hot) power-of-twos inputs/outputs, maintaining a simpler and more efficient operation. Moreover, ICLR20 [2] does not explore the application of one-hot (power-of-two) encoded input/output representations within the domain of spiking neural networks (SNNs) or how these representations can enable improved accuracy-latency and energy trade-offs.
We have acknowledged prior work on power-of-two quantization in our submission, specifically in Sections 3.2 and 2.4, where we discuss log-quantized artificial neural networks (LQ-ANNs). Our comparisons illustrate key distinctions between LQ-ANNs and one-hot M-LIF SNNs, particularly in terms of training approaches and applicability to neuromorphic tasks such as dynamic vision classification. These differences include: (1) during inference, neuron dynamics differ when the time step T > 1 due to the extensibility of M-LIF neurons to neuromorphic tasks, and (2) during training, we leverage back-propagation through time for T > 1 while also learning threshold and leakage parameters for static tasks.
To the best of our knowledge, this work is the first to introduce a one-hot encoding scheme in SNNs. The one-hot M-LIF neuron we propose retains the multiplication-free property characteristic of SNNs, as it modifies the exponents of FP32 weights using a single INT8 addition during membrane potential updates. Moreover, due to the one-hot constraint, the overall spiking rate (and consequently the number of additions) per layer per timestep in one-hot M-LIF SNNs is not necessarily higher than that of conventional SNNs, even with multiple spike lanes per neuron.

This allows one-hot M-LIF SNNs to extract greater learning potential within a single timestep while maintaining comparable computational complexity and energy efficiency relative to traditional SNNs. This is evident in Table 1 of our manuscript, where one-hot M-LIF spike-driven transformers preserve accuracy better than traditional spike-driven transformers as the number of timesteps is reduced. Integrating the one-hot constraint is nontrivial, as it necessitates modifications to both the neuron model and the surrogate gradients. We have demonstrated its efficacy through extensive evaluations across various neural network architectures and datasets, including both static and dynamic tasks, as detailed in our submission.
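To convey the firing rule informally, the sketch below implements a single timestep of a multi-level neuron in which one threshold and its power-of-two multiples select at most one spike lane per neuron. The leak value, soft-reset rule, and tensor shapes are assumptions made for the example; they do not reproduce the exact dynamics or training behaviour of our model.

```python
# Minimal sketch of a one-hot multi-level firing rule: a single threshold
# theta and its power-of-two multiples choose at most one spike lane per
# neuron per timestep. Leak, reset, and shapes are placeholder assumptions.
import torch

def one_hot_mlif_step(membrane, inp, theta=1.0, leak=0.9, num_lanes=3):
    membrane = leak * membrane + inp                           # leaky integration
    levels = theta * (2.0 ** torch.arange(num_lanes))          # theta, 2*theta, 4*theta, ...
    crossed = (membrane.unsqueeze(-1) >= levels).sum(dim=-1)   # highest level crossed
    out = torch.where(crossed > 0, 2.0 ** (crossed - 1), torch.zeros_like(membrane))
    membrane = membrane - theta * out                          # soft reset by emitted value
    return membrane, out                                       # out in {0, 1, 2, ..., 2**(S-1)}

u = torch.zeros(5)
x = torch.tensor([0.3, 1.2, 2.6, 5.0, 0.0])
u, s = one_hot_mlif_step(u, x)
print(s)   # tensor([0., 1., 2., 4., 0.])
```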
References:
[1] Lam et al. "Precision Batching: Bitserial Decomposition for Efficient Neural Network Inference on GPUs." In PACT 2021.
[2] Yuhang Li et al., “Additive powers-of-two quantization: An efficient non-uniform discretization for neural networks.” In ICLR 2020.
[3] Shen et al., "Are Conventional SNNs Really Efficient? A Perspective from Network Quantization." In CVPR 2024.
[4] Datta et al., "Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision?" In ICLR 2024.
[5] Chowdhury et al., “Towards ultra low latency spiking neural networks for vision and sequential tasks using temporal pruning.” In ECCV 2022.
[6] Yao et al., “Spike-driven transformer.” In NeurIPS 2023.
[7] N. Rathi et al., "DIET-SNN: A Low-Latency Spiking Neural Network With Direct Input Encoding and Leakage and Threshold Optimization." In TNNLS 2023.
[8] Horowitz, “1.1 computing’s energy problem (and what we can do about it).” In ISSCC 2014.
We once again thank all reviewers for their commitment and valuable feedback. The reviewers' comments provided great insights to improve the manuscript. We were delighted that the reviewers appreciated the strength and significance of our unit-timestep one-hot M-LIF SNN ImageNet energy-efficiency and accuracy results (bytk, 7r4n), the comprehensive and thorough experimental validation alongside valuable source code (NG6F, egRh, SBrN), the innovative one-hot spike lane concept for enhanced accuracy-latency tradeoff (NG6F, egRh), and the organization and clarity of the manuscript (egRh, SBrN).
Revisions to the uploaded manuscript are highlighted in red. We summarize the major modifications as follows:
- (bytk, NG6F, egRh, 7r4n, SBrN) Discussion on uniqueness with respect to other neuron-models (multi-spike, multi-threshold, burst) [Section 2.5]
- (bytk) Comparison with additional unit-timestep SNN work [Table 1]
- (egRh) Ablation studies on DVS-CIFAR10 for accuracy with respect to varying number of spike lanes and timesteps [Appendix A.4]
- (bytk, 7r4n) Impact of memory access energy on VGG16 ImageNet evaluations [Appendix A.5]
We would like to express our gratitude to the reviewers for their thoughtful feedback and the time invested in evaluating our work. During the rebuttal process, we addressed the primary concerns raised, including providing additional comparisons to prior works, memory energy estimations, and ablation studies analyzing the impact of varying spike lanes and timesteps on accuracy.
Despite these efforts, the reviewers remain divided on the paper's significance. In their view, the proposed neuron model, which leverages one-hot spike lanes to reduce the number of timesteps T during SNN inference while improving accuracy and maintaining the low spike rates of traditional SNNs (an approach not explored in prior works), does not meet ICLR's standards. While we believe that our results, which outperform recent works published at leading venues (e.g., ECCV22, ICLR22, ICLR24) in terms of accuracy and energy tradeoffs, demonstrate the effectiveness and impact of our approach, we respect the reviewers' perspectives.
Given the current scores and the likelihood of rejection, we have decided to withdraw the paper from further consideration.