Han Bao
~Han_Bao4
4
论文总数
4.0
年均投稿
平均评分
接收情况1/4
会议分布
ICLR
3
ICML
1
发表论文 (4 篇)
20254 篇
5
Beyond 2:4: Exploring V:N:M Sparsity for Efficient Transformer Inference on GPUs
ICLR 2025Rejected
5
FlatQuant: Flatness Matters for LLM Quantization
ICLR 2025Rejected
4
FlatQuant: Flatness Matters for LLM Quantization
ICML 2025Poster
3
FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs for Efficient Inference
ICLR 2025Rejected