Haofeng Huang
~Haofeng_Huang3
Total papers: 8
Average submissions per year: 8.0
Average rating
Accepted: 7/8
Venue distribution
ICML: 3
ICLR: 2
NeurIPS: 2
COLM: 1
Published papers (8)
2025: 8 papers
3
Mixture of Attention Spans: Optimizing LLM Inference Efficiency with Heterogeneous Sliding-Window Lengths
COLM 2025 (Poster)
4
MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression
ICLR 2025 (Rejected)
5
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
ICML 2025 (Poster)
4
SpargeAttention: Accurate and Training-free Sparse Attention Accelerating Any Model Inference
ICML 2025 (Poster)
3
Faster Video Diffusion with Trainable Sparse Attention
NeurIPS 2025 (Poster)
4
XAttention: Block Sparse Attention with Antidiagonal Scoring
ICML 2025 (Poster)
3
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
ICLR 2025 (Poster)
4
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training
NeurIPS 2025 (Spotlight)