Kang Zhao
~Kang_Zhao5
7
Total papers
7.0
Submissions per year
Average rating
Accepted: 2/7
Venue distribution
ICLR
5
ICML
1
NeurIPS
1
Papers (7)
2025: 7 papers
5
Beyond 2:4: Exploring V:N:M Sparsity for Efficient Transformer Inference on GPUs
ICLR 2025 (Rejected)
4
1-Bit FQT: Pushing the Limit of Fully Quantized Training to 1-bit
ICLR 2025 (Rejected)
3
Zero-shot Quantization for Object Detection
ICLR 2025 (Rejected)
3
FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs for Efficient Inference
ICLR 2025 (Rejected)
4
FlatQuant: Flatness Matters for LLM Quantization
ICML 2025 (Poster)
5
FlatQuant: Flatness Matters for LLM Quantization
ICLR 2025 (Rejected)
4
A Simple Linear Patch Revives Layer-Pruned Large Language Models
NeurIPS 2025 (Poster)